Emergence of protein fold families through rational design

被引:166
作者
Ding, Feng [1 ]
Dokholyan, Nikolay V. [1 ]
机构
[1] Univ N Carolina, Dept Biochem & Biophys, Chapel Hill, NC USA
关键词
D O I
10.1371/journal.pcbi.0020085
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Diverse proteins with similar structures are grouped into families of homologs and analogs, if their sequence similarity is higher or lower, respectively, than 20%-30%. It was suggested that protein homologs and analogs originate from a common ancestor and diverge in their distinct evolutionary time scales, emerging as a consequence of the physical properties of the protein sequence space. Although a number of studies have determined key signatures of protein family organization, the sequence-structure factors that differentiate the two evolution-related protein families remain unknown. Here, we stipulate that subtle structural changes, which appear due to accumulating mutations in the homologous families, lead to distinct packing of the protein core and, thus, novel compositions of core residues. The latter process leads to the formation of distinct families of homologs. We propose that such differentiation results in the formation of analogous families. To test our postulate, we developed a molecular modeling and design toolkit, Medusa, to computationally design protein sequences that correspond to the same fold family. We find that analogous proteins emerge when a backbone structure deviates only 1-2 angstrom root-mean-square deviation from the original structure. For close homologs, core residues are highly conserved. However, when the overall sequence similarity drops to; 25%-30%, the composition of core residues starts to diverge, thereby forming novel families of protein homologs. This direct observation of the formation of protein homologs within a specific fold family supports our hypothesis. The conservation of amino acids in designed sequences recapitulates that of the naturally occurring sequences, thereby validating our computational design methodology.
引用
收藏
页码:725 / 733
页数:9
相关论文
共 59 条
[1]   RAPID CALCULATION OF 1ST AND 2ND DERIVATIVES OF CONFORMATIONAL ENERGY WITH RESPECT TO DIHEDRAL ANGLES FOR PROTEINS - GENERAL RECURRENT EQUATIONS [J].
ABE, H ;
BRAUN, W ;
NOGUTI, T ;
GO, N .
COMPUTERS & CHEMISTRY, 1984, 8 (04) :239-247
[2]   Protein evolution - How far can sequences diverge? [J].
Chothia, C ;
Gerstein, M .
NATURE, 1997, 385 (6617) :579-&
[3]   PROTEINS - 1000 FAMILIES FOR THE MOLECULAR BIOLOGIST [J].
CHOTHIA, C .
NATURE, 1992, 357 (6379) :543-544
[4]   Probing the role of packing specificity in protein design [J].
Dahiyat, BI ;
Mayo, SL .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (19) :10172-10177
[5]   De novo protein design: Fully automated sequence selection [J].
Dahiyat, BI ;
Mayo, SL .
SCIENCE, 1997, 278 (5335) :82-87
[6]   DE-NOVO DESIGN OF THE HYDROPHOBIC CORES OF PROTEINS [J].
DESJARLAIS, JR ;
HANDEL, TM .
PROTEIN SCIENCE, 1995, 4 (10) :2006-2018
[7]   NEW STRATEGIES IN PROTEIN DESIGN [J].
DESJARLAIS, JR ;
HANDEL, TM .
CURRENT OPINION IN BIOTECHNOLOGY, 1995, 6 (04) :460-466
[8]   Simple but predictive protein models [J].
Ding, F ;
Dokholyan, NV .
TRENDS IN BIOTECHNOLOGY, 2005, 23 (09) :450-455
[9]   Direct molecular dynamics observation of protein folding transition state ensemble [J].
Ding, F ;
Dokholyan, NV ;
Buldyrev, SV ;
Stanley, HE ;
Shakhnovich, EI .
BIOPHYSICAL JOURNAL, 2002, 83 (06) :3525-3532
[10]   Discrete molecular dynamics studies of the folding of a protein-like model [J].
Dokholyan, NV ;
Buldyrev, SV ;
Stanley, HE ;
Shakhnovich, EI .
FOLDING & DESIGN, 1998, 3 (06) :577-587