Native protein sequences are close to optimal for their structures

被引:646
作者
Kuhlman, B
Baker, D [1 ]
机构
[1] Univ Washington, Sch Med, Dept Biochem, Seattle, WA 98195 USA
[2] Univ Washington, Sch Med, Howard Hughes Med Inst, Seattle, WA 98195 USA
关键词
D O I
10.1073/pnas.97.19.10383
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
How large is the volume of sequence space that is compatible with a given protein structure? starting from random sequences, low free energy sequences were generated for 108 protein backbone structures by using a Monte Carlo optimization procedure and a free energy function based primarily on Lennard-Jones packing interactions and the Lazaridis-Karplus implicit solvation model. Remarkably, in the designed sequences 51% of the core residues and 27% of all residues were identical to the amino acids in the corresponding positions in the native sequences. The lowest free energy sequences obtained for ensembles of native-like backbone structures were also similar to the native sequence. Furthermore. both the individual residue frequencies and the covariances between pairs of positions observed in the very large SH3 domain family were recapitulated in core sequences designed for SH3 domain structures. Taken together, these results suggest that the volume of sequence space optimal for a protein structure is surprisingly restricted to a region around the native sequence.
引用
收藏
页码:10383 / 10388
页数:6
相关论文
共 24 条
[1]   De novo protein design: Fully automated sequence selection [J].
Dahiyat, BI ;
Mayo, SL .
SCIENCE, 1997, 278 (5335) :82-87
[2]   DE-NOVO DESIGN OF THE HYDROPHOBIC CORES OF PROTEINS [J].
DESJARLAIS, JR ;
HANDEL, TM .
PROTEIN SCIENCE, 1995, 4 (10) :2006-2018
[3]   THE DEAD-END ELIMINATION THEOREM AND ITS USE IN PROTEIN SIDE-CHAIN POSITIONING [J].
DESMET, J ;
DEMAEYER, M ;
HAZES, B ;
LASTERS, I .
NATURE, 1992, 356 (6369) :539-542
[4]   The HSSP database of protein structure sequence alignments and family profiles [J].
Dodge, C ;
Schneider, R ;
Sander, C .
NUCLEIC ACIDS RESEARCH, 1998, 26 (01) :313-315
[5]   Bayesian statistical analysis of protein side-chain rotamer preferences [J].
Dunbrack, RL ;
Cohen, FE .
PROTEIN SCIENCE, 1997, 6 (08) :1661-1681
[6]  
Gordon DB, 1998, J COMPUT CHEM, V19, P1505, DOI 10.1002/(SICI)1096-987X(199810)19:13<1505::AID-JCC7>3.0.CO
[7]  
2-U
[8]   Energy functions for protein design [J].
Gordon, DB ;
Marshall, SA ;
Mayo, SL .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1999, 9 (04) :509-513
[9]   High-resolution protein design with backbone freedom [J].
Harbury, PB ;
Plecs, JJ ;
Tidor, B ;
Alber, T ;
Kim, PS .
SCIENCE, 1998, 282 (5393) :1462-1467
[10]   De novo protein design.: II.: Plasticity in sequence space [J].
Koehl, P ;
Levitt, M .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 293 (05) :1183-1193