Long- and short-range interactions in native protein structures are consistent/minimally frustrated in sequence space

被引:17
作者
Miyazawa, S [1 ]
Jernigan, RL
机构
[1] Gunma Univ, Fac Technol, Gunma 3768515, Japan
[2] NCI, Lab Expt & Computat Biol, CCR, NIH, Bethesda, MD 20892 USA
[3] Iowa State Univ, Laurence H Baker Ctr Bioinformat & Biol Stat, Inst Plant Sci, Ames, IA USA
来源
PROTEINS-STRUCTURE FUNCTION AND GENETICS | 2003年 / 50卷 / 01期
关键词
empirical potentials; inverse protein folding; protein folding; protein sequence design; protein sequence-structure compatibility;
D O I
10.1002/prot.10242
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We show that long- and short-range interactions in almost all protein native structures are actually consistent with each other for coarse-grained energy scales; specifically we mean the long-range inter-residue contact energies and the short-range secondary structure energies based on peptide dihedral angles, which are potentials of mean force evaluated from residue distributions observed in protein native structures. This consistency is observed at equilibrium in sequence space rather than in conformational. space. Statistical ensembles of sequences are generated by exchanging residues for each of 797 protein native structures with the Metropolis method. It is shown that adding the other category of interaction to either the short- or long-range interactions decreases the means and variances of those energies for essentially all protein native structures, indicating that both interactions consistently work by more-or-less restricting sequence spaces available to one of the interactions. In addition to this consistency, independence by these interaction classes is also indicated by the fact that there are almost no correlations between them when equilibrated using both interactions and significant but small, positive correlations at equilibrium using only one of the interactions. Evidence is provided that protein native sequences can be regarded approximately as samples from the statistical ensembles of sequences with these energy scales and that all proteins have the same effective conformational temperature. Designing protein structures and sequences to be consistent and minimally frustrated among the various interactions is a most effective way to increase protein stability and foldability.
引用
收藏
页码:35 / 43
页数:9
相关论文
共 27 条
[1]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[2]   FUNNELS, PATHWAYS, AND THE ENERGY LANDSCAPE OF PROTEIN-FOLDING - A SYNTHESIS [J].
BRYNGELSON, JD ;
ONUCHIC, JN ;
SOCCI, ND ;
WOLYNES, PG .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 1995, 21 (03) :167-195
[3]   SPIN-GLASSES AND THE STATISTICAL-MECHANICS OF PROTEIN FOLDING [J].
BRYNGELSON, JD ;
WOLYNES, PG .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (21) :7524-7528
[4]   New algorithm for protein design [J].
Deutsch, JM ;
Kurosky, T .
PHYSICAL REVIEW LETTERS, 1996, 76 (02) :323-326
[5]   A METHOD TO CONFIGURE PROTEIN SIDE-CHAINS FROM THE MAIN-CHAIN TRACE IN HOMOLOGY MODELING [J].
EISENMENGER, F ;
ARGOS, P ;
ABAGYAN, R .
JOURNAL OF MOLECULAR BIOLOGY, 1993, 231 (03) :849-860
[6]   WHY DO PROTEIN ARCHITECTURES HAVE BOLTZMANN-LIKE STATISTICS [J].
FINKELSTEIN, AV ;
BADRETDINOV, AY ;
GUTIN, AM .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 23 (02) :142-150
[7]   THEORETICAL-STUDIES OF PROTEIN FOLDING [J].
GO, N .
ANNUAL REVIEW OF BIOPHYSICS AND BIOENGINEERING, 1983, 12 :183-210
[8]   EQUATION OF STATE CALCULATIONS BY FAST COMPUTING MACHINES [J].
METROPOLIS, N ;
ROSENBLUTH, AW ;
ROSENBLUTH, MN ;
TELLER, AH ;
TELLER, E .
JOURNAL OF CHEMICAL PHYSICS, 1953, 21 (06) :1087-1092
[9]   Universality and diversity of the protein folding scenarios: A comprehensive analysis with the aid of a lattice model [J].
Mirny, LA ;
Abkevich, V ;
Shakhnovich, EI .
FOLDING & DESIGN, 1996, 1 (02) :103-116
[10]  
Miyazawa S, 1999, PROTEINS, V36, P347, DOI 10.1002/(SICI)1097-0134(19990815)36:3<347::AID-PROT9>3.0.CO