Statistical theory for protein combinatorial libraries. Packing interactions, backbone flexibility, and the sequence variability of a main-chain structure

Cited by: 102
Authors
Kono, H [1]
Saven, JG [1]
Affiliations
[1] Univ Penn, Dept Chem, Philadelphia, PA 19104 USA
Funding
National Science Foundation (USA)
Keywords
protein design; combinatorial library; sequence variability; profile; protein L;
DOI
10.1006/jmbi.2000.4422
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
Combinatorial experiments provide new ways to probe the determinants of protein folding and to identify novel folding amino acid sequences. These types of experiments, however, are complicated both by enormous conformational complexity and by large numbers of possible sequences. Therefore, a quantitative computational theory would be helpful in designing and interpreting such experiments. Here, we present and apply a statistically based, computational approach for identifying the properties of sequences compatible with a given main-chain structure. Protein side-chain conformations are included in an atom-based fashion. Calculations are performed for a variety of similar backbone structures to identify sequence properties that are robust with respect to minor changes in main-chain structure. Rather than specific sequences, the method yields the likelihood of each of the amino acids at preselected positions in a given protein structure. The theory may be used to quantify the characteristics of sequence space for a chosen structure without explicitly tabulating sequences. To account for hydrophobic effects, we introduce an environmental energy that is consistent with other simple hydrophobicity scales and show that it is effective for side-chain modeling. We apply the method to calculate the identity probabilities of selected positions of the immunoglobulin light chain-binding domain of protein L, for which many variant folding sequences are available. The calculations compare favorably with the experimentally observed identity probabilities. (C) 2001 Academic Press.
Pages: 607-628
Page count: 22
Related papers
123 items in total
[1]   Improved design of stable and fast-folding model proteins [J].
Abkevich, VI ;
Gutin, AM ;
Shakhnovich, EI .
FOLDING & DESIGN, 1996, 1 (03) :221-230
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   Active barnase variants with completely random hydrophobic cores [J].
Axe, DD ;
Foster, NW ;
Fersht, AR .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (11) :5590-5594
[4]   Engineering and design - Editorial overview [J].
Baker, D ;
DeGrado, WF .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1999, 9 (04) :485-486
[5]   THE ROLE OF BACKBONE FLEXIBILITY IN THE ACCOMMODATION OF VARIANTS THAT REPACK THE CORE OF T4-LYSOZYME [J].
BALDWIN, EP ;
HAJISEYEDJAVADI, O ;
BAASE, WA ;
MATTHEWS, BW .
SCIENCE, 1993, 262 (5140) :1715-1718
[6]   Barker WC, 1996, METHOD ENZYMOL, V266, P59
[7]   DE-NOVO PROTEIN DESIGN - FROM MOLTEN GLOBULES TO NATIVE-LIKE STATES [J].
BETZ, SF ;
RALEIGH, DP ;
DEGRADO, WF .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1993, 3 (04) :601-610
[8]   Prediction of protein side-chain rotamers from a backbone-dependent rotamer library: A new homology modeling tool [J].
Bower, MJ ;
Cohen, FE ;
Dunbrack, RL .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 267 (05) :1268-1282
[9]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[10]   SPIN-GLASSES AND THE STATISTICAL-MECHANICS OF PROTEIN FOLDING [J].
BRYNGELSON, JD ;
WOLYNES, PG .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (21) :7524-7528