Modeling structurally variable regions in homologous proteins with rosetta

被引:256
作者
Rohl, CA [1 ]
Strauss, CEM
Chivian, D
Baker, D
机构
[1] Univ Calif Santa Cruz, Dept Biomol Engn, Santa Cruz, CA 95064 USA
[2] Los Alamos Natl Lab, Biosci Div, Los Alamos, NM USA
[3] Univ Washington, Dept Biochem, Seattle, WA 98195 USA
[4] Univ Washington, Howard Hughes Med Inst, Seattle, WA 98195 USA
关键词
comparative protein structure modeling; homology modeling; fragment assembly; CASP; loop modeling; structurally variable region;
D O I
10.1002/prot.10629
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
A major limitation of current comparative modeling methods is the accuracy with which regions that are structurally divergent from homologues of known structure can be modeled. Because structural differences between homologous proteins are responsible for variations in protein function and specificity, the ability to model these differences has important functional consequences. Although existing methods can provide reasonably accurate models of short loop regions, modeling longer structurally divergent regions is an unsolved problem. Here we describe a method based on the de novo structure prediction algorithm, Rosetta, for predicting conformations of struc-turally divergent regions in comparative models. Initial conformations for short segments are selected from the protein structure database, whereas longer segments are built up by using three- and nine-residue fragments drawn from the database and combined by using the Rosetta algorithm. A gap closure term in the potential in combination with modified Newton's method for gradient descent minimization is used to ensure continuity of the peptide backbone. Conformations of variable regions are refined in the context of a fixed template structure using Monte Carlo minimization together with rapid repacking of side-chains to iteratively optimize backbone torsion angles and side-chain rotamers. For short loops, mean accuracies of 0.69, 1.45, and 3.62 Angstrom are obtained for 4, 8, and 12 residue loops, respectively. In addition, the method can provide reasonable models of conformations of longer protein segments: predicted conformations of 3Angstrom root-mean-square deviation or better were obtained for 5 of 10 examples of segments ranging from 13 to 34 residues. In combination with a sequence alignment algorithm, this method generates complete, un-gapped models of protein structures, including regions both similar to and divergent from a homologous structure. This combined method was used to make predictions for 28 protein domains in the Critical Assessment of Protein Structure 4 WASP 4) and 59 domains in CASP 5, where the method ranked highly among comparative modeling and fold recognition methods. Model accuracy in these blind predictions is dominated by alignment quality, but in the context of accurate alignments, long protein segments can be accurately modeled. Notably, the method correctly predicted the local structure of a 39-residue insertion into a TIM barrel in CASP 5 target T0186. (C) 2004 Wiley-Liss, Inc.
引用
收藏
页码:656 / 677
页数:22
相关论文
共 55 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
Bonneau R, 2001, PROTEINS, P119
[3]   De novo protein structure determination using sparse NMR data [J].
Bowers, PM ;
Strauss, CEM ;
Baker, D .
JOURNAL OF BIOMOLECULAR NMR, 2000, 18 (04) :311-318
[4]  
BRADLEY P, 2003, IN PRESS PROTEINS
[5]   PREDICTION OF THE FOLDING OF SHORT POLYPEPTIDE SEGMENTS BY UNIFORM CONFORMATIONAL SAMPLING [J].
BRUCCOLERI, RE ;
KARPLUS, M .
BIOPOLYMERS, 1987, 26 (01) :137-168
[6]   CONFORMATIONAL SAMPLING USING HIGH-TEMPERATURE MOLECULAR-DYNAMICS [J].
BRUCCOLERI, RE ;
KARPLUS, M .
BIOPOLYMERS, 1990, 29 (14) :1847-1862
[7]   Improved protein loop prediction from sequence alone [J].
Burke, DF ;
Deane, CM .
PROTEIN ENGINEERING, 2001, 14 (07) :473-478
[8]  
CHIVIAN D, 2003, IN PRESS AUTOMATED P
[9]   Ab initio construction of polypeptide fragments: Accuracy of loop decoy discrimination by an all-atom statistical potential and the AMBER force field with the generalized born solvation model [J].
de Bakker, PIW ;
DePristo, MA ;
Burke, DF ;
Blundell, TL .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 51 (01) :21-40
[10]   CODA: A combined algorithm for predicting the structurally variable regions of protein models [J].
Deane, CM ;
Blundell, TL .
PROTEIN SCIENCE, 2001, 10 (03) :599-612