SimFold energy function for de novo protein structure prediction: Consensus with Rosetta

被引:42
作者
Fujitsuka, Y
Chikenji, G
Takada, S
机构
[1] Kobe Univ, Fac Sci, Dept Chem, Kobe, Hyogo 6578501, Japan
[2] Kobe Univ, Grad Sch Sci & Technol, Dept Chem, Kobe, Hyogo 6578501, Japan
[3] CREST JST, Kobe, Hyogo, Japan
关键词
SimFold energy function; protein tertiary structure; fragment assembly; structure prediction; consensus prediction;
D O I
10.1002/prot.20748
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Predicting protein tertiary structures by in silico folding is still very difficult for proteins that have new folds. Here, we developed a coarse-grained energy function, SimFold, for de novo structure prediction, performed a benchmark test of prediction with fragment assembly simulations for 38 test proteins, and proposed consensus prediction with Rosetta. The SimFold energy consists of many terms that take into account solvent-induced effects on the basis of physicochemical consideration. In the benchmark test, SimFold succeeded in predicting native structures within 6.5 angstrom for 12 of 38 proteins; this success rate was the same as that by the publicly available version of Rosetta (ab initio version 1.2) run with default parameters. We investigated which energy terms in SimFold contribute to structure prediction performance, finding that the hydrophobic interaction is the most crucial for the prediction, whereas other sequence-specific terms have weak but positive roles. In the benchmark, well-predicted proteins by SimFold and by Rosetta were not the same for 5 of 12 proteins, which led us to introduce consensus prediction. With combined decoys, we succeeded in prediction for 16 proteins, four more than SimFold or Rosetta separately. For each of 38 proteins, structural ensembles generated by SimFold and by Rosetta were qualitatively compared by mapping sampled structural space onto two dimensions. For proteins of which one of the two methods succeeded and the other failed in prediction, the former had a less scattered ensemble located around the native. For proteins of which both methods succeeded in prediction, often two ensembles were mixed up.
引用
收藏
页码:381 / 398
页数:18
相关论文
共 49 条
[1]   Predictions without templates: New folds, secondary structure, and contacts in CASP5 [J].
Aloy, P ;
Stark, A ;
Hadley, S ;
Russell, RB .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 53 :436-456
[2]   PRINCIPLES THAT GOVERN FOLDING OF PROTEIN CHAINS [J].
ANFINSEN, CB .
SCIENCE, 1973, 181 (4096) :223-230
[3]   Protein structure prediction and structural genomics [J].
Baker, D ;
Sali, A .
SCIENCE, 2001, 294 (5540) :93-96
[4]   De novo prediction of three-dimensional structures for major protein families [J].
Bonneau, R ;
Strauss, CEM ;
Rohl, CA ;
Chivian, D ;
Bradley, P ;
Malmström, L ;
Robertson, T ;
Baker, D .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 322 (01) :65-78
[5]  
Bonneau R, 2001, PROTEINS, P119
[6]   Rosetta predictions in CASP5: Successes, failures, and prospects for complete automation [J].
Bradley, P ;
Chivian, D ;
Meiler, J ;
Misura, KMS ;
Rohl, CA ;
Schief, WR ;
Wedemeyer, WJ ;
Schueler-Furman, O ;
Murphy, P ;
Schonbrun, J ;
Strauss, CEM ;
Baker, D .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 53 :457-468
[7]   SPIN-GLASSES AND THE STATISTICAL-MECHANICS OF PROTEIN FOLDING [J].
BRYNGELSON, JD ;
WOLYNES, PG .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (21) :7524-7528
[8]   Structure prediction meta server [J].
Bujnicki, JM ;
Elofsson, A ;
Fischer, D ;
Rychlewski, L .
BIOINFORMATICS, 2001, 17 (08) :750-751
[9]   A reversible fragment assembly method for de novo protein structure prediction [J].
Chikenji, G ;
Fujitsuka, Y ;
Takada, S .
JOURNAL OF CHEMICAL PHYSICS, 2003, 119 (13) :6895-6903
[10]  
Delano WL., 2002, The PyMOL Molecular Graphics System