Ab initio protein structure prediction to a genomic scale:: Application to the Mycoplasma genitalium genome

被引:27
作者
Kihara, D
Zhang, Y
Lu, H
Kolinski, A
Skolnick, J
机构
[1] Donald Danforth Plant Sci Ctr, Lab Computat Genomics, St Louis, MO 63132 USA
[2] Warsaw Univ, Fac Chem, PL-02093 Warsaw, Poland
关键词
D O I
10.1073/pnas.092135699
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
An ab initio protein structure prediction procedure, TOUCHSTONE, was applied to all 85 small proteins of the Mycoplasma genitalium genome. TOUCHSTONE is based on a Monte Carlo refinement of a lattice model of proteins, which uses threading-based tertiary restraints. Such restraints are derived by extracting consensus contacts and local secondary structure from at least weakly scoring structures that, in some cases, can lack any global similarity to the sequence of interest. Selection of the native fold was done by using the convergence of the simulation from two different conformational search schemes and the lowest energy structure by a knowledge-based atomic-detailed potential. Among the 85 proteins, for 34 proteins with significant threading hits, the template structures were reasonably well reproduced. Of the remaining 51 proteins, 29 proteins converged to five or fewer clusters. In the test set, 84.8% of the proteins that converged to five or fewer clusters had a correct fold among the clusters. if this statistic is simply applied, 24 proteins (84.8% of the 29 proteins) may have correct folds. Thus, the topology of a total of 58 proteins probably has been correctly predicted. Based on these results, ab initio protein structure prediction is becoming a practical approach.
引用
收藏
页码:5993 / 5998
页数:6
相关论文
共 34 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
Bates PA, 2001, PROTEINS, P39
[3]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[4]  
Betancourt MR, 2001, J COMPUT CHEM, V22, P339, DOI 10.1002/1096-987X(200102)22:3<339::AID-JCC1006>3.0.CO
[5]  
2-R
[6]  
Betancourt MR, 2001, BIOPOLYMERS, V59, P305, DOI 10.1002/1097-0282(20011015)59:5<305::AID-BIP1027>3.3.CO
[7]  
2-Y
[8]   An overview of structural genomics [J].
Burley, SK .
NATURE STRUCTURAL BIOLOGY, 2000, 7 (Suppl 11) :932-934
[9]   Functional analysis of the Escherichia coli genome using the sequence-to-structure-to-function paradigm:: Identification of proteins exhibiting the glutaredoxin/thioredoxin disulfide oxidoreductase activity [J].
Fetrow, JS ;
Godzik, A ;
Skolnick, J .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 282 (04) :703-711
[10]   Assigning folds to the proteins encoded by the genome of Mycoplasma genitalium [J].
Fischer, D ;
Eisenberg, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (22) :11929-11934