From fold recognition to homology modeling: an analysis of protein modeling challenges at different levels of prediction complexity

被引:7
作者
Olszewski, KA [1 ]
Yan, L [1 ]
Edwards, D [1 ]
Yeh, T [1 ]
机构
[1] Mol Simulat, San Diego, CA 92121 USA
来源
COMPUTERS & CHEMISTRY | 2000年 / 24卷 / 3-4期
关键词
homology modeling; fold recognition; CASP3;
D O I
10.1016/S0097-8485(99)00078-9
中图分类号
O6 [化学];
学科分类号
0703 ;
摘要
An analysis of different approaches to protein structure prediction is presented based solely on the range of models submitted to the third Critical Assessment of Protein Structure Prediction (CASP3) conference. CASP conferences evaluate the current state of the art of protein structure prediction by comparing blind prediction efforts of many groups for the same set of target sequences. Target sequences may be highly similar to those with known structure or can be totally (at least superficially) sequentially dissimilar. Techniques applied to those blind predictions lover 40 targets) ranges from a detailed homology prediction to the detection of remote homologues well below a twilight zone of protein sequence similarity. For the CASP3 conference, we have submitted predictions, totaling 35, with various levels of difficulty and complexity. For ten submitted homology targets, eight of them were determined by experiment so far. The RMSD of C-alpha atoms are 1.2-1.7, 2.3, and 4.6-17.9 Angstrom for the three easy targets, two hard targets, and three very hard homology targets, respectively. Out of 18-fold recognition predictions available for analysis, we got six correct predictions, five near misses, three tough near misses and four far misses. Here we analyze successes and failures of those predictions in an attempt to identify common problems and common achievements. (C) 2000 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:499 / 510
页数:12
相关论文
共 41 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
ALTSCHUL SF, 1990, J MOL BIOL, V215, P403, DOI 10.1006/jmbi.1990.9999
[3]   Pfam 3.1: 1313 multiple alignments and profile HMMs match the majority of proteins [J].
Bateman, A ;
Birney, E ;
Durbin, R ;
Eddy, SR ;
Finn, RD ;
Sonnhammer, ELL .
NUCLEIC ACIDS RESEARCH, 1999, 27 (01) :260-262
[4]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[5]   A METHOD TO IDENTIFY PROTEIN SEQUENCES THAT FOLD INTO A KNOWN 3-DIMENSIONAL STRUCTURE [J].
BOWIE, JU ;
LUTHY, R ;
EISENBERG, D .
SCIENCE, 1991, 253 (5016) :164-170
[6]  
Di Francesco V, 1997, PROTEINS, P123
[7]   Hidden Markov models [J].
Eddy, SR .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1996, 6 (03) :361-365
[8]   Assigning folds to the proteins encoded by the genome of Mycoplasma genitalium [J].
Fischer, D ;
Eisenberg, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (22) :11929-11934
[9]   SEQUENCE STRUCTURE MATCHING IN GLOBULAR-PROTEINS - APPLICATION TO SUPERSECONDARY AND TERTIARY STRUCTURE DETERMINATION [J].
GODZIK, A ;
SKOLNICK, J .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (24) :12098-12102
[10]   PROFILE ANALYSIS - DETECTION OF DISTANTLY RELATED PROTEINS [J].
GRIBSKOV, M ;
MCLACHLAN, AD ;
EISENBERG, D .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1987, 84 (13) :4355-4358