Comparative modeling in CASP5: Progress is evident, but alignment errors remain a significant hindrance

被引:44
作者
Venclovas, C [1 ]
机构
[1] Lawrence Livermore Natl Lab, Biol & Biotechnol Res Program, Livermore, CA 94551 USA
关键词
protein structure prediction; sequence-structure alignment; 3D model; model evaluation; distant homology; alignment errors;
D O I
10.1002/prot.10591
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Models for 20 comparative modeling targets were submitted for the fifth round of the "blind" test of protein structure prediction methods (CASP5; http://predictioncenter.llnl.gov/casp5). The modeling approach used in CASP5 was similar to that used 2 years ago in CASP4 (Venclovas, Proteins 2001; Suppl 5:47-54). The main features of this approach include use of multiple templates, initial assessment of alignment reliability in a region-specific manner, and structure-based selection of alignment variants in unreliable regions. The CASP5 modeling results presented here show significant improvement in comparison to CASP4, especially in the area of distant homology. The improvements include more effective use of multiple templates and better alignments. However, a number of structurally conserved regions in submitted distant homology models were misaligned. Analysis of these errors indicates that the absolute majority of them occurred in regions deemed unreliable in the course of model building. Most of these error-prone regions can be characterized by their peripheral location and a lack of conserved sequence patterns. For a few of the error-prone regions, all methods evaluated during CASP5 proved ineffective, pointing to the need for more sensitive energy-based methods. Despite these remaining issues, the applicability of comparative modeling continues to expand into more distant evolutionary relationships, providing a means to structurally characterize a significant number of currently available protein sequences. (C) 2003 Wiley-Liss, Inc.
引用
收藏
页码:380 / 388
页数:9
相关论文
共 25 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   DNA polymerase β-like nucleotidyltransferase superfamily:: identification of three new families, classification and evolutionary history [J].
Aravind, L ;
Koonin, EV .
NUCLEIC ACIDS RESEARCH, 1999, 27 (07) :1609-1618
[3]   Prediction of protein side-chain rotamers from a backbone-dependent rotamer library: A new homology modeling tool [J].
Bower, MJ ;
Cohen, FE ;
Dunbrack, RL .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 267 (05) :1268-1282
[4]   GCN5-related N-acetyltransferases: A structural overview [J].
Dyda, F ;
Klein, DC ;
Hickman, AB .
ANNUAL REVIEW OF BIOPHYSICS AND BIOMOLECULAR STRUCTURE, 2000, 29 :81-103
[5]   The structural basis of ordered substrate binding by serotonin N-acetyltransferase:: Enzyme complex at 1.8 Å resolution with a bisubstrate analog [J].
Hickman, AB ;
Namboodiri, MAA ;
Klein, DC ;
Dyda, F .
CELL, 1999, 97 (03) :361-369
[6]   Protein secondary structure prediction based on position-specific scoring matrices [J].
Jones, DT .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 292 (02) :195-202
[7]   DICTIONARY OF PROTEIN SECONDARY STRUCTURE - PATTERN-RECOGNITION OF HYDROGEN-BONDED AND GEOMETRICAL FEATURES [J].
KABSCH, W ;
SANDER, C .
BIOPOLYMERS, 1983, 22 (12) :2577-2637
[8]   MOLSCRIPT - A PROGRAM TO PRODUCE BOTH DETAILED AND SCHEMATIC PLOTS OF PROTEIN STRUCTURES [J].
KRAULIS, PJ .
JOURNAL OF APPLIED CRYSTALLOGRAPHY, 1991, 24 :946-950
[9]   Raster3D: Photorealistic molecular graphics [J].
Merritt, EA ;
Bacon, DJ .
MACROMOLECULAR CRYSTALLOGRAPHY, PT B, 1997, 277 :505-524
[10]  
MURZIN AG, 1995, J MOL BIOL, V247, P536, DOI 10.1016/S0022-2836(05)80134-2