Consensus alignment for reliable framework prediction in homology modeling

被引:25
作者
Prasad, JC
Comeau, SR
Vajda, S
Camacho, CJ [1 ]
机构
[1] Boston Univ, Bioinformat Grad Program, Boston, MA 02215 USA
[2] Boston Univ, Dept Biomed Engn, Boston, MA 02215 USA
关键词
D O I
10.1093/bioinformatics/btg211
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: Even the best sequence alignment methods frequently fail to correctly identify the framework regions for which backbones can be copied from the template into the target structure. Since the underprediction and, more significantly, the overprediction of these regions reduces the quality of the final model, it is of prime importance to attain as much as possible of the true structural alignment between target and template. Results: We have developed an algorithm called Consensus that consistently provides a high quality alignment for comparative modeling. The method follows from a benchmark analysis of the 3D models generated by ten alignment techniques for a set of 79 homologous protein structure pairs. For 20-to-40% of the targets, these methods yield models with at least 6 Angstrom root mean square deviation (RMSD) from the native structure. We have selected the top five performing methods, and developed a consensus algorithm to generate an improved alignment. By building on the individual strength of each method, a set of criteria was implemented to remove the alignment segments that are likely to correspond to structurally dissimilar regions. The automated algorithm was validated on a different set of 48 protein pairs, resulting in 2.2 Angstrom average RMSD for the predicted models, and only four cases in which the RMSD exceeded 3 Angstrom. The average length of the alignments was about 75% of that found by standard structural superposition methods. The performance of Consensus was consistent from 2 to 32% target-template sequence identity, and hence it can be used for accurate prediction of framework regions in homology modeling.
引用
收藏
页码:1682 / 1691
页数:10
相关论文
共 29 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[3]   Protein-protein association kinetics and protein docking [J].
Camacho, CJ ;
Vajda, S .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 2002, 12 (01) :36-40
[4]   Predicting reliable regions in protein sequence alignments [J].
Cline, M ;
Hughey, R ;
Karplus, K .
BIOINFORMATICS, 2002, 18 (02) :306-314
[5]  
Cuff JA, 2000, PROTEINS, V40, P502, DOI 10.1002/1097-0134(20000815)40:3<502::AID-PROT170>3.0.CO
[6]  
2-Q
[7]   SCORE: predicting the core of protein models [J].
Deane, CM ;
Kaas, Q ;
Blundell, TL .
BIOINFORMATICS, 2001, 17 (06) :541-550
[8]   Profile hidden Markov models [J].
Eddy, SR .
BIOINFORMATICS, 1998, 14 (09) :755-763
[9]   A study on protein sequence alignment quality [J].
Elofsson, A .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 46 (03) :330-339
[10]  
Fiser A, 2001, COMPUTATIONAL BIOCHEMISTRY AND BIOPHYSICS, P275