A multi-template combination algorithm for protein comparative modeling

被引:75
作者
Cheng, Jianlin [1 ]
机构
[1] Univ Missouri, Inst Informat, Dept Comp Sci, Columbia, MO 65211 USA
关键词
D O I
10.1186/1472-6807-8-18
中图分类号
Q6 [生物物理学];
学科分类号
071011 ;
摘要
Background: Multiple protein templates are commonly used in manual protein structure prediction. However, few automated algorithms of selecting and combining multiple templates are available. Results: Here we develop an effective multi-template combination algorithm for protein comparative modeling. The algorithm selects templates according to the similarity significance of the alignments between template and target proteins. It combines the whole template-target alignments whose similarity significance score is close to that of the top template-target alignment within a threshold, whereas it only takes alignment fragments from a less similar template-target alignment that align with a sizable uncovered region of the target. We compare the algorithm with the traditional method of using a single top template on the 45 comparative modeling targets (i. e. easy template-based modeling targets) used in the seventh edition of Critical Assessment of Techniques for Protein Structure Prediction (CASP7). The multitemplate combination algorithm improves the GDT-TS scores of predicted models by 6.8% on average. The statistical analysis shows that the improvement is significant (p-value < 10(-4)). Compared with the ideal approach that always uses the best template, the multi-template approach yields only slightly better performance. During the CASP7 experiment, the preliminary implementation of the multi-template combination algorithm (FOLDpro) was ranked second among 67 servers in the category of high-accuracy structure prediction in terms of GDT-TS measure. Conclusion: We have developed a novel multi-template algorithm to improve protein comparative modeling.
引用
收藏
页数:13
相关论文
共 95 条
[1]   Combining multiple structure and sequence alignments to improve sequence detection and alignment: Application to the SH2 domains of Janus kinases [J].
Al-Lazikani, B ;
Sheinerman, FB ;
Honig, B .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2001, 98 (26) :14796-14801
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]  
Bates PA, 2001, PROTEINS, P39
[4]   Automated server predictions in CASP7 [J].
Battey, James N. D. ;
Kopp, Jurgen ;
Bordoli, Lorenza ;
Read, Randy J. ;
Clarke, Neil D. ;
Schwede, Torsten .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2007, 69 :68-82
[5]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[6]   COMPUTER-AIDED-DESIGN IN PROTEIN ENGINEERING [J].
BLUNDELL, T ;
STERNBERG, MJE .
TRENDS IN BIOTECHNOLOGY, 1985, 3 (09) :228-235
[7]   3-DIMENSIONAL STRUCTURAL ASPECTS OF THE DESIGN OF NEW-PROTEIN MOLECULES [J].
BLUNDELL, TL ;
BARLOW, D ;
SIBANDA, BL ;
THORNTON, JM ;
TAYLOR, W ;
TICKLE, IJ ;
STERNBERG, MJE ;
PITTS, JE ;
HANEEF, I ;
HEMMINGS, AM .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES, 1986, 317 (1540) :333-344
[8]   KNOWLEDGE-BASED PREDICTION OF PROTEIN STRUCTURES AND THE DESIGN OF NOVEL MOLECULES [J].
BLUNDELL, TL ;
SIBANDA, BL ;
STERNBERG, MJE ;
THORNTON, JM .
NATURE, 1987, 326 (6111) :347-352
[9]   A tour of structural genomics [J].
Brenner, SE .
NATURE REVIEWS GENETICS, 2001, 2 (10) :801-809
[10]   A POSSIBLE 3-DIMENSIONAL STRUCTURE OF BOVINE ALPHA-LACTALBUMIN BASED ON THAT OF HENS EGG-WHITE LYSOZYME [J].
BROWNE, WJ ;
NORTH, ACT ;
PHILLIPS, DC .
JOURNAL OF MOLECULAR BIOLOGY, 1969, 42 (01) :65-&