A multi-template combination algorithm for protein comparative modeling

被引:75
作者
Cheng, Jianlin [1 ]
机构
[1] Univ Missouri, Inst Informat, Dept Comp Sci, Columbia, MO 65211 USA
关键词
D O I
10.1186/1472-6807-8-18
中图分类号
Q6 [生物物理学];
学科分类号
071011 ;
摘要
Background: Multiple protein templates are commonly used in manual protein structure prediction. However, few automated algorithms of selecting and combining multiple templates are available. Results: Here we develop an effective multi-template combination algorithm for protein comparative modeling. The algorithm selects templates according to the similarity significance of the alignments between template and target proteins. It combines the whole template-target alignments whose similarity significance score is close to that of the top template-target alignment within a threshold, whereas it only takes alignment fragments from a less similar template-target alignment that align with a sizable uncovered region of the target. We compare the algorithm with the traditional method of using a single top template on the 45 comparative modeling targets (i. e. easy template-based modeling targets) used in the seventh edition of Critical Assessment of Techniques for Protein Structure Prediction (CASP7). The multitemplate combination algorithm improves the GDT-TS scores of predicted models by 6.8% on average. The statistical analysis shows that the improvement is significant (p-value < 10(-4)). Compared with the ideal approach that always uses the best template, the multi-template approach yields only slightly better performance. During the CASP7 experiment, the preliminary implementation of the multi-template combination algorithm (FOLDpro) was ranked second among 67 servers in the category of high-accuracy structure prediction in terms of GDT-TS measure. Conclusion: We have developed a novel multi-template algorithm to improve protein comparative modeling.
引用
收藏
页数:13
相关论文
共 95 条
[71]   Analysis and assessment of comparative modeling predictions in CASP4 [J].
Tramontano, A ;
Leplae, R ;
Morea, V .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2001, :22-38
[72]  
TRAMONTANO A, 2006, PROTEIN STRUTURE PRE
[73]   Assessment of predictions submitted for the CASP6 comparative modeling category [J].
Tress, M ;
Ezkurdia, L ;
Graña, O ;
López, G ;
Valencia, A .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 :27-45
[74]   Comparative modeling in CASP6 using consensus approach to template selection, sequence-structure alignment, and structure assessment [J].
Venclovas, C ;
Margelevicius, M .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 :99-105
[75]   Comparative modeling in CASP5: Progress is evident, but alignment errors remain a significant hindrance [J].
Venclovas, C .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 53 :380-388
[76]   Completeness in structural genomics [J].
Vitkup, D ;
Melamud, E ;
Moult, J ;
Sander, C .
NATURE STRUCTURAL BIOLOGY, 2001, 8 (06) :559-566
[77]   Pcons5: combining consensus, structural evaluation and fold recognition scores [J].
Wallner, B ;
Elofsson, A .
BIOINFORMATICS, 2005, 21 (23) :4248-4254
[78]   All are not equal: A benchmark of different homology modeling programs [J].
Wallner, B ;
Elofsson, A .
PROTEIN SCIENCE, 2005, 14 (05) :1315-1327
[79]   Using evolutionary information for the query and target improves fold recognition [J].
Wallner, B ;
Fang, HS ;
Ohlson, T ;
Frey-Skött, J ;
Elofsson, A .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 54 (02) :342-350
[80]   The Protein Data Bank and structural genomics [J].
Westbrook, J ;
Feng, ZK ;
Chen, L ;
Yang, HW ;
Berman, HM .
NUCLEIC ACIDS RESEARCH, 2003, 31 (01) :489-491