An evaluation of automated homology modelling methods at low target-template sequence similarity

被引:77
作者
Dalton, James A. R. [1 ]
Jackson, Richard M. [1 ]
机构
[1] Univ Leeds, Fac Biol Sci, Inst Mol & Cellular Biol, Leeds LS2 9JT, W Yorkshire, England
基金
英国生物技术与生命科学研究理事会;
关键词
D O I
10.1093/bioinformatics/btm262
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: There are two main areas of difficulty in homology modelling that are particularly important when sequence identity between target and template falls below 50%: sequence alignment and loop building. These problems become magnified with automatic modelling processes, as there is no human input to correct mistakes. As such we have benchmarked several stand-alone strategies that could be implemented in a workflow for automated high-throughput homology modelling. These include three new sequence-structure alignment programs: 3D-Coffee, Staccato and SAlign, plus five homology modelling programs and their respective loop building methods: Builder, Nest, Modeller, SegMod/ENCAD and Swiss-Model. The SABmark database provided 123 targets with at least five templates from the same SCOP family and sequence identities <= 50%. Results: When using Modeller as the common modelling program, 3D-Coffee outperforms Staccato and SAlign using both multiple templates and the best single template, and across the sequence identity range 20-50%. The mean model RMSD generated from 3D-Coffee using multiple templates is 15 and 28% (or using single templates, 3 and 13%) better than those generated by Staccato and Salign, respectively. 3D-Coffee gives equivalent modelling accuracy from multiple and single templates, but Staccato and SAlign are more successful with single templates, their quality deteriorating as additional lower sequence identity templates are added. Evaluating the different homology modelling programs, on average Modeller performs marginally better in overall modelling than the others tested. However, on average Nest produces the best loops with an 8% improvement by mean RMSD compared to the loops generated by Builder.
引用
收藏
页码:1901 / 1908
页数:8
相关论文
共 31 条
[1]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[2]  
Bates PA, 2001, PROTEINS, P39
[3]   THE RELATION BETWEEN THE DIVERGENCE OF SEQUENCE AND STRUCTURE IN PROTEINS [J].
CHOTHIA, C ;
LESK, AM .
EMBO JOURNAL, 1986, 5 (04) :823-826
[4]   A study on protein sequence alignment quality [J].
Elofsson, A .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 46 (03) :330-339
[5]   Modeling of loops in protein structures [J].
Fiser, A ;
Do, RKG ;
Sali, A .
PROTEIN SCIENCE, 2000, 9 (09) :1753-1773
[6]  
Gerstein M, 1998, PROTEIN SCI, V7, P445
[7]   CE-MC: a multiple protein structure alignment server [J].
Guda, C ;
Lu, SF ;
Scheeff, ED ;
Bourne, PE ;
Shindyalov, IN .
NUCLEIC ACIDS RESEARCH, 2004, 32 :W100-W103
[8]   Utility of homology models in the drug discovery process [J].
Hillisch, A ;
Pineda, LF ;
Hilgenfeld, R .
DRUG DISCOVERY TODAY, 2004, 9 (15) :659-669
[9]   A SELF-CONSISTENT MEAN-FIELD APPROACH TO SIMULTANEOUS GAP CLOSURE AND SIDE-CHAIN POSITIONING IN HOMOLOGY MODELING [J].
KOEHL, P ;
DELARUE, M .
NATURE STRUCTURAL BIOLOGY, 1995, 2 (02) :163-170
[10]   Automated protein structure homology modeling: a progress report [J].
Kopp, J ;
Schwede, T .
PHARMACOGENOMICS, 2004, 5 (04) :405-416