Analysis of TASSER-based CASP7 protein structure prediction results

被引:51
作者
Zhou, Hongyi
Pandit, Shashi B.
Lee, Seung Yup
Borreguero, Jose
Chen, Huiling
Wroblewska, Liliana
Skolnick, Jeffrey
机构
关键词
template-based modeling; TASSER; MetaTASSER; PROSPECTOR_3; SPARKS; SP3; 3D-jury; fold recognition; ab initio structure prediction;
D O I
10.1002/prot.21649
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
An improved TASSER (Threading/ASSEmbly/Refinement) methodology is applied to predict the tertiary structure for all CASP7 targets. TASSER employs template identification by threading, followed by tertiary structure assembly by rearranging continuous template fragments, where conformational space is searched via Parallel Hyperbolic Monte Carlo sampling with an optimized force-field that includes knowledge-based statistical potentials and restraints derived from threading templates. The final models are selected by clustering structures from the low temperature replicas. Improvements in TASSER over CASP6 involve use of better templates from 3D-jury applied to three threading programs, PROSPECTOR_3, SP3, and SPARKS, and a fragment comparison method for better model ranking. For targets with no reliable templates, a variant of TASSER (chunk-TASSER) is also applied with potentials and restraints extracted from ab initio folded supersecondary chunks of the target to build full-length models. For all 124 CASP targets/domains, the average root-mean-square-deviation (RMSD) from native and alignment coverage the best initial threading models from 3D-jury are 6.2 angstrom and 9396, respectively. Following TASSER reassembly, the average RMSD of the best model in the template aligned region decreases to 4.9 angstrom and the average TM-score increases from 0.617 for the template to 0.678 for the best full-length model. Based on target difficulty, the average TM-scores of the final model to native are 0.904, 0.671, and 0.307 for high-accuracy template-based modeling, template-based modeling, and free modeling targets/domains, respectively. For the more difficult targets, TASSER with modest human intervention performed better in comparison to its server counterpart, MetaTASSER, which used a limited time simulation.
引用
收藏
页码:90 / 97
页数:8
相关论文
共 32 条
[21]   SEQUENCE ALIGNMENT AND PENALTY CHOICE - REVIEW OF CONCEPTS, CASE-STUDIES AND IMPLICATIONS [J].
VINGRON, M ;
WATERMAN, MS .
JOURNAL OF MOLECULAR BIOLOGY, 1994, 235 (01) :1-12
[22]  
WALLNER B, 2003, STRUCT FUNCT GENET S, V6, P534
[23]   TASSER: An automated method for the prediction of protein tertiary structures in CASP6 [J].
Zhang, Y ;
Arakaki, AK ;
Skolnick, JR .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2005, 61 :91-98
[24]   On the origin and highly likely completeness of single-domain protein structures [J].
Zhang, Y ;
Hubner, IA ;
Arakaki, AK ;
Shakhnovich, E ;
Skolnick, J .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2006, 103 (08) :2605-2610
[25]   TM-align: a protein structure alignment algorithm based on the TM-score [J].
Zhang, Y ;
Skolnick, J .
NUCLEIC ACIDS RESEARCH, 2005, 33 (07) :2302-2309
[26]   Automated structure prediction of weakly homologous proteins on a genomic scale' [J].
Zhang, Y ;
Skolnick, J .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2004, 101 (20) :7594-7599
[27]   SPICKER:: A clustering approach to identify near-native protein folds [J].
Zhang, Y ;
Skolnick, J .
JOURNAL OF COMPUTATIONAL CHEMISTRY, 2004, 25 (06) :865-871
[28]   Scoring function for automated assessment of protein structure template quality [J].
Zhang, Y ;
Skolnick, J .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2004, 57 (04) :702-710
[29]   Local energy landscape flattening: Parallel hyperbolic Monte Carlo sampling of protein folding [J].
Zhang, Y ;
Kihara, D ;
Skolnick, J .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 2002, 48 (02) :192-201
[30]  
ZHOU H, 2005, PROTEINS, V7, P152