TM-align: a protein structure alignment algorithm based on the TM-score

被引:2377
作者
Zhang, Y [1 ]
Skolnick, J [1 ]
机构
[1] Univ Buffalo, Ctr Excellence Bioinformat, Buffalo, NY 14203 USA
关键词
D O I
10.1093/nar/gki524
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We have developed TM-align, a new algorithm to identify the best structural alignment between protein pairs that combines the TM-score rotation matrix and Dynamic Programming (DP). The algorithm is similar to 4 times faster than CE and 20 times faster than DALI and SAL. On average, the resulting structure alignments have higher accuracy and coverage than those provided by these most often-used methods. TM-align is applied to an all-against-all structure comparison of 10 515 representative protein chains from the Protein Data Bank (PDB) with a sequence identity cutoff < 95%: 1996 distinct folds are found when a TM-score threshold of 0.5 is used. We also use TM-align to match the models predicted by TASSER for solved non-homologous proteins in PDB. For both folded and misfolded models, TM-align can almost always find close structural analogs, with an average root mean square deviation, RMSD, of 3 angstrom and 87% alignment coverage. Nevertheless, there exists a significant correlation between the correctness of the predicted structure and the structural similarity of the model to the other proteins in the PDB. This correlation could be used to assist in model selection in blind protein structure predictions.
引用
收藏
页码:2302 / 2309
页数:8
相关论文
共 41 条
[1]   Predictions without templates: New folds, secondary structure, and contacts in CASP5 [J].
Aloy, P ;
Stark, A ;
Hadley, S ;
Russell, RB .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 53 :436-456
[2]   Large-scale assessment of the utility of low-resolution protein structures for biochemical function assignment [J].
Arakaki, AK ;
Zhang, Y ;
Skolnick, J .
BIOINFORMATICS, 2004, 20 (07) :1087-1096
[3]   Protein structure prediction and structural genomics [J].
Baker, D ;
Sali, A .
SCIENCE, 2001, 294 (5540) :93-96
[4]   The Protein Data Bank [J].
Berman, HM ;
Westbrook, J ;
Feng, Z ;
Gilliland, G ;
Bhat, TN ;
Weissig, H ;
Shindyalov, IN ;
Bourne, PE .
NUCLEIC ACIDS RESEARCH, 2000, 28 (01) :235-242
[5]  
Betancourt MR, 2001, BIOPOLYMERS, V59, P305, DOI 10.1002/1097-0282(20011015)59:5<305::AID-BIP1027>3.3.CO
[6]  
2-Y
[7]   Rosetta predictions in CASP5: Successes, failures, and prospects for complete automation [J].
Bradley, P ;
Chivian, D ;
Meiler, J ;
Misura, KMS ;
Rohl, CA ;
Schief, WR ;
Wedemeyer, WJ ;
Schueler-Furman, O ;
Murphy, P ;
Schonbrun, J ;
Strauss, CEM ;
Baker, D .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2003, 53 :457-468
[8]   High-throughput computational and experimental techniques in structural genomics [J].
Chance, MR ;
Fiser, A ;
Sali, A ;
Pieper, U ;
Eswar, N ;
Xu, GP ;
Fajardo, JE ;
Radhakannan, T ;
Marinkovic, N .
GENOME RESEARCH, 2004, 14 (10B) :2145-2154
[9]   PROTEIN-STRUCTURE COMPARISON BY ALIGNMENT OF DISTANCE MATRICES [J].
HOLM, L ;
SANDER, C .
JOURNAL OF MOLECULAR BIOLOGY, 1993, 233 (01) :123-138
[10]   DALI - A NETWORK TOOL FOR PROTEIN-STRUCTURE COMPARISON [J].
HOLM, L ;
SANDER, C .
TRENDS IN BIOCHEMICAL SCIENCES, 1995, 20 (11) :478-480