Comprehensive evaluation of protein structure alignment methods: Scoring by geometric measures

被引:214
作者
Kolodny, R
Koehl, P
Levitt, M
机构
[1] Stanford Univ, Dept Biol Struct, Stanford, CA 94305 USA
[2] Stanford Univ, Dept Comp Sci, Stanford, CA 94305 USA
关键词
comparison of structural alignment; protein structure alignment; protein structure comparison; geometric measures; ROC curves;
D O I
10.1016/j.jmb.2004.12.032
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
We report the largest and most comprehensive comparison of protein structural alignment methods. Specifically, we evaluate six publicly available structure alignment programs: SSAP, STRUCTAL, DALI, LSQMAN, CE and SSM by aligning all 8,581,970 protein structure pairs in a test set of 2930 protein domains specially selected from CATH v.2.4 to ensure sequence diversity. We consider an alignment good if it matches many residues, and the two substructures are geometrically similar. Even with this definition, evaluating structural alignment methods is not straightforward. At first, we compared the rates of true and false positives using receiver operating characteristic (ROC) curves with the CATH classification taken as a gold standard. This proved unsatisfactory in that the quality of the alignments is not taken into account: sometimes a method that finds less good alignments scores better than a method that finds better alignments. We correct this intrinsic limitation by using four different geometric match measures (SI, MI, SAS, and GSAS) to evaluate the quality of each structural alignment. With this improved analysis we show that there is a wide variation in the performance of different methods; the main reason for this is that it can be difficult to find a good structural alignment between two proteins even when such an alignment exists. We find that STRUCTAL and SSM perform best, followed by LSQMAN and CE. Our focus on the intrinsic quality of each alignment allows us to propose a new method, called "Best-of-All" that combines the best results of all methods. Many commonly used methods miss 10-50% of the good Best-of-All alignments. By putting existing structural alignments into proper perspective, our study allows better comparison of protein structures. By highlighting limitations of existing methods, it will spur the further development of better structural alignment methods. This will have significant biological implications now that structural comparison has come to play a central role in the analysis of experimental work on protein structure, protein function and protein evolution. (C) 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1173 / 1188
页数:16
相关论文
共 47 条
[41]  
2-K
[42]   Sensitivity and selectivity in protein structure comparison [J].
Sierk, ML ;
Pearson, WR .
PROTEIN SCIENCE, 2004, 13 (03) :773-785
[43]   A model for statistical significance of local similarities in structure [J].
Stark, A ;
Sunyaev, S ;
Russell, RB .
JOURNAL OF MOLECULAR BIOLOGY, 2003, 326 (05) :1307-1316
[44]   STRUCTURAL SIMILARITY OF DNA-BINDING DOMAINS OF BACTERIOPHAGE REPRESSORS AND THE GLOBIN CORE [J].
SUBBIAH, S ;
LAURENTS, DV ;
LEVITT, M .
CURRENT BIOLOGY, 1993, 3 (03) :141-148
[45]   PROTEIN-STRUCTURE ALIGNMENT [J].
TAYLOR, WR ;
ORENGO, CA .
JOURNAL OF MOLECULAR BIOLOGY, 1989, 208 (01) :1-22
[46]   BAliBASE: a benchmark alignment database for the evaluation of multiple alignment programs [J].
Thompson, JD ;
Plewniak, F ;
Poch, O .
BIOINFORMATICS, 1999, 15 (01) :87-88
[47]   An integrated approach to the analysis and modeling of protein sequences and structures. I. Protein structural alignment and a quantitative measure for protein structural distance [J].
Yang, AS ;
Honig, B .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 301 (03) :665-678