Structure-based evaluation of sequence comparison and fold recognition alignment accuracy

被引:81
作者
Domingues, FS [1 ]
Lackner, P [1 ]
Andreeva, A [1 ]
Sippl, MJ [1 ]
机构
[1] Salzburg Univ, Ctr Appl Mol Engn, Inst Chem & Biochem, A-5020 Salzburg, Austria
基金
奥地利科学基金会;
关键词
threading; sequence alignment; structure comparison; structure superposition; structure similarity;
D O I
10.1006/jmbi.2000.3615
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
The biological role, biochemical function, and structure of uncharacterized protein sequences is often inferred from their similarity to known proteins. A constant goal is to increase the reliability, sensitivity, and accuracy of alignment techniques to enable the detection of increasingly distant relationships. Development, tuning, and testing of these methods benefit from appropriate benchmarks for the assessment of alignment accuracy. Here, we describe a benchmark protocol to estimate sequence-to-sequence and sequence-to-structure alignment accuracy. The protocol consists of structurally related pairs of proteins and procedures to evaluate alignment accuracy over the whole set. The set of protein pairs covers all the currently known fold types. The benchmark is challenging in the sense that it consists of proteins lacking clear sequence similarity. Correct target alignments are derived from the three-dimensional structures of these pairs by rigid body superposition. An evaluation engine computes the accuracy of alignments obtained from a particular algorithm in terms of alignment shifts with respect to the structure derived alignments. Using this benchmark we estimate that the best results can be obtained from a combination of amino acid residue substitution matrices and knowledge-based potentials. (C) 2000 Academic Press.
引用
收藏
页码:1003 / 1013
页数:11
相关论文
共 40 条
[1]  
Alexandrov NN, 1998, PROTEIN SCI, V7, P254
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   PROTEIN DATA BANK - COMPUTER-BASED ARCHIVAL FILE FOR MACROMOLECULAR STRUCTURES [J].
BERNSTEIN, FC ;
KOETZLE, TF ;
WILLIAMS, GJB ;
MEYER, EF ;
BRICE, MD ;
RODGERS, JR ;
KENNARD, O ;
SHIMANOUCHI, T ;
TASUMI, M .
JOURNAL OF MOLECULAR BIOLOGY, 1977, 112 (03) :535-542
[4]   Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships [J].
Brenner, SE ;
Chothia, C ;
Hubbard, TJP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (11) :6073-6078
[5]  
Bryant SH, 1996, PROTEINS, V26, P172
[6]  
Domingues FS, 1999, PROTEINS, P112
[7]  
Feng ZK, 1996, FOLD DES, V1, P123
[8]  
Fischer D, 1996, Pac Symp Biocomput, P300
[9]   PROGRESS IN FOLD RECOGNITION [J].
FLOCKNER, H ;
BRAXENTHALER, M ;
LACKNER, P ;
JARITZ, M ;
ORTNER, M ;
SIPPL, MJ .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 23 (03) :376-386
[10]  
GERSTEIN M, 1996, 4 INT C INT SYST MOL, V4, P59