Using evolutionary information for the query and target improves fold recognition

被引:27
作者
Wallner, B [1 ]
Fang, HS [1 ]
Ohlson, T [1 ]
Frey-Skött, J [1 ]
Elofsson, A [1 ]
机构
[1] Stockholm Univ, Stockholm Bioinformat Ctr, SE-10691 Stockholm, Sweden
关键词
sequence alignment; fold recognition; profile-profile; sequence linking; intermediate sequence searches; PSI-BLAST; E-value;
D O I
10.1002/prot.10565
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In this study, we show that it is possible to increase the performance over PSI-BLAST by using evolutionary information for both query and target sequences. This information can be used in three different ways: by sequence linking, profile-profile alignments, and by combining sequence-profile and profile-sequence searches. If only PSI-BLAST is used, 16% of superfamily-related protein domains can be detected at 90% specificity, but if a sequence-profile and a profile-sequence search are combined, this is increased to 20%, profile profile searches detects 19%, whereas a linking procedure identifies 22% of these proteins. All three methods show equal performance, but the best combination of speed and accuracy seems to be obtained by the combined searches, because this method shows a good performance even at high specificity and the lowest computational cost. In addition, we show that the E-values reported by all these methods, including PSI-BLAST, underestimate the true rate of false positives. This behavior is seen even if a very strict E-value cutoff and a limited number of iterations are used. However, the difference is more pronounced with a looser E-value cutoff and more iterations.
引用
收藏
页码:342 / 350
页数:9
相关论文
共 36 条
[1]   Do aligned sequences share the same fold? [J].
Abagyan, RA ;
Batalov, S .
JOURNAL OF MOLECULAR BIOLOGY, 1997, 273 (01) :355-368
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   Assessing sequence comparison methods with reliable structurally identified distant evolutionary relationships [J].
Brenner, SE ;
Chothia, C ;
Hubbard, TJP .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (11) :6073-6078
[4]  
Bujnicki JM, 2001, PROTEINS, P184
[5]   LiveBench-1: Continuous benchmarking of protein structure prediction servers [J].
Bujnicki, JM ;
Elofsson, A ;
Fischer, D ;
Rychlewski, L .
PROTEIN SCIENCE, 2001, 10 (02) :352-361
[6]   A study on protein sequence alignment quality [J].
Elofsson, A .
PROTEINS-STRUCTURE FUNCTION AND BIOINFORMATICS, 2002, 46 (03) :330-339
[7]   Detection of unrelated proteins in sequences multiple alignments by using predicted secondary structures [J].
Errami, M ;
Geourjon, C ;
Deléage, G .
BIOINFORMATICS, 2003, 19 (04) :506-512
[8]  
Fischer D, 1996, PROTEIN SCI, V5, P947
[9]  
Fischer D, 2000, PAC S BIOCOMPUT, V2000, P116
[10]   Assignment of homology to genome sequences using a library of hidden Markov models that represent all proteins of known structure [J].
Gough, J ;
Karplus, K ;
Hughey, R ;
Chothia, C .
JOURNAL OF MOLECULAR BIOLOGY, 2001, 313 (04) :903-919