Using evolutionary information for the query and target improves fold recognition

被引:27
作者
Wallner, B [1 ]
Fang, HS [1 ]
Ohlson, T [1 ]
Frey-Skött, J [1 ]
Elofsson, A [1 ]
机构
[1] Stockholm Univ, Stockholm Bioinformat Ctr, SE-10691 Stockholm, Sweden
关键词
sequence alignment; fold recognition; profile-profile; sequence linking; intermediate sequence searches; PSI-BLAST; E-value;
D O I
10.1002/prot.10565
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
In this study, we show that it is possible to increase the performance over PSI-BLAST by using evolutionary information for both query and target sequences. This information can be used in three different ways: by sequence linking, profile-profile alignments, and by combining sequence-profile and profile-sequence searches. If only PSI-BLAST is used, 16% of superfamily-related protein domains can be detected at 90% specificity, but if a sequence-profile and a profile-sequence search are combined, this is increased to 20%, profile profile searches detects 19%, whereas a linking procedure identifies 22% of these proteins. All three methods show equal performance, but the best combination of speed and accuracy seems to be obtained by the combined searches, because this method shows a good performance even at high specificity and the lowest computational cost. In addition, we show that the E-values reported by all these methods, including PSI-BLAST, underestimate the true rate of false positives. This behavior is seen even if a very strict E-value cutoff and a limited number of iterations are used. However, the difference is more pronounced with a looser E-value cutoff and more iterations.
引用
收藏
页码:342 / 350
页数:9
相关论文
共 36 条
[31]  
Schäffer AA, 1999, BIOINFORMATICS, V15, P1000
[32]  
Smith SS, 1998, INT J MOL MED, V1, P147
[33]  
Venclovas C, 2001, PROTEINS, P47
[34]  
VONOHSEN N, 2001, ALGORITHMS BIOINFORM, V2149, P11
[35]   Can correct protein models be identified? [J].
Wallner, B ;
Elofsson, A .
PROTEIN SCIENCE, 2003, 12 (05) :1073-1086
[36]   Within the twilight zone: A sensitive profile-profile comparison tool based on information theory [J].
Yona, G ;
Levitt, M .
JOURNAL OF MOLECULAR BIOLOGY, 2002, 315 (05) :1257-1275