A Shannon entropy-based filter detects high-quality profile-profile alignments in searches for remote homologues

被引:8
作者
Capriotti, E
Fariselli, P
Rossi, I
Casadio, R
机构
[1] Univ Bologna, Dept Biol, I-40126 Bologna, Italy
[2] Univ Bologna, CIRB, I-40126 Bologna, Italy
[3] Univ Bologna, Dept Phys, I-40126 Bologna, Italy
[4] BioDec Srl, Bologna, Italy
关键词
alignment; PSI-BLAST; sequence profile; fold recognition; remote homologues;
D O I
10.1002/prot.10564
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Detection of homologous proteins with low-sequence identity to si given target (remote homologues) is routinely performed with alignment algorithms that take advantage of sequence profile. In this article, we investigate the efficacy of different alignment procedures for the task at hand on a set of 185 protein pairs with similar structures but low-sequence similarity. Criteria based on the SCOP label detection and MaxSub scores are adopted to score the results. We investigate the efficacy of alignments based on sequence-sequence, sequence-profile, and profile-profile information. We confirm that with profile-profile alignments the results are better than with other procedures. In addition, we report, and this is novel, that the selection of the results of the profile-profile alignments can be improved by using Shannon entropy, indicating that this parameter is important to recognize good profile-profile alignments among a plethora of meaningless pairs. By this, we enhance the global search accuracy without losing sensitivity and filter out most of the erroneous alignments. We also show that when the entropy filtering is adopted, the quality of the resulting alignments is comparable to that computed for the target and template structures with CE, a structural alignment program.
引用
收藏
页码:351 / 360
页数:10
相关论文
共 33 条
[1]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[2]  
BURKHARD R, 1999, PROTEIN ENG, V12, P85
[3]   PROGRESS IN FOLD RECOGNITION [J].
FLOCKNER, H ;
BRAXENTHALER, M ;
LACKNER, P ;
JARITZ, M ;
ORTNER, M ;
SIPPL, MJ .
PROTEINS-STRUCTURE FUNCTION AND GENETICS, 1995, 23 (03) :376-386
[4]   Surprising similarities in structure comparison [J].
Gibrat, JF ;
Madej, T ;
Bryant, SH .
CURRENT OPINION IN STRUCTURAL BIOLOGY, 1996, 6 (03) :377-385
[5]   The structural alignment between two proteins: Is there a unique answer? [J].
Godzik, A .
PROTEIN SCIENCE, 1996, 5 (07) :1325-1338
[6]   REGULARITIES IN INTERACTION PATTERNS OF GLOBULAR-PROTEINS [J].
GODZIK, A ;
SKOLNICK, J ;
KOLINSKI, A .
PROTEIN ENGINEERING, 1993, 6 (08) :801-810
[7]   AMINO-ACID SUBSTITUTION MATRICES FROM PROTEIN BLOCKS [J].
HENIKOFF, S ;
HENIKOFF, JG .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1992, 89 (22) :10915-10919
[8]  
HERMJAKOB H, 1998, SPTR COMPREHENSIVE N
[9]   Mapping the protein universe [J].
Holm, L ;
Sander, C .
SCIENCE, 1996, 273 (5275) :595-602
[10]  
Holm L, 1998, PROTEINS, V33, P88, DOI 10.1002/(SICI)1097-0134(19981001)33:1<88::AID-PROT8>3.0.CO