A Shannon entropy-based filter detects high-quality profile-profile alignments in searches for remote homologues

被引:8
作者
Capriotti, E
Fariselli, P
Rossi, I
Casadio, R
机构
[1] Univ Bologna, Dept Biol, I-40126 Bologna, Italy
[2] Univ Bologna, CIRB, I-40126 Bologna, Italy
[3] Univ Bologna, Dept Phys, I-40126 Bologna, Italy
[4] BioDec Srl, Bologna, Italy
关键词
alignment; PSI-BLAST; sequence profile; fold recognition; remote homologues;
D O I
10.1002/prot.10564
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Detection of homologous proteins with low-sequence identity to si given target (remote homologues) is routinely performed with alignment algorithms that take advantage of sequence profile. In this article, we investigate the efficacy of different alignment procedures for the task at hand on a set of 185 protein pairs with similar structures but low-sequence similarity. Criteria based on the SCOP label detection and MaxSub scores are adopted to score the results. We investigate the efficacy of alignments based on sequence-sequence, sequence-profile, and profile-profile information. We confirm that with profile-profile alignments the results are better than with other procedures. In addition, we report, and this is novel, that the selection of the results of the profile-profile alignments can be improved by using Shannon entropy, indicating that this parameter is important to recognize good profile-profile alignments among a plethora of meaningless pairs. By this, we enhance the global search accuracy without losing sensitivity and filter out most of the erroneous alignments. We also show that when the entropy filtering is adopted, the quality of the resulting alignments is comparable to that computed for the target and template structures with CE, a structural alignment program.
引用
收藏
页码:351 / 360
页数:10
相关论文
共 33 条
[11]  
2-H
[12]   A TIME-EFFICIENT, LINEAR-SPACE LOCAL SIMILARITY ALGORITHM [J].
HUANG, XQ ;
MILLER, W .
ADVANCES IN APPLIED MATHEMATICS, 1991, 12 (03) :337-357
[13]  
Hubbard TJP, 1999, PROTEINS, P15
[14]   Improving the quality of twilight-zone alignments [J].
Jaroszewski, L ;
Rychlewski, L ;
Godzik, A .
PROTEIN SCIENCE, 2000, 9 (08) :1487-1496
[15]   GenTHREADER: An efficient and reliable protein fold recognition method for genomic sequences [J].
Jones, DT .
JOURNAL OF MOLECULAR BIOLOGY, 1999, 287 (04) :797-815
[16]  
JONES TA, 1999, PROTEINS, V3, P83
[17]   DISCUSSION OF SOLUTION FOR BEST ROTATION TO RELATE 2 SETS OF VECTORS [J].
KABSCH, W .
ACTA CRYSTALLOGRAPHICA SECTION A, 1978, 34 (SEP) :827-828
[18]   Enhanced genome annotation using structural profiles in the program 3D-PSSM [J].
Kelley, LA ;
MacCallum, RM ;
Sternberg, MJE .
JOURNAL OF MOLECULAR BIOLOGY, 2000, 299 (02) :499-520
[19]   Predicting protein three-dimensional structure [J].
Moult, J .
CURRENT OPINION IN BIOTECHNOLOGY, 1999, 10 (06) :583-588
[20]  
MURZIN AG, 1995, J MOL BIOL, V247, P536, DOI 10.1016/S0022-2836(05)80134-2