COMPASS: A tool for comparison of multiple protein alignments with assessment of statistical significance

Cited by: 207
Authors
Sadreyev, R
Grishin, N
Affiliations
[1] Univ Texas, SW Med Ctr, Howard Hughes Med Inst, Dallas, TX 75390 USA
[2] Univ Texas, SW Med Ctr, Dept Biochem, Dallas, TX 75390 USA
Keywords
sequence similarity searches; profile-profile comparison; sequence profiles; protein structure prediction; CTF/NFI;
DOI
10.1016/S0022-2836(02)01371-2
Chinese Library Classification (CLC) number
Q5 [Biochemistry]; Q7 [Molecular Biology];
Subject classification code
071010 ; 081704 ;
Abstract
We present a novel method for the comparison of multiple protein alignments with assessment of statistical significance (COMPASS). The method derives numerical profiles from alignments, constructs optimal local profile-profile alignments and analytically estimates E-values for the detected similarities. The scoring system and E-value calculation are based on a generalization of the PSI-BLAST approach to profile-sequence comparison, which is adapted for the profile-profile case. Tested along with existing methods for profile-sequence (PSI-BLAST) and profile-profile (prof_sim) comparison, COMPASS shows increased abilities for sensitive and selective detection of remote sequence similarities, as well as improved quality of local alignments. The method allows prediction of relationships between protein families in the PFAM database beyond the range of conventional methods. Two predicted relations with high significance are similarities between various Rossmann-type folds and between various helix-turn-helix-containing families. The potential value of COMPASS for structure/function predictions is illustrated by the detection of an intricate homology between the DNA-binding domain of the CTF/NFI family and the MH1 domain of the Smad family. © 2003 Elsevier Science Ltd. All rights reserved.
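The three steps named in the abstract (profiles, optimal local profile-profile alignment, analytic E-value) can be illustrated with a minimal sketch. It is not the COMPASS implementation: the column score below is a simple illustrative log-odds form, the linear gap penalty is an assumption, and the statistical parameters `lam` and `k` passed to `e_value` are hypothetical placeholders rather than values from the paper. Only the general Smith-Waterman recursion and the Karlin-Altschul form E = K·m·n·exp(−λS) are standard.

```python
import math


def column_score(p, q, background):
    """Illustrative log-odds score between two profile columns.

    p, q: residue frequency vectors; background: background frequencies.
    This scoring form is an assumption, not the COMPASS scoring system.
    """
    s = sum(pa * qa / ba for pa, qa, ba in zip(p, q, background))
    return math.log(s) if s > 0 else float("-inf")


def local_profile_alignment(prof_a, prof_b, background, gap=1.0):
    """Best local alignment score of two profiles (Smith-Waterman).

    Uses a linear gap penalty (an assumption made for brevity) and keeps
    only two rows of the dynamic-programming matrix.
    """
    cols = len(prof_b)
    prev = [0.0] * (cols + 1)
    best = 0.0
    for i in range(1, len(prof_a) + 1):
        cur = [0.0] * (cols + 1)
        for j in range(1, cols + 1):
            match = prev[j - 1] + column_score(
                prof_a[i - 1], prof_b[j - 1], background
            )
            # Local alignment: scores never drop below zero.
            cur[j] = max(0.0, match, prev[j] - gap, cur[j - 1] - gap)
            best = max(best, cur[j])
        prev = cur
    return best


def e_value(score, len_a, len_b, lam, k):
    """Karlin-Altschul style E-value; lam and k are placeholder inputs."""
    return k * len_a * len_b * math.exp(-lam * score)


# Toy two-letter alphabet with uniform background frequencies.
background = [0.5, 0.5]
prof_a = [[1.0, 0.0], [0.0, 1.0], [1.0, 0.0]]
prof_b = [[0.0, 1.0], [1.0, 0.0]]
score = local_profile_alignment(prof_a, prof_b, background)
expect = e_value(score, len(prof_a), len(prof_b), lam=0.3, k=0.1)
```

In this toy case the last two columns of `prof_a` match both columns of `prof_b`, so the best local score is the sum of two identical-column scores. Real profiles would carry 20 residue frequencies per column, typically with pseudocounts so that no frequency is exactly zero.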
Pages: 317-336
Page count: 20