PSIC: profile extraction from sequence alignments with position-specific counts of independent observations

被引:165
作者
Sunyaev, SR
Eisenhaber, F
Rodchenkov, IV
Eisenhaber, B
Tumanyan, VG
Kuznetsov, EN
机构
[1] European Mol Biol Lab, D-69012 Heidelberg, Germany
[2] Max Delbruck Cent Mol Med, D-13122 Berlin, Germany
[3] Russian Acad Sci, VA Engelhardt Mol Biol Inst, Moscow 117984, Russia
[4] Moscow Inst Phys & Technol, Dolgoprudnyi, Moscow Region, Russia
[5] Russian Acad Sci, Inst Control Sci, Moscow 117806, Russia
来源
PROTEIN ENGINEERING | 1999年 / 12卷 / 05期
关键词
fold recognition; motif recognition; profile extraction; position-specific independent counts; PSIC; sequence weighting;
D O I
10.1093/protein/12.5.387
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Sequence weighting techniques are aimed at balancing redundant observed information from subsets of similar sequences in multiple alignments. Traditional approaches apply the same weight to all positions of a given sequence, hence equal efficiency of phylogenetic changes is assumed along the whole sequence. This restrictive assumption is not required for the new method PSIC (position-specific independent counts) described in this paper. The number of independent observations (counts) of an amino acid type at a given alignment position is calculated from the overall similarity of the sequences that share the amino acid type at this position with the help of statistical concepts. This approach allows the fast computation of position-specific sequence weights even for alignments containing hundreds of sequences. The PSIC approach has been applied to profile extraction and to the fold family assignment of protein sequences with known structures. Our method was shown to be very productive in finding distantly related sequences and more powerful than Hidden Markov Models or the profile methods in WiseTools and PSI-BLAST in many cases. The profile extraction routine is available on the WWW (http://www.bork.embl-heidelberg.de/PSIC or http://www.imb.ac.ru/PSIC).
引用
收藏
页码:387 / 394
页数:8
相关论文
共 29 条