PROTEIN DATABASE SEARCHES FOR MULTIPLE ALIGNMENTS

Cited by: 411
Authors
ALTSCHUL, SF
LIPMAN, DJ
Keywords
Alignment algorithms; Homology; Pattern recognition; Sequence comparison; Statistical significance
DOI
10.1073/pnas.87.14.5509
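Chinese Library Classification (CLC) codes
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09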
Abstract
Protein database searches frequently can reveal biologically significant sequence relationships useful in understanding structure and function. Weak but meaningful sequence patterns can be obscured, however, by other similarities due only to chance. By searching a database for multiple as opposed to pairwise alignments, distant relationships are much more easily distinguished from background noise. Recent statistical results permit the power of this approach to be analyzed. Given a typical query sequence, an algorithm described here permits the current protein database to be searched for three-sequence alignments in less than 4 min. Such searches have revealed a variety of subtle relationships that pairwise search methods would be unable to detect.
Pages: 5509-5513
Number of pages: 5