SUBOPTIMAL SEQUENCE ALIGNMENT IN MOLECULAR-BIOLOGY - ALIGNMENT WITH ERROR ANALYSIS

Cited by: 69
Author
ZUKER, M
Affiliation
[1] Institute for Biological Sciences, National Research Council of Canada, Ottawa
Keywords
SEQUENCE COMPARISONS; ALIGNMENT SIGNIFICANCE; PROTEIN STRUCTURE SUPERPOSITION; DOT PLOT
DOI
10.1016/0022-2836(91)80062-Y
Chinese Library Classification
Q5 [Biochemistry]; Q7 [Molecular Biology]
Discipline classification code
071010; 081704
Abstract
A molecular sequence alignment algorithm based on dynamic programming has been extended to allow the computation of all pairs of residues that can be part of optimal and suboptimal sequence alignments. The uncertainties inherent in sequence alignment can be displayed using a new form of dot plot. The method allows the qualitative assessment of whether or not two sequences are related, and can reveal what parts of the alignment are better determined than others. It also permits the computation of representative optimal and suboptimal alignments. The relation between alignment reliability and alignment parameters is discussed. Other applications are to cyclical permutations of sequences and the detection of self-similarity. An application to multiple sequence alignment is noted. © 1991.
Pages: 403-420
Number of pages: 18
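
The abstract's central idea, finding every residue pair that can take part in an optimal or near-optimal alignment, can be illustrated with a forward and backward dynamic-programming pass. The sketch below is not the paper's implementation. It assumes global alignment, a linear gap penalty, and a simple match/mismatch score, and all function names, parameter names, and default values are hypothetical. For each pair (i, j) it computes the best score of any alignment forced to pair a[i] with b[j], then keeps the pairs scoring within `delta` of the optimum. Those pairs are the points one would draw in a dot plot of the kind the abstract describes.

```python
def forward_matrix(a, b, match, mismatch, gap):
    """F[i][j] = best global score for aligning a[:i] with b[:j]."""
    n, m = len(a), len(b)
    F = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        F[i][0] = i * gap          # a[:i] aligned entirely against gaps
    for j in range(1, m + 1):
        F[0][j] = j * gap          # b[:j] aligned entirely against gaps
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            s = match if a[i - 1] == b[j - 1] else mismatch
            F[i][j] = max(F[i - 1][j - 1] + s,   # pair a[i-1] with b[j-1]
                          F[i - 1][j] + gap,     # gap in b
                          F[i][j - 1] + gap)     # gap in a
    return F


def pair_scores(a, b, match=1, mismatch=-1, gap=-2):
    """Return (optimal score, {(i, j): best score with a[i] paired to b[j]}).

    Scoring values are illustrative assumptions, not taken from the paper.
    """
    n, m = len(a), len(b)
    F = forward_matrix(a, b, match, mismatch, gap)
    # Backward pass: the forward recursion on the reversed sequences gives
    # R[p][q] = best score for aligning the last p residues of a
    # with the last q residues of b.
    R = forward_matrix(a[::-1], b[::-1], match, mismatch, gap)
    best = {}
    for i in range(n):
        for j in range(m):
            s = match if a[i] == b[j] else mismatch
            # prefix score + forced pair + suffix score
            best[(i, j)] = F[i][j] + s + R[n - i - 1][m - j - 1]
    return F[n][m], best


def suboptimal_pairs(a, b, delta, **scoring):
    """Pairs lying on some alignment scoring at least optimum - delta."""
    opt, best = pair_scores(a, b, **scoring)
    return opt, sorted(p for p, sc in best.items() if sc >= opt - delta)
```

Calling `suboptimal_pairs(a, b, 0)` returns only pairs that lie on optimal alignments; raising `delta` widens the band of points, which gives a rough picture of which regions of the alignment are well determined and which are not.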