Alignments without low-scoring regions

被引:37
作者
Zhang, Z [1 ]
Berman, P [1 ]
Miller, W [1 ]
机构
[1] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
关键词
sequence alignment; Smith-Waterman algorithm; dynamic programming;
D O I
10.1089/cmb.1998.5.197
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Given a strong match between regions of two sequences, how far can the match be meaningfully extended if gaps are allowed in the resulting alignment? The aim is to avoid searching beyond the point that a useful extension of the alignment is likely to be found. Without loss of generality, we can restrict attention to the suffixes of the sequences that follow the strong match, which leads to the following formal problem. Given two sequences and a fixed X > O, align initial portions of the sequences subject to the constraint that no section of the alignment scores below -X, Our results indicate that computing an optimal alignment under this constraint is very expensive. However, less rigorous conditions on the alignment can be guaranteed by quite efficient algorithms. One of these variants has been implemented in a new release of the Blast suite of database search programs.
引用
收藏
页码:197 / 210
页数:14
相关论文
共 13 条
[1]  
ALTSCHUL S, 1997, IN PRESS PROTEINS
[2]   Gapped BLAST and PSI-BLAST: a new generation of protein database search programs [J].
Altschul, SF ;
Madden, TL ;
Schaffer, AA ;
Zhang, JH ;
Zhang, Z ;
Miller, W ;
Lipman, DJ .
NUCLEIC ACIDS RESEARCH, 1997, 25 (17) :3389-3402
[3]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[4]  
Chao K M, 1994, J Comput Biol, V1, P271, DOI 10.1089/cmb.1994.1.271
[5]  
Chao KM, 1997, COMPUT APPL BIOSCI, V13, P75
[6]  
CHAO KM, 1992, COMPUT APPL BIOSCI, V8, P481
[7]   CONSTRAINED SEQUENCE ALIGNMENT [J].
CHAO, KM ;
HARDISON, RC ;
MILLER, W .
BULLETIN OF MATHEMATICAL BIOLOGY, 1993, 55 (03) :503-524
[8]   AN IMPROVED ALGORITHM FOR MATCHING BIOLOGICAL SEQUENCES [J].
GOTOH, O .
JOURNAL OF MOLECULAR BIOLOGY, 1982, 162 (03) :705-708
[9]  
JOHNSON DB, 1982, MATH SYST THEORY, V15, P295
[10]  
MYERS EW, 1989, B MATH BIOL, V51, P5, DOI 10.1007/BF02458834