Alignments without low-scoring regions

被引:37
作者
Zhang, Z [1 ]
Berman, P [1 ]
Miller, W [1 ]
机构
[1] Penn State Univ, Dept Comp Sci & Engn, University Pk, PA 16802 USA
关键词
sequence alignment; Smith-Waterman algorithm; dynamic programming;
D O I
10.1089/cmb.1998.5.197
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Given a strong match between regions of two sequences, how far can the match be meaningfully extended if gaps are allowed in the resulting alignment? The aim is to avoid searching beyond the point that a useful extension of the alignment is likely to be found. Without loss of generality, we can restrict attention to the suffixes of the sequences that follow the strong match, which leads to the following formal problem. Given two sequences and a fixed X > O, align initial portions of the sequences subject to the constraint that no section of the alignment scores below -X, Our results indicate that computing an optimal alignment under this constraint is very expensive. However, less rigorous conditions on the alignment can be guaranteed by quite efficient algorithms. One of these variants has been implemented in a new release of the Blast suite of database search programs.
引用
收藏
页码:197 / 210
页数:14
相关论文
共 13 条
[11]   A GENERAL METHOD APPLICABLE TO SEARCH FOR SIMILARITIES IN AMINO ACID SEQUENCE OF 2 PROTEINS [J].
NEEDLEMAN, SB ;
WUNSCH, CD .
JOURNAL OF MOLECULAR BIOLOGY, 1970, 48 (03) :443-+
[12]  
SPOUGE JL, 1991, COMPUT APPL BIOSCI, V7, P1
[13]   Aligning a DNA sequence with a protein sequence [J].
Zhang, Z ;
Pearson, WR ;
Miller, W .
JOURNAL OF COMPUTATIONAL BIOLOGY, 1997, 4 (03) :339-349