A simple and space-efficient fragment-chaining algorithm for alignment of DNA and protein sequences

被引:32
作者
Morgenstern, B [1 ]
机构
[1] GSF Natl Res Ctr Environm & Hlth, Inst Biomath & Biometry, D-85764 Neuherberg, Germany
关键词
sequence alignment; string algorithm; fragment chaining; dynamic programming; complexity;
D O I
10.1016/S0893-9659(01)00085-4
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
In the segment-based approach to sequence alignment, nucleic acid, and protein sequence alignments are constructed from fragments, i.e., from pairs of ungapped segments of the input sequences. Given a set F of candidate fragments and a weighting function w : F --> R-0(+), the score of an alignment is defined as the sum of weights of the fragments it consists of, and the optimization problem is to find a consistent collection of pairwise disjoint fragments with maximum sum of weights. Herein, a sparse dynamic programming algorithm is described that solves the pairwise segment-alignment problem in O(L + N-max) space where L is the maximum length of the input sequences while N-max less than or equal to #F holds. With a recently introduced weighting function w, small sets F of candidate fragments are sufficient to obtain alignments of high quality, As a result, the proposed algorithm runs in essentially linear space. (C) 2001 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:11 / 16
页数:6
相关论文
共 14 条
[1]  
ABDEDDAIM S, IN PRESS LECT NOTES
[2]   LINEAR-SPACE ALGORITHMS THAT BUILD LOCAL ALIGNMENTS FROM FRAGMENTS [J].
CHAO, KM ;
MILLER, W .
ALGORITHMICA, 1995, 13 (1-2) :106-134
[3]   SPARSE DYNAMIC-PROGRAMMING .1. LINEAR COST-FUNCTIONS [J].
EPPSTEIN, D ;
GALIL, Z ;
GIANCARLO, R ;
ITALIANO, GF .
JOURNAL OF THE ACM, 1992, 39 (03) :519-545
[4]  
HIRSCHBERG DS, 1975, COMMUN ACM, V18, P314
[5]   A polyhedral approach to sequence alignment problems [J].
Kececioglu, JD ;
Lenhof, HP ;
Mehlhorn, K ;
Mutzel, P ;
Reinert, K ;
Vingron, M .
DISCRETE APPLIED MATHEMATICS, 2000, 104 (1-3) :143-186
[6]   An exact solution for the segment-to-segment multiple sequence alignment problem [J].
Lenhof, HP ;
Morgenstern, B ;
Reinert, K .
BIOINFORMATICS, 1999, 15 (03) :203-210
[7]   DIALIGN 2: improvement of the segment-to-segment approach to multiple sequence alignment [J].
Morgenstern, B .
BIOINFORMATICS, 1999, 15 (03) :211-218
[8]   Multiple DNA and protein sequence alignment based on segment-to-segment comparison [J].
Morgenstern, B ;
Dress, A ;
Werner, T .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (22) :12098-12103
[9]  
MORGENSTERN B, BIOINFORMATICS, V16, P948
[10]  
MORGENSTERN B, 1999, 133 U BIEL