A Lagrangian relaxation approach for the multiple sequence alignment problem

Cited by: 6
Authors
Althaus, Ernst [1]
Canzar, Stefan [1]
Affiliations
[1] Max Planck Inst Informat, D-66123 Saarbrucken, Germany
Keywords
sequence comparison; Lagrangian relaxation; branch and bound
DOI
10.1007/s10878-008-9139-z
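Chinese Library Classification number
TP39 [Computer applications]
Discipline classification code
081203 [Computer application technology]; 0835 [Software engineering]
Abstract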
We present a branch-and-bound (bb) algorithm for the multiple sequence alignment problem (MSA), one of the most important problems in computational biology. The upper bound at each bb node is based on a Lagrangian relaxation of an integer linear programming formulation for MSA. Dualizing certain inequalities, the Lagrangian subproblem becomes a pairwise alignment problem, which can be solved efficiently by a dynamic programming approach. Due to a reformulation w.r.t. additionally introduced variables prior to relaxation we improve the convergence rate dramatically while at the same time being able to solve the Lagrangian problem efficiently. Our experiments show that our implementation, although preliminary, outperforms all exact algorithms for the multiple sequence alignment problem. Furthermore, the quality of the alignments is among the best computed so far.
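The abstract notes that, after dualizing, the Lagrangian subproblem reduces to pairwise alignment solvable by dynamic programming. As a rough illustration only, the following is a minimal sketch of a standard global pairwise alignment score (Needleman-Wunsch style) in Python. It is not the authors' implementation: the function name `pairwise_alignment_score`, the scoring parameters, and the linear gap penalty are all assumptions made for this example, and the Lagrangian multiplier terms that would modify the weights in the actual subproblem are omitted.

```python
def pairwise_alignment_score(a, b, match=1, mismatch=-1, gap=-2):
    """Return the maximum global alignment score of strings a and b.

    Hypothetical scoring scheme (not taken from the paper): a fixed reward
    for a match, a fixed penalty for a mismatch, and a linear gap penalty.
    """
    n, m = len(a), len(b)
    # prev[j] holds the best score for aligning a[:i-1] with b[:j].
    prev = [j * gap for j in range(m + 1)]
    for i in range(1, n + 1):
        # Aligning a[:i] against the empty prefix costs i gaps.
        cur = [i * gap] + [0] * m
        for j in range(1, m + 1):
            sub = match if a[i - 1] == b[j - 1] else mismatch
            cur[j] = max(
                prev[j - 1] + sub,  # align a[i-1] with b[j-1]
                prev[j] + gap,      # a[i-1] aligned to a gap
                cur[j - 1] + gap,   # b[j-1] aligned to a gap
            )
        prev = cur
    return prev[m]
```

The sketch keeps only two rows of the table, so it uses O(len(b)) memory and O(len(a) * len(b)) time; recovering the alignment itself, rather than just its score, would require storing traceback information.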
Pages: 127-154
Page count: 28
Related papers
27 in total
[1]
A branch-and-cut algorithm for multiple sequence alignment [J].
Althaus, E ;
Caprara, A ;
Lenhof, HP ;
Reinert, K .
MATHEMATICAL PROGRAMMING, 2006, 105 (2-3) :387-425
[2]
Multiple sequence alignment with arbitrary gap costs: Computing an optimal solution using polyhedral combinatorics [J].
Althaus, E ;
Caprara, A ;
Lenhof, HP ;
Reinert, K .
BIOINFORMATICS, 2002, 18 :S4-S16
[3]
[Anonymous], MODERN HEURISTIC TEC
[4]
A heuristic method for the set covering problem [J].
Caprara, A ;
Fischetti, M ;
Toth, P .
OPERATIONS RESEARCH, 1999, 47 (05) :730-743
[5]
THE MULTIPLE SEQUENCE ALIGNMENT PROBLEM IN BIOLOGY [J].
CARRILLO, H ;
LIPMAN, D .
SIAM JOURNAL ON APPLIED MATHEMATICS, 1988, 48 (05) :1073-1082
[6]
Alignment of whole genomes [J].
Delcher, AL ;
Kasif, S ;
Fleischmann, RD ;
Peterson, J ;
White, O ;
Salzberg, SL .
NUCLEIC ACIDS RESEARCH, 1999, 27 (11) :2369-2376
[7]
MUSCLE: multiple sequence alignment with high accuracy and high throughput [J].
Edgar, RC .
NUCLEIC ACIDS RESEARCH, 2004, 32 (05) :1792-1797
[8]
Elias I, 2003, LECT NOTES COMPUT SC, V2906, P352
[9]
SEQUENCE COMPARISON WITH MIXED CONVEX AND CONCAVE COSTS [J].
EPPSTEIN, D .
JOURNAL OF ALGORITHMS, 1990, 11 (01) :85-101
[10]