A branch-and-cut algorithm for multiple sequence alignment

被引:11
作者
Althaus, E [1 ]
Caprara, A
Lenhof, HP
Reinert, K
机构
[1] Univ Saarland, D-66123 Saarbrucken, Germany
[2] Univ Bologna, DEIS, I-40136 Bologna, Italy
[3] Free Univ Berlin, Inst Comp Sci, D-14195 Berlin, Germany
关键词
Hull; Mathematical Method; Polynomial Time; Convex Hull; Multiple Sequence Alignment;
D O I
10.1007/s10107-005-0659-3
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
We consider a branch-and-cut approach for solving the multiple sequence alignment problem, which is a central problem in computational biology. We propose a general model for this problem in which arbitrary gap costs are allowed. An interesting aspect of our approach is that the three (exponentially large) classes of natural valid inequalities that we consider turn out to be both facet-defining for the convex hull of integer solutions and separable in polynomial time. Both the proofs that these classes of valid inequalities are facet-defining and the description of the separation algorithms are far from trivial. Experimental results on several benchmark instances show that our method outperforms the best tools developed so far, in that it produces alignments that are better from a biological point of view. A noteworthy outcome of the results is the effectiveness of using branch-and-cut with only a carefully-selected subset of the variables as a heuristic.
引用
收藏
页码:387 / 425
页数:39
相关论文
共 26 条
[1]  
Achterberg T., 2004, 0419 ZUS I BERL
[2]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[3]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness
[4]  
[Anonymous], 4OR
[5]  
[Anonymous], 1997, ALGORITHMS STRINGS T, DOI DOI 10.1017/CBO9780511574931
[6]  
BIENSTOCK D, 2002, POTENTIAL FUNCTION M
[7]   Compact vs. exponential-size LP relaxations [J].
Carr, RD ;
Lancia, G .
OPERATIONS RESEARCH LETTERS, 2002, 30 (01) :57-65
[8]   THE MULTIPLE SEQUENCE ALIGNMENT PROBLEM IN BIOLOGY [J].
CARRILLO, H ;
LIPMAN, D .
SIAM JOURNAL ON APPLIED MATHEMATICS, 1988, 48 (05) :1073-1082
[9]  
DAYHOFF MO, 1979, ATLAS PROTEIN SEQ S3, V5, P345
[10]   Alignment of whole genomes [J].
Delcher, AL ;
Kasif, S ;
Fleischmann, RD ;
Peterson, J ;
White, O ;
Salzberg, SL .
NUCLEIC ACIDS RESEARCH, 1999, 27 (11) :2369-2376