An exact solution for the segment-to-segment multiple sequence alignment problem

被引:27
作者
Lenhof, HP
Morgenstern, B
Reinert, K
机构
[1] MPI Informat, D-66123 Saarbrucken, Germany
[2] Natl Res Ctr Environm & Hlth, Inst Biomath & Biometry, D-85764 Neuherberg, Germany
关键词
D O I
10.1093/bioinformatics/15.3.203
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Motivation: In molecular biology, sequence alignment is a crucial tool in studying the structure and function of molecules, as well as the evolution of species, In the segment-to-segment variation of the multiple alignment problem, the input can be seen as a set of non-gapped segment pairs (diagonals). Given a weight function that assigns a weight score to every possible diagonal, the goal is to choose a consistent set of diagonals of maximum weight. We show that the segment-to-segment multiple alignment problem is equivalent to a novel formulation of the Maximum Trace problem: the Generalized Maximum Trace (GMT) problem. Solving this problem to optimality, therefore, may improve upon the previous greedy strategies that are used for solving the segment-to-segment multiple sequence alignment problem. We show that the GMT can be stated in terms of an integer linear program and then solve the integer linear program using methods from polyhedral combinatorics. This leads to a branch-and-cut algorithm for segment-to-segment multiple sequence alignment. Results: We report on our first computational experiences with this novel method and show that the program is able to find optimal solutions for real-world test examples.
引用
收藏
页码:203 / 210
页数:8
相关论文
共 22 条
[1]  
Abdeddaïm S, 1997, LECT NOTES COMPUT SC, V1264, P167
[2]  
ALTSCHUL SF, 1986, B MATH BIOL, V48, P633, DOI 10.1016/S0092-8240(86)90012-1
[3]   BASIC LOCAL ALIGNMENT SEARCH TOOL [J].
ALTSCHUL, SF ;
GISH, W ;
MILLER, W ;
MYERS, EW ;
LIPMAN, DJ .
JOURNAL OF MOLECULAR BIOLOGY, 1990, 215 (03) :403-410
[4]  
[Anonymous], LECT NOTES COMPUTER
[5]   MAXIMUM-LIKELIHOOD ALIGNMENT OF DNA-SEQUENCES [J].
BISHOP, MJ ;
THOMPSON, EA .
JOURNAL OF MOLECULAR BIOLOGY, 1986, 190 (02) :159-165
[6]   A symmetric-iterated multiple alignment of protein sequences [J].
Brocchieri, L ;
Karlin, S .
JOURNAL OF MOLECULAR BIOLOGY, 1998, 276 (01) :249-264
[7]  
JUNGER M, 1997, 97260 U COL I INF
[8]  
Kececioglu JD, 1991, THESIS U ARIZONA
[9]   DETECTING SUBTLE SEQUENCE SIGNALS - A GIBBS SAMPLING STRATEGY FOR MULTIPLE ALIGNMENT [J].
LAWRENCE, CE ;
ALTSCHUL, SF ;
BOGUSKI, MS ;
LIU, JS ;
NEUWALD, AF ;
WOOTTON, JC .
SCIENCE, 1993, 262 (5131) :208-214
[10]  
MCCLURE MA, 1994, MOL BIOL EVOL, V11, P571