A fast algorithm for the computation and enumeration of perfect phylogenies

被引:47
作者
Kannan, S
Warnow, T
机构
[1] Dept. of Comp. and Info. Science, University of Pennsylvania, Philadelphia
关键词
evolutionary trees; perfect phylogeny; combinatorial enumeration; dynamic programming; polynomial delay algorithms;
D O I
10.1137/S0097539794279067
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The perfect phylogeny problem is a classical problem in computational evolutionary biology, in which a set of species/taxa is described by a set of qualitative characters. In recent years, the problem has been shown to be NP-complete in general, while the different fixed parameter versions can each be solved in polynomial time. In particular, Agarwala and Fernandez-Baca have developed an O(2(3r)(nk(3) + k(4))) algorithm for the perfect phylogeny problem for n species defined by k r-state characters [SIAM J. Comput., 23 (1994), pp. 1216-1224]. Since, commonly, the character data are drawn from alignments of molecular sequences, k is the length of the sequences and can thus be very large (in the hundreds or thousands). Thus, it is imperative to develop algorithms which run efficiently for large values of k. In this paper we make additional observations about the structure of the problem and produce an algorithm for the problem that runs in time O(2(2r)k(2)n). We also show how it is possible to efficiently build a structure that implicitly represents the set of all perfect phylogenies and to randomly sample from that set.
引用
收藏
页码:1749 / 1763
页数:15
相关论文
共 19 条
[1]   A POLYNOMIAL-TIME ALGORITHM FOR THE PERFECT PHYLOGENY PROBLEM WHEN THE NUMBER OF CHARACTER STATES IS FIXED [J].
AGARWALA, R ;
FERNANDEZBACA, D .
SIAM JOURNAL ON COMPUTING, 1994, 23 (06) :1216-1224
[2]  
AGARWALA R, 1994, 9451 DIMACS RUTG U
[3]  
[Anonymous], 1993, EFFICIENT ALGORITHMS
[4]   A SIMPLE LINEAR-TIME ALGORITHM FOR TRIANGULATING 3-COLORED GRAPHS [J].
BODLAENDER, H ;
KLOKS, T .
JOURNAL OF ALGORITHMS, 1993, 15 (01) :160-172
[5]  
Buneman P., 1974, Discrete Mathematics, V9, P205, DOI 10.1016/0012-365X(74)90002-8
[6]   COMPUTATIONAL-COMPLEXITY OF INFERRING PHYLOGENIES BY COMPATIBILITY [J].
DAY, WHE ;
SANKOFF, D .
SYSTEMATIC ZOOLOGY, 1986, 35 (02) :224-229
[7]   CONVEX TREE REALIZATIONS OF PARTITIONS [J].
DRESS, A ;
STEEL, M .
APPLIED MATHEMATICS LETTERS, 1992, 5 (03) :3-6
[8]  
ESTABROOK G F, 1975, Mathematical Biosciences, V23, P263, DOI 10.1016/0025-5564(75)90040-1
[9]  
Estabrook G.F., 1972, Annual Rev Ecol Syst, V3, P427, DOI 10.1146/annurev.es.03.110172.002235
[10]   EFFICIENT ALGORITHMS FOR INFERRING EVOLUTIONARY TREES [J].
GUSFIELD, D .
NETWORKS, 1991, 21 (01) :19-28