Maximum agreement subtree in a set of evolutionary trees: Metrics and efficient algorithms

被引:74
作者
Amir, A
Keselman, D
机构
[1] BAR ILAN UNIV,IL-52100 RAMAT GAN,ISRAEL
[2] BAR ILAN UNIV,IL-52100 RAMAT GAN,ISRAEL
关键词
evolutionary trees; maximum agreement subtrees; classification;
D O I
10.1137/S0097539794269461
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The maximum agreement subtree approach is one method of reconciling different evolutionary trees for the same set of species. An agreement subtree enables choosing a subset of the species for whom the restricted subtree is equivalent (under a suitable definition) in all given evolutionary trees. Recently, dynamic programming ideas were used to provide polynomial time algorithms for finding a maximum homeomorphic agreement subtree of two trees. Generalizing these methods to sets of more than two trees yields algorithms that are exponential in the number of trees. Unfortunately, it turns out that in reality one is usually presented with more than two trees, sometimes as many as thousands of trees. In this paper we prove that the maximum homeomorphic agreement subtree problem is NP-complete for three trees with unbounded degrees. We then show an approximation algorithm of time O(kn(5)) for choosing the species that are not in a maximum agreement subtree of a set of k trees. Our approximation is guaranteed to provide a set that is no more than 4 times the optimum solution. While the set of evolutionary trees may be large in practice, the trees usually have very small degrees, typically no larger than three. We develop a new method for finding a maximum agreement subtree of k trees, of which one has degree bounded by d. This new method enables us to find a maximum agreement subtree in time O(kn(d+1) + n(2d)).
引用
收藏
页码:1656 / 1669
页数:14
相关论文
共 18 条
[1]  
[Anonymous], 1979, Computers and Intractablity: A Guide to the Theoryof NP-Completeness
[2]   RECONSTRUCTING THE SHAPE OF A TREE FROM OBSERVED DISSIMILARITY DATA [J].
BANDELT, HJ ;
DRESS, A .
ADVANCES IN APPLIED MATHEMATICS, 1986, 7 (03) :309-343
[3]   OPTIMAL-ALGORITHMS FOR COMPARING TREES WITH LABELED LEAVES [J].
DAY, WHE .
JOURNAL OF CLASSIFICATION, 1985, 2 (01) :7-28
[4]  
Farach M., 1993, Proceedings of the Twenty-Fifth Annual ACM Symposium on the Theory of Computing, P137, DOI 10.1145/167088.167132
[5]  
FARACH M, 1994, PROCEEDINGS OF THE FIFTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P481
[6]  
Farach M., 1995, P 2 EUR S ALG, P381
[7]   OBTAINING COMMON PRUNED TREES [J].
FINDEN, CR ;
GORDON, AD .
JOURNAL OF CLASSIFICATION, 1985, 2 (2-3) :255-276
[8]  
GODDARD W, 1993, AGREEMENT SUBTREES M
[9]  
GORDON AD, 1979, BIOMETRIKA, V66, P7, DOI 10.1093/biomet/66.1.7
[10]  
Gordon AD., 1980, Analyse de donnees et informatique, P149