Fast calculation of the quartet distance between trees of arbitrary degrees

被引:14
作者
Christiansen, Chris [1 ]
Mailund, Thomas [2 ]
Pedersen, Christian N. S. [1 ,3 ]
Randers, Martin [1 ]
Stissing, Martin Stig [1 ]
机构
[1] Univ Aarhus, Dept Comp Sci, DK-8200 Aarhus N, Denmark
[2] Univ Oxford, Dept Stat, Oxford OX1 3TG, England
[3] Univ Aarhus, Bioinformat Res Ctr, DK-8000 Aarhus C, Denmark
关键词
Binary Tree; Internal Node; Random Tree; Input Tree; Arbitrary Degree;
D O I
10.1186/1748-7188-1-16
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: A number of algorithms have been developed for calculating the quartet distance between two evolutionary trees on the same set of species. The quartet distance is the number of quartets - sub-trees induced by four leaves - that differs between the trees. Mostly, these algorithms are restricted to work on binary trees, but recently we have developed algorithms that work on trees of arbitrary degree. Results: We present a fast algorithm for computing the quartet distance between trees of arbitrary degree. Given input trees T and T', the algorithm runs in time O(n + vertical bar V vertical bar.vertical bar V'vertical bar min{id, id'}) and space O(n + vertical bar V vertical bar.vertical bar V'vertical bar), where n is the number of leaves in the two trees, V and V are the non-leaf nodes in T and T', respectively, and id and id' are the maximal number of non- leaf nodes adjacent to a non- leaf node in T and T', respectively. The fastest algorithms previously published for arbitrary degree trees run in O(n(3)) (independent of the degree of the tree) and O(vertical bar V vertical bar.|V'vertical bar.id.id'), respectively. We experimentally compare the algorithm with existing algorithms for computing the quartet distance for general trees. Conclusion: We present a new algorithm for computing the quartet distance between two trees of arbitrary degree. The new algorithm improves the asymptotic running time for computing the quartet distance, compared to previous methods, and experimental results indicate that the new method also performs significantly better in practice.
引用
收藏
页数:13
相关论文
共 12 条
[1]  
Allen B. L., 2001, ANN COMB, V5, P1
[2]   RBT - a tool for building refined Buneman trees [J].
Besenbacher, S ;
Mailund, T ;
Westh-Nielsen, L ;
Pedersen, CNS .
BIOINFORMATICS, 2005, 21 (08) :1711-1712
[3]   Computing the quartet distance between evolutionary trees in time O(n log n) [J].
Brodal, GS ;
Fagerberg, R ;
Pedersen, CNS .
ALGORITHMICA, 2004, 38 (02) :377-395
[4]  
Bryant D, 2000, PROCEEDINGS OF THE ELEVENTH ANNUAL ACM-SIAM SYMPOSIUM ON DISCRETE ALGORITHMS, P285
[5]  
Christiansen C, 2005, LECT NOTES COMPUT SC, V3692, P77
[6]  
DOUCETTE CR, THESIS MEMORIAL U NE
[7]   COMPARISON OF UNDIRECTED PHYLOGENETIC TREES BASED ON SUBTREES OF 4 EVOLUTIONARY UNITS [J].
ESTABROOK, GF ;
MCMORRIS, FR ;
MEACHAM, CA .
SYSTEMATIC ZOOLOGY, 1985, 34 (02) :193-200
[8]  
Felsenstein Joseph, 2004, Inferring_phylogenies, V2
[9]  
Robinson D., 1979, LECT NOTES MATH, V748, P119, DOI DOI 10.1007/BFB0102678
[10]   COMPARISON OF PHYLOGENETIC TREES [J].
ROBINSON, DF ;
FOULDS, LR .
MATHEMATICAL BIOSCIENCES, 1981, 53 (1-2) :131-147