TreeCmp: Comparison of Trees in Polynomial Time

被引:79
作者
Bogdanowicz, Damian [2 ]
Giaro, Krzysztof [2 ]
Wrobel, Borys [1 ,3 ]
机构
[1] Polish Acad Sci, Inst Oceanol, Syst Modelling Lab, Sopot, Poland
[2] Gdansk Univ Technol, Fac Elect Telecommun & Informat, Dept Algorithms & Syst Modelling, Gdansk, Poland
[3] Adam Mickiewicz Univ, Evolutionary Syst Lab, Poznan, Poland
来源
EVOLUTIONARY BIOINFORMATICS | 2012年 / 8卷
关键词
phylogenetics; tree metrics; tree comparison; Matching Split metric; Matching Cluster metric; MAXIMUM-LIKELIHOOD PHYLOGENIES; SCALING ALGORITHMS; QUARTET DISTANCE; ASSIGNMENT; DISTRIBUTIONS; EVOLUTION; DENSE;
D O I
10.4137/EBO.S9657
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
When a phylogenetic reconstruction does not result in one tree but in several, tree metrics permit finding out how far the reconstructed trees are from one another. They also permit to assess the accuracy of a reconstruction if a true tree is known. TreeCmp implements eight metrics that can be calculated in polynomial time for arbitrary (not only bifurcating) trees: four for unrooted (Matching Split metric, which we have recently proposed, Robinson-Foulds, Path Difference, Quartet) and four for rooted trees (Matching Cluster, Robinson-Foulds cluster, Nodal Splitted and Triple). TreeCmp is the first implementation of Matching Split/Cluster metrics and the first efficient and convenient implementation of Nodal Splitted. It allows to compare relatively large trees. We provide an example of the application of TreeCmp to compare the accuracy of ten approaches to phylogenetic reconstruction with trees up to 5000 external nodes, using a measure of accuracy based on normalized similarity between trees.
引用
收藏
页码:475 / 487
页数:13
相关论文
共 36 条
[1]  
Allen B. L., 2001, Annals of Combinatorics, V5, P1, DOI [10.1007/s00026-001-8006-8, DOI 10.1007/S00026-001-8006-8]
[2]  
[Anonymous], 2005, PHYLIP (phylogeny inference package) version 3.6
[3]   Comparing and aggregating partially resolved trees [J].
Bansal, Mukul S. ;
Dong, Jianrong ;
Fernandez-Baca, David .
THEORETICAL COMPUTER SCIENCE, 2011, 412 (48) :6634-6652
[4]  
Bogdanowicz D, 2012, IEEE ACM T COMPUT BI, V9, P150, DOI [10.1109/TCBB.2011.38, 10.1109/TCBB.2011.48]
[5]   Computing the quartet distance between evolutionary trees in time O(n log n) [J].
Brodal, GS ;
Fagerberg, R ;
Pedersen, CNS .
ALGORITHMICA, 2004, 38 (02) :377-395
[6]   Nodal distances for rooted phylogenetic trees [J].
Cardona, Gabriel ;
Llabres, Merce ;
Rossello, Francesc ;
Valiente, Gabriel .
JOURNAL OF MATHEMATICAL BIOLOGY, 2010, 61 (02) :253-276
[7]   Fast calculation of the quartet distance between trees of arbitrary degrees [J].
Christiansen, Chris ;
Mailund, Thomas ;
Pedersen, Christian N. S. ;
Randers, Martin ;
Stissing, Martin Stig .
ALGORITHMS FOR MOLECULAR BIOLOGY, 2006, 1 (1)
[8]   The triples distance for rooted bifurcating phylogenetic trees [J].
Critchlow, DE ;
Pearl, DK ;
Qian, CL .
SYSTEMATIC BIOLOGY, 1996, 45 (03) :323-334
[9]  
Dasgupta B., 1997, PROC DIMACS WORKSHOP, P125
[10]   OPTIMAL-ALGORITHMS FOR COMPARING TREES WITH LABELED LEAVES [J].
DAY, WHE .
JOURNAL OF CLASSIFICATION, 1985, 2 (01) :7-28