Consequences of Common Topological Rearrangements for Partition Trees in Phylogenomic Inference

Cited by: 12
Authors
Chernomor, Olga [1, 2]
Bui Quang Minh [1]
von Haeseler, Arndt [1, 2]
Affiliations
[1] Univ Vienna, Ctr Integrat Bioinformat Vienna, Max F Perutz Labs, Vienna, Austria
[2] Univ Vienna, Fac Comp Sci, Bioinformat & Computat Biol, Vienna, Austria
Funding
Austrian Science Fund
Keywords
nearest neighbor interchange; partial terraces; phylogenetic terraces; subtree pruning and regrafting; tree bisection and reconnection; MAXIMUM-LIKELIHOOD PHYLOGENIES; SUPERMATRIX APPROACH; SEQUENCES; TERRACES; RATES; TIME; LIFE;
DOI
10.1089/cmb.2015.0146
Chinese Library Classification number
Q5 [Biochemistry]
Discipline classification codes
071010; 081704
Abstract
In phylogenomic analysis, collections of trees with identical score (maximum likelihood or parsimony score) may hamper tree search algorithms. Such collections are called phylogenetic terraces. For sparse supermatrices with a lot of missing data, the number of terraces and the number of trees on the terraces can be very large. If terraces are not taken into account, considerable computation time may be spent unnecessarily on evaluating many trees that in fact have identical scores. To save computation time during the tree search, it is worthwhile to identify such cases quickly. The score of a species tree is the sum of the scores of all its so-called induced partition trees. Therefore, if a topological rearrangement applied to a species tree does not change the induced partition trees, the scores of these partition trees are unchanged. Here, we provide the conditions under which the three most widely used topological rearrangements (nearest neighbor interchange, subtree pruning and regrafting, and tree bisection and reconnection) change the topologies of induced partition trees. During the tree search, these conditions allow us to quickly identify whether we can save computation time on the evaluation of newly encountered trees. We also introduce the concept of partial terraces and demonstrate that they occur more frequently than the original full terraces. Hence, partial terraces are a more important source of time savings than full terraces. Taking into account the above conditions and the partial terrace concept will therefore help to speed up the tree search in phylogenomic inference.
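The idea that a rearrangement may leave some induced partition trees untouched can be illustrated with a small sketch. It is not the set of conditions derived in the paper; it is a brute-force comparison under stated assumptions: an unrooted species tree is represented by one side of each of its non-trivial splits, an induced partition tree is obtained by restricting those splits to the taxa present in the partition, and the function names, taxa, and partitions (`p1`, `p2`) are all hypothetical.

```python
def induced_splits(splits, taxa):
    """Non-trivial splits of the tree induced on `taxa`.

    `splits` holds one side of every internal-edge split of the species
    tree; the other side is implied. `taxa` must be a subset of the
    species tree's taxa. (Illustrative representation, assumed here.)
    """
    taxa = frozenset(taxa)
    induced = set()
    for side in splits:
        a = frozenset(side) & taxa   # one side, restricted to the partition
        b = taxa - a                 # the complementary side
        if len(a) >= 2 and len(b) >= 2:   # drop splits that became trivial
            induced.add(frozenset((a, b)))
    return induced


def unchanged_partitions(tree_before, tree_after, partitions):
    """Names of partitions whose induced tree is identical in both trees."""
    return [
        name
        for name, taxa in partitions.items()
        if induced_splits(tree_before, taxa) == induced_splits(tree_after, taxa)
    ]


# Species tree ((A,B),C,(D,(E,F))) and the tree obtained from it by an NNI
# that swaps C and D across the central edge.
before = [{"A", "B"}, {"A", "B", "C"}, {"E", "F"}]
after = [{"A", "B"}, {"A", "B", "D"}, {"E", "F"}]

# Hypothetical taxon coverage: D is missing from p1; B and F from p2.
partitions = {
    "p1": {"A", "B", "C", "E", "F"},
    "p2": {"A", "C", "D", "E"},
}

reusable = unchanged_partitions(before, after, partitions)
print(reusable)  # ['p1']
```

In this toy case the score of `p1` could be reused after the NNI, while `p2` would have to be re-evaluated because its induced tree changes from AC|DE to AD|CE. The conditions described in the abstract aim to answer the same question without rebuilding and comparing the induced trees.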
Pages: 1129-1142
Page count: 14