IQ-TREE: A Fast and Effective Stochastic Algorithm for Estimating Maximum-Likelihood Phylogenies

被引:17967
作者
Lam-Tung Nguyen [1 ,2 ]
Schmidt, Heiko A. [1 ]
von Haeseler, Arndt [1 ,2 ]
Bui Quang Minh [1 ]
机构
[1] Med Univ Vienna, Univ Vienna, Max F Perutz Labs, Ctr Integrat Bioinformat Vienna, Vienna, Austria
[2] Univ Vienna, Fac Comp Sci, Vienna, Austria
基金
奥地利科学基金会;
关键词
phylogenetic inference; phylogeny; maximum likelihood; stochastic algorithm; EVOLUTIONARY TREES; GENETIC ALGORITHM; DNA-SEQUENCES; EFFICIENCY; SPACE; MODEL;
D O I
10.1093/molbev/msu300
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Large phylogenomics data sets require fast tree inference methods, especially for maximum-likelihood (ML) phylogenies. Fast programs exist, but due to inherent heuristics to find optimal trees, it is not clear whether the best tree is found. Thus, there is need for additional approaches that employ different search strategies to find ML trees and that are at the same time as fast as currently available ML programs. We show that a combination of hill-climbing approaches and a stochastic perturbation method can be time-efficiently implemented. If we allow the same CPU time as RAxML and PhyML, then our software IQ-TREE found higher likelihoods between 62.2% and 87.1% of the studied alignments, thus efficiently exploring the tree-space. If we use the IQ-TREE stopping rule, RAxML and PhyML are faster in 75.7% and 47.1% of the DNA alignments and 42.2% and 100% of the protein alignments, respectively. However, the range of obtaining higher likelihoods with IQ-TREE improves to 73.3-97.1%. IQ-TREE is freely available at http://www.cibiv.at/software/iqtree.
引用
收藏
页码:268 / 274
页数:7
相关论文
共 30 条
[21]   RAxML-VI-HPC: Maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models [J].
Stamatakis, Alexandros .
BIOINFORMATICS, 2006, 22 (21) :2688-2690
[22]  
Swofford D., 1993, PAUP: Phylogenetic Analysis Using Parsimony
[23]  
Swofford David L., 1990, P411
[24]   IQPNNI: Moving fast through tree space and stopping in time [J].
Vinh, LS ;
von Haeseler, A .
MOLECULAR BIOLOGY AND EVOLUTION, 2004, 21 (08) :1565-1571
[25]   Accelerated likelihood surface exploration: The likelihood ratchet [J].
Vos, RA .
SYSTEMATIC BIOLOGY, 2003, 52 (03) :368-373
[26]   A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach [J].
Whelan, S ;
Goldman, N .
MOLECULAR BIOLOGY AND EVOLUTION, 2001, 18 (05) :691-699
[27]   The Prevalence of Multifurcations in Tree-space and Their Implications for Tree-search [J].
Whelan, Simon ;
Money, Daniel .
MOLECULAR BIOLOGY AND EVOLUTION, 2010, 27 (12) :2674-2677
[28]   Maximum likelihood estimation on large phylogenies and analysis of adaptive evolution in human influenza virus A [J].
Yang, ZH .
JOURNAL OF MOLECULAR EVOLUTION, 2000, 51 (05) :423-432
[29]   MAXIMUM-LIKELIHOOD PHYLOGENETIC ESTIMATION FROM DNA-SEQUENCES WITH VARIABLE RATES OVER SITES - APPROXIMATE METHODS [J].
YANG, ZH .
JOURNAL OF MOLECULAR EVOLUTION, 1994, 39 (03) :306-314
[30]  
Zwickl D.J., 2006, GARLI GENETIC ALGORI