Ultrafast Approximation for Phylogenetic Bootstrap

被引:3470
作者
Bui Quang Minh [1 ]
Minh Anh Thi Nguyen [2 ]
von Haeseler, Arndt [1 ]
机构
[1] Med Univ Vienna, Univ Vienna, Max F Perutz Labs, Ctr Integrat Bioinformat Vienna, Vienna, Austria
[2] Univ Groningen, Groningen Bioinformat Ctr, Groningen, Netherlands
基金
奥地利科学基金会;
关键词
phylogenetic inference; nonparametric bootstrap; tree reconstruction; maximum likelihood; DNA-SEQUENCES; TREE-SPACE; MODEL; EVOLUTION; INFERENCE; PROTEIN; PERFORMANCE; SATURATION; CONFIDENCE; ALGORITHM;
D O I
10.1093/molbev/mst024
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Nonparametric bootstrap has been a widely used tool in phylogenetic analysis to assess the clade support of phylogenetic trees. However, with the rapidly growing amount of data, this task remains a computational bottleneck. Recently, approximation methods such as the RAxML rapid bootstrap (RBS) and the Shimodaira-Hasegawa-like approximate likelihood ratio test have been introduced to speed up the bootstrap. Here, we suggest an ultrafast bootstrap approximation approach (UFBoot) to compute the support of phylogenetic groups in maximum likelihood (ML) based trees. To achieve this, we combine the resampling estimated log-likelihood method with a simple but effective collection scheme of candidate trees. We also propose a stopping rule that assesses the convergence of branch support values to automatically determine when to stop collecting candidate trees. UFBoot achieves a median speed up of 3.1 (range: 0.66-33.3) to 10.2 (range: 1.32-41.4) compared with RAxML RBS for real DNA and amino acid alignments, respectively. Moreover, our extensive simulations show that UFBoot is robust against moderate model violations and the support values obtained appear to be relatively unbiased compared with the conservative standard bootstrap. This provides a more direct interpretation of the bootstrap support. We offer an efficient and easy-to-use software (available at http://www.cibiv.at/software/iqtree) to perform the UFBoot analysis with ML tree inference.
引用
收藏
页码:1188 / 1195
页数:8
相关论文
共 51 条
[1]  
Adachi J, 1996, MOLPHY VERSION 2 3 P
[2]   Approximate likelihood-ratio test for branches: A fast, accurate, and powerful alternative [J].
Anisimova, Maria ;
Gascuel, Olivier .
SYSTEMATIC BIOLOGY, 2006, 55 (04) :539-552
[3]   Survey of Branch Support Methods Demonstrates Accuracy, Power, and Robustness of Fast Likelihood-based Approximation Schemes [J].
Anisimova, Maria ;
Gil, Manuel ;
Dufayard, Jean-Francois ;
Dessimoz, Christophe ;
Gascuel, Olivier .
SYSTEMATIC BIOLOGY, 2011, 60 (05) :685-699
[4]  
[Anonymous], 2006, THESIS U TEXAS AUSTI
[5]   ProtTest 3: fast selection of best-fit models of protein evolution [J].
Darriba, Diego ;
Taboada, Guillermo L. ;
Doallo, Ramon ;
Posada, David .
BIOINFORMATICS, 2011, 27 (08) :1164-1165
[6]   Comparison of Bayesian and maximum likelihood bootstrap measures of phylogenetic reliability [J].
Douady, CJ ;
Delsuc, F ;
Boucher, Y ;
Doolittle, WF ;
Douzery, EJP .
MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (02) :248-254
[7]   Bayesian Phylogenetics with BEAUti and the BEAST 1.7 [J].
Drummond, Alexei J. ;
Suchard, Marc A. ;
Xie, Dong ;
Rambaut, Andrew .
MOLECULAR BIOLOGY AND EVOLUTION, 2012, 29 (08) :1969-1973
[8]   Bootstrap confidence levels for phylogenetic trees (vol 93, pg 7085, 1996) [J].
Efron, B ;
Halloran, E ;
Holmes, S .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1996, 93 (23) :13429-13434
[9]   1977 RIETZ LECTURE - BOOTSTRAP METHODS - ANOTHER LOOK AT THE JACKKNIFE [J].
EFRON, B .
ANNALS OF STATISTICS, 1979, 7 (01) :1-26
[10]   IS THERE SOMETHING WRONG WITH THE BOOTSTRAP ON PHYLOGENIES - A REPLY [J].
FELSENSTEIN, J ;
KISHINO, H .
SYSTEMATIC BIOLOGY, 1993, 42 (02) :193-200