Comparative performance of Bayesian and AIC-based measures of phylogenetic model uncertainty

被引:68
作者
Alfaro, ME
Huelsenbeck, JP
机构
[1] Sch Biol Sci, Pullman, WA 99164 USA
[2] Sect Ecol Evolut & Behav, La Jolla, CA 92093 USA
关键词
AIC; akaike weights; Bayesian phylogenetics; model averaging; model selection; model uncertainty; posterior probability; reversible jump;
D O I
10.1080/10635150500433565
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Reversible-jump Markov chain Monte Carlo (RJ-MCMC) is a technique for simultaneously evaluating multiple related (but not necessarily nested) statistical models that has recently been applied to the problem of phylogenetic model selection. Here we use a simulation approach to assess the performance of this method and compare it to Akaike weights, a measure of model uncertainty that is based on the Akaike information criterion. Under conditions where the assumptions of the candidate models matched the generating conditions, both Bayesian and AIC-based methods perform well. The 95% credible interval contained the generating model close to 95% of the time. However, the size of the credible interval differed with the Bayesian credible set containing approximately 25% to 50% fewer models than an AIC-based credible interval. The posterior probability was a better indicator of the correct model than the Akaike weight when all assumptions were met but both measures performed similarly when some model assumptions were violated. Models in the Bayesian posterior distribution were also more similar to the generating model in their number of parameters and were less biased in their complexity. In contrast, Akaike-weighted models were more distant from the generating model and biased towards slightly greater complexity. The AIC-based credible interval appeared to be more robust to the violation of the rate homogeneity assumption. Both AIC and Bayesian approaches suggest that substantial uncertainty can accompany the choice of model for phylogenetic analyses, suggesting that alternative candidate models should be examined in analysis of phylogenetic data.
引用
收藏
页码:89 / 96
页数:8
相关论文
共 34 条
[1]   Accounting for uncertainty in the tree topology has little effect on the decision-theoretic approach to model selection in phylogeny estimation [J].
Abdo, Z ;
Minin, VN ;
Joyce, P ;
Sullivan, J .
MOLECULAR BIOLOGY AND EVOLUTION, 2005, 22 (03) :691-703
[2]  
Akaike H., 1973, 2 INT S INFORM THEOR, P267, DOI [DOI 10.1007/978-1-4612-1694-0_15, 10.1007/978-1-4612-1694-0_15]
[3]   Exploring among-site rate variation models in a maximum likelihood framework using empirical data: Effects of model assumptions on estimates of topology, branch lengths, and bootstrap support [J].
Buckley, TR ;
Simon, C ;
Chambers, GK .
SYSTEMATIC BIOLOGY, 2001, 50 (01) :67-86
[4]   The effects of nucleotide substitution model assumptions on estimates of nonparametric bootstrap support [J].
Buckley, TR ;
Cunningham, CW .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (04) :394-405
[5]   Model misspecification and probabilistic tests of topology: Evidence from empirical data sets [J].
Buckley, TR .
SYSTEMATIC BIOLOGY, 2002, 51 (03) :509-523
[6]   Combined data, Bayesian phylogenetics, and the origin of the New Zealand cicada genera [J].
Buckley, TR ;
Arensburger, P ;
Simon, C ;
Chambers, GK .
SYSTEMATIC BIOLOGY, 2002, 51 (01) :4-18
[7]  
BURNHAM K.P., 2002, MODEL SELECTION MULT, P352, DOI DOI 10.1007/B97636
[8]   SUCCESS OF MAXIMUM-LIKELIHOOD PHYLOGENY INFERENCE IN THE 4-TAXON CASE [J].
GAUT, BS ;
LEWIS, PO .
MOLECULAR BIOLOGY AND EVOLUTION, 1995, 12 (01) :152-162
[9]   STATISTICAL TESTS OF MODELS OF DNA SUBSTITUTION [J].
GOLDMAN, N .
JOURNAL OF MOLECULAR EVOLUTION, 1993, 36 (02) :182-198
[10]   Partition-distance: A problem and class of perfect graphs arising in clustering [J].
Gusfield, D .
INFORMATION PROCESSING LETTERS, 2002, 82 (03) :159-164