Computing Bayes factors using thermodynamic integration

被引:494
作者
Lartillot, N
Philippe, H
机构
[1] Univ Montpellier 2, CNRS, Lab Informat Robot & Microelect Montpellier, UMR 5506, F-34392 Montpellier 5, France
[2] Univ Montreal, Dept Biochim, Canadian Inst Adv Res, Montreal, PQ H3C 3J7, Canada
关键词
Bayes factor; harmonic mean; mixture model; path sampling; phylogeny; thermodynamic integration;
D O I
10.1080/10635150500433722
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
In the Bayesian paradigm, a common method for comparing two models is to compute the Bayes factor, defined as the ratio of their respective marginal likelihoods. In recent phylogenetic works, the numerical evaluation of marginal likelihoods has often been performed using the harmonic mean estimation procedure. In the present article, we propose to employ another method, based on an analogy with statistical physics, called thermodynamic integration. We describe the method, propose an implementation, and show on two analytical examples that this numerical method yields reliable estimates. In contrast, the harmonic mean estimator leads to a strong overestimation of the marginal likelihood, which is all the more pronounced as the model is higher dimensional. As a result, the harmonic mean estimator systematically favors more parameter-rich models, an artefact that might explain some recent puzzling observations, based on harmonic mean estimates, suggesting that Bayes factors tend to overscore complex models. Finally, we apply our method to the comparison of several alternative models of amino-acid replacement. We confirm our previous observations, indicating that modeling pattern heterogeneity across sites tends to yield better models than standard empirical matrices.
引用
收藏
页码:195 / 207
页数:13
相关论文
共 57 条
[11]   Marginal likelihood from the Metropolis-Hastings output [J].
Chib, S ;
Jeliazkov, I .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (453) :270-281
[12]   EVOLUTIONARY TREES FROM DNA-SEQUENCES - A MAXIMUM-LIKELIHOOD APPROACH [J].
FELSENSTEIN, J .
JOURNAL OF MOLECULAR EVOLUTION, 1981, 17 (06) :368-376
[13]   SUCCESS OF MAXIMUM-LIKELIHOOD PHYLOGENY INFERENCE IN THE 4-TAXON CASE [J].
GAUT, BS ;
LEWIS, PO .
MOLECULAR BIOLOGY AND EVOLUTION, 1995, 12 (01) :152-162
[14]  
Gelman A, 1998, STAT SCI, V13, P163
[15]  
Gelman A, 1996, STAT SINICA, V6, P733
[16]  
Geyer CJ, 1992, STAT SCI, V7, P473, DOI [DOI 10.1214/SS/1177011137, 10.1214/ss/1177011137]
[17]   Reversible jump Markov chain Monte Carlo computation and Bayesian model determination [J].
Green, PJ .
BIOMETRIKA, 1995, 82 (04) :711-732
[18]  
HAN C, 2000, BIOMETRIKA, V82, P711
[19]   Phylogeny estimation: Traditional and Bayesian approaches [J].
Holder, M ;
Lewis, PO .
NATURE REVIEWS GENETICS, 2003, 4 (04) :275-284
[20]   Potential applications and pitfalls of Bayesian inference of phylogeny [J].
Huelsenbeck, JP ;
Larget, B ;
Miller, RE ;
Ronquist, F .
SYSTEMATIC BIOLOGY, 2002, 51 (05) :673-688