Comparing bootstrap and posterior probability values in the four-taxon case

Cited by: 243
Authors
Cummings, MP
Handley, SA
Myers, DS
Reed, DL
Rokas, A
Winka, K
Affiliations
[1] Univ Maryland, Ctr Bioinformat & Computat Biol, College Pk, MD 20742 USA
[2] Washington Univ, Sch Med, Dept Mol Microbiol, St Louis, MO 63110 USA
[3] Pomona Coll, Claremont, CA 91711 USA
[4] Univ Utah, Dept Biol, Salt Lake City, UT 84112 USA
[5] Univ Wisconsin, Howard Hughes Med Inst, Madison, WI 53706 USA
[6] Univ Wisconsin, Mol Biol Lab, Madison, WI 53706 USA
[7] Umea Univ, Dept Ecol & Environm Sci, SE-90187 Umea, Sweden
Funding
National Aeronautics and Space Administration (NASA); National Science Foundation (NSF);
Keywords
Bayesian analysis; Markov chain Monte Carlo sampling; maximum likelihood; phylogenetics; LIKELIHOOD PHYLOGENETIC ESTIMATION; DNA-SEQUENCE DATA; BAYESIAN PHYLOGENETICS; BASE SUBSTITUTIONS; EVOLUTIONARY RATES; MITOCHONDRIAL-DNA; INFERENCE; CONFIDENCE; TREES; SYSTEMATICS;
DOI
10.1080/10635150390218213
Chinese Library Classification (CLC) number
Q [Biological Sciences];
Discipline classification code
07 ; 0710 ; 09 ;
Abstract
Assessment of the reliability of a given phylogenetic hypothesis is an important step in phylogenetic analysis. Historically, the nonparametric bootstrap procedure has been the most frequently used method for assessing the support for specific phylogenetic relationships. The recent employment of Bayesian methods for phylogenetic inference problems has resulted in clade support being expressed in terms of posterior probabilities. We used simulated data and the four-taxon case to explore the relationship between nonparametric bootstrap values (as inferred by maximum likelihood) and posterior probabilities (as inferred by Bayesian analysis). The results suggest a complex association between the two measures. Three general regions of tree space can be identified: (1) the neutral zone, where differences between mean bootstrap and mean posterior probability values are not significant, (2) near the two-branch corner, and (3) deep in the two-branch corner. In the last two regions, significant differences occur between mean bootstrap and mean posterior probability values. Whether bootstrap or posterior probability values are higher depends on the data in support of alternative topologies. Examination of star topologies revealed that both bootstrap and posterior probability values differ significantly from theoretical expectations; in particular, there are more posterior probability values in the range 0.85-1 than expected by theory. Therefore, our results corroborate the findings of others that posterior probability values are excessively high. Our results also suggest that extrapolations from single topology branch-length studies are unlikely to provide any general conclusions regarding the relationship between bootstrap and posterior probability values.
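To make the nonparametric bootstrap mentioned in the abstract concrete, here is a minimal Python sketch for the four-taxon case. It is not the authors' procedure: the study inferred trees by maximum likelihood, whereas this sketch substitutes a simple count of parsimony-informative sites as the tree-selection step so that it stays self-contained. The names `TOPOLOGIES`, `informative_counts`, `best_topologies`, and `bootstrap_support` are hypothetical, as are the default of 1000 replicates and the rule that tied topologies share a replicate evenly.

```python
import random

# The three possible unrooted topologies for taxa A, B, C, D.
TOPOLOGIES = ("AB|CD", "AC|BD", "AD|BC")


def informative_counts(columns):
    """Count alignment columns whose site pattern supports each topology.

    Each column is a 4-tuple of character states for taxa (A, B, C, D).
    Only two-state patterns of the form xxyy, xyxy, and xyyx are informative.
    """
    counts = {t: 0 for t in TOPOLOGIES}
    for a, b, c, d in columns:
        if a == b and c == d and a != c:
            counts["AB|CD"] += 1
        elif a == c and b == d and a != b:
            counts["AC|BD"] += 1
        elif a == d and b == c and a != b:
            counts["AD|BC"] += 1
    return counts


def best_topologies(columns):
    """Return every topology tied for the most supporting columns."""
    counts = informative_counts(columns)
    top = max(counts.values())
    return [t for t in TOPOLOGIES if counts[t] == top]


def bootstrap_support(columns, replicates=1000, seed=0):
    """Estimate bootstrap proportions by resampling columns with replacement.

    Assumption: ties between topologies are split evenly within a replicate.
    """
    rng = random.Random(seed)
    n = len(columns)
    support = {t: 0.0 for t in TOPOLOGIES}
    for _ in range(replicates):
        # One pseudo-replicate: draw n columns with replacement.
        resampled = [columns[rng.randrange(n)] for _ in range(n)]
        winners = best_topologies(resampled)
        for t in winners:
            support[t] += 1.0 / len(winners)
    return {t: s / replicates for t, s in support.items()}
```

An alignment of four equal-length sequences can be passed as `list(zip(seq_a, seq_b, seq_c, seq_d))`. When every column carries the same informative pattern, the favored topology receives a bootstrap proportion of 1.0. When no column is informative, the three topologies tie in every replicate and each receives about one third, which is the expectation for a star topology under this tie-splitting assumption.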
Pages: 477-487
Number of pages: 11