Model misspecification and probabilistic tests of topology: Evidence from empirical data sets

被引:186
作者
Buckley, TR
机构
[1] Landcare Res, Auckland, New Zealand
[2] Duke Univ, Dept Biol, Durham, NC USA
关键词
Bayesian statistics; Markov chain Monte Carlo; maximum likelihood; nucleotide substitution models; parametric bootstrapping; SH test; SOWH test; statistical tests;
D O I
10.1080/10635150290069922
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Probabilistic tests of topology offer a powerful means of evaluating competing phylogenetic hypotheses. The performance of the nonparametric Shimodaira-Hasegawa (SH) test, the parametric Swofford-Olsen-Waddell-Hillis (SOWH) test, and Bayesian posterior probabilities were explored for five data sets for which all the phylogenetic relationships are known with a very high degree of certainty. These results are consistent with previous simulation studies that have indicated a tendency for the SOWH test to be prone to generating Type 1 errors because of model misspecification coupled with branch length heterogeneity. These results also suggest that the SOWH test may accord overconfidence in the true topology when the null hypothesis is in fact correct. In contrast, the SH test was observed to be much more conservative, even under high substitution rates and branch length heterogeneity. For some of those data sets where the SOWH test proved misleading, the Bayesian posterior probabilities were also misleading. The results of all tests were strongly influenced by the exact substitution model assumptions. Simple models, especially those that assume rate homogeneity among sites, had a higher Type 1 error rate and were more likely to generate misleading posterior probabilities. For some of these data sets, the commonly used substitution models appear to be inadequate for estimating appropriate levels of uncertainty with the SOWH test and Bayesian methods. Reasons for the differences in statistical power between the two maximum likelihood tests are discussed and are contrasted with the Bayesian approach.
引用
收藏
页码:509 / 523
页数:15
相关论文
共 84 条
[1]   Testing the hypothesis of a recombinant origin of human immunodeficiency virus type 1 subtype E [J].
Anderson, JP ;
Rodrigo, AG ;
Learn, GH ;
Madan, A ;
Delahunty, C ;
Coon, M ;
Girard, M ;
Osmanov, S ;
Hood, L ;
Mullins, JI .
JOURNAL OF VIROLOGY, 2000, 74 (22) :10752-10765
[2]  
[Anonymous], P R SOC LOND B
[3]   Gene translocation links insects and crustaceans [J].
Boore, JL ;
Lavrov, DV ;
Brown, WM .
NATURE, 1998, 392 (6677) :667-668
[4]   Evaluating hypotheses on the origin and evolution of the New Zealand alpine cicadas (maoricicada) using multiple-comparison tests of tree topology [J].
Buckley, TR ;
Simon, C ;
Shimodaira, H ;
Chambers, GK .
MOLECULAR BIOLOGY AND EVOLUTION, 2001, 18 (02) :223-234
[5]   The effects of nucleotide substitution model assumptions on estimates of nonparametric bootstrap support [J].
Buckley, TR ;
Cunningham, CW .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (04) :394-405
[6]   Combined data, Bayesian phylogenetics, and the origin of the New Zealand cicada genera [J].
Buckley, TR ;
Arensburger, P ;
Simon, C ;
Chambers, GK .
SYSTEMATIC BIOLOGY, 2002, 51 (01) :4-18
[7]  
Burnham K. P., 1998, MODEL SELECTION INFE
[8]   The complete mitochondrial DNA sequence of the shark Mustelus manazo:: Evaluating rooting contradictions to living bony vertebrates [J].
Cao, Y ;
Waddell, PJ ;
Okada, N ;
Hasegawa, M .
MOLECULAR BIOLOGY AND EVOLUTION, 1998, 15 (12) :1637-1646
[9]  
Clark MA, 2000, EVOLUTION, V54, P517, DOI 10.1111/j.0014-3820.2000.tb00054.x
[10]   Avian evolution, Gondwana biogeography and the Cretaceous-Tertiary mass extinction event [J].
Cracraft, J .
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2001, 268 (1466) :459-469