Least and most powerful phylogenetic tests to elucidate the origin of the seed plants in the presence of conflicting signals under misspecified models

被引:18
作者
Aris-Brosou, S [1 ]
机构
[1] N Carolina State Univ, Bioinformat Res Ctr, Raleigh, NC 27695 USA
基金
美国国家科学基金会;
关键词
approximately unbiased test; bootstrap proportion; hypothesis test; P-value adjustment; Shimodaira-Hasegawa test; significance test;
D O I
10.1080/10635150390258949
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Several tests of molecular phylogenies have been proposed over the last decades, but most of them lead to strikingly different P- values. I propose that such discrepancies are principally due to different forms of null hypotheses. To support this hypothesis, two new tests are described. Both consider the composite null hypothesis that all the topologies are equidistant from the true but unknown topology. This composite hypothesis can either be reduced to the simple hypothesis at the least favorable distribution ( frequentist significance test [ FST]) or to the maximum likelihood topology ( frequentist hypothesis test [ FHT]). In both cases, the reduced null hypothesis is tested against each topology included in the analysis. The tests proposed have an information- theoretic justification, and the distribution of their test statistic is estimated by a nonparametric bootstrap, adjusting P- values for multiple comparisons. I applied the new tests to the reanalysis of two chloroplast genes, psaA and psbB, and compared the results with those of previously described tests. As expected, the FST and the FHT behaved approximately like the Shimodaira - Hasegawa test and the bootstrap, respectively. Although the tests give overconfidence in a wrong tree when an overly simple nucleotide substitution model is assumed, more complex models incorporating heterogeneity among codon positions resolve some conflicts. To further investigate the influence of the null hypothesis, a power study was conducted. Simulations showed that FST and the Shimodaira - Hasegawa test are the least powerful and FHT is the most powerful across the parameter space. Although the size of all the tests is affected by misspecification, the two new tests appear more robust against misspecification of the model of evolution and consistently supported the hypothesis that the Gnetales are nested within gymnosperms.
引用
收藏
页码:781 / 793
页数:13
相关论文
共 64 条
[1]  
[Anonymous], 1993, Resampling-based multiple testing: Examples and methods for P-value adjustment
[2]   How Bayes tests of molecular phylogenies compare with frequentist approaches [J].
Aris-Brosou, S .
BIOINFORMATICS, 2003, 19 (05) :618-624
[3]   Effects of models of rate evolution on estimation of divergence dates with special reference to the metazoan 18S ribosomal RNA Phylogeny [J].
Aris-Brosou, S ;
Yang, ZH .
SYSTEMATIC BIOLOGY, 2002, 51 (05) :703-714
[4]   Comparing the likelihood functions of phylogenetic trees [J].
Bar-Hen, A ;
Kishino, H .
ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 2000, 52 (01) :43-56
[5]   Evaluating hypotheses on the origin and evolution of the New Zealand alpine cicadas (maoricicada) using multiple-comparison tests of tree topology [J].
Buckley, TR ;
Simon, C ;
Shimodaira, H ;
Chambers, GK .
MOLECULAR BIOLOGY AND EVOLUTION, 2001, 18 (02) :223-234
[6]   Model misspecification and probabilistic tests of topology: Evidence from empirical data sets [J].
Buckley, TR .
SYSTEMATIC BIOLOGY, 2002, 51 (03) :509-523
[7]   Combined data, Bayesian phylogenetics, and the origin of the New Zealand cicada genera [J].
Buckley, TR ;
Arensburger, P ;
Simon, C ;
Chambers, GK .
SYSTEMATIC BIOLOGY, 2002, 51 (01) :4-18
[8]   PARTITIONING AND COMBINING DATA IN PHYLOGENETIC ANALYSIS [J].
BULL, JJ ;
HUELSENBECK, JP ;
CUNNINGHAM, CW ;
SWOFFORD, DL ;
WADDELL, PJ .
SYSTEMATIC BIOLOGY, 1993, 42 (03) :384-397
[9]  
Burnham K. P., 1998, MODEL SELECTION INFE
[10]   Interordinal relationships and timescale of eutherian evolution as inferred from mitochondrial genome data [J].
Cao, Y ;
Fujiwara, M ;
Nikaido, M ;
Okada, N ;
Hasegawa, M .
GENE, 2000, 259 (1-2) :149-158