Fully Bayesian tests of neutrality using genealogical summary statistics

被引:22
作者
Drummond, Alexei J. [1 ,2 ]
Suchard, Marc A. [3 ,4 ,5 ]
机构
[1] Univ Auckland, Bioinformat Inst, Auckland 1, New Zealand
[2] Univ Auckland, Dept Comp Sci, Auckland 1, New Zealand
[3] Univ Calif Los Angeles, David Geffen Sch Med, Dept Biomath, Los Angeles, CA 90095 USA
[4] Univ Calif Los Angeles, David Geffen Sch Med, Dept Human Genet, Los Angeles, CA 90095 USA
[5] Univ Calif Los Angeles, Sch Publ Hlth, Dept Biostat, Los Angeles, CA 90024 USA
基金
英国惠康基金;
关键词
D O I
10.1186/1471-2156-9-68
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Background: Many data summary statistics have been developed to detect departures from neutral expectations of evolutionary models. However questions about the neutrality of the evolution of genetic loci within natural populations remain difficult to assess. One critical cause of this difficulty is that most methods for testing neutrality make simplifying assumptions simultaneously about the mutational model and the population size model. Consequentially, rejecting the null hypothesis of neutrality under these methods could result from violations of either or both assumptions, making interpretation troublesome. Results: Here we harness posterior predictive simulation to exploit summary statistics of both the data and model parameters to test the goodness-of-fit of standard models of evolution. We apply the method to test the selective neutrality of molecular evolution in non-recombining gene genealogies and we demonstrate the utility of our method on four real data sets, identifying significant departures of neutrality in human influenza A virus, even after controlling for variation in population size. Conclusion: Importantly, by employing a full model-based Bayesian analysis, our method separates the effects of demography from the effects of selection. The method also allows multiple summary statistics to be used in concert, thus potentially increasing sensitivity. Furthermore, our method remains useful in situations where analytical expectations and variances of summary statistics are not available. This aspect has great potential for the analysis of temporally spaced data, an expanding area previously ignored for limited availability of theory and methods.
引用
收藏
页数:12
相关论文
共 61 条
[1]   Stochastic models and descriptive statistics for phylogenetic trees, from Yule to today [J].
Aldous, DJ .
STATISTICAL SCIENCE, 2001, 16 (01) :23-34
[2]  
Avise J.C., 1999, Phylogeography: The history and formation of species
[3]   Dynamics of Pleistocene population extinctions in Beringian brown bears [J].
Barnes, I ;
Matheus, P ;
Shapiro, B ;
Jensen, D ;
Cooper, A .
SCIENCE, 2002, 295 (5563) :2267-2270
[4]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[5]   Selection-driven evolution of emergent dengue virus [J].
Bennett, SN ;
Holmes, EC ;
Chirivella, M ;
Rodriguez, DM ;
Beltran, M ;
Vorndam, V ;
Gubler, DJ ;
McMillan, WO .
MOLECULAR BIOLOGY AND EVOLUTION, 2003, 20 (10) :1650-1658
[6]   Bayesian model adequacy and choice in phylogenetics [J].
Bollback, JP .
MOLECULAR BIOLOGY AND EVOLUTION, 2002, 19 (07) :1171-1180
[7]   SAMPLING AND BAYES INFERENCE IN SCIENTIFIC MODELING AND ROBUSTNESS [J].
BOX, GEP .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES A-STATISTICS IN SOCIETY, 1980, 143 :383-430
[8]  
Colless, 1982, SYST ZOOL, V31, P100, DOI [DOI 10.2307/2413420, 10.2307/2413420]
[9]   Measurably evolving populations [J].
Drummond, AJ ;
Pybus, OG ;
Rambaut, A ;
Forsberg, R ;
Rodrigo, AG .
TRENDS IN ECOLOGY & EVOLUTION, 2003, 18 (09) :481-488
[10]  
Drummond AJ, 2002, GENETICS, V161, P1307