Likelihood-Free Inference of Population Structure and Local Adaptation in a Bayesian Hierarchical Model

被引:72
作者
Bazin, Eric [1 ]
Dawson, Kevin J. [2 ]
Beaumont, Mark A. [1 ]
机构
[1] Univ Reading, Sch Biol Sci, Reading RG6 6BX, Berks, England
[2] Rothamsted Res, Harpenden AL5 2JQ, Herts, England
基金
英国生物技术与生命科学研究理事会; 英国工程与自然科学研究理事会;
关键词
ESTIMATING F-STATISTICS; CHAIN MONTE-CARLO; GENETIC DIVERSITY; HUMAN-EVOLUTION; SELECTION; COMPUTATION; GENOME; PARAMETERS; POLYMORPHISMS; SUBDIVISION;
D O I
10.1534/genetics.109.112391
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
We address the problem of finding evidence of natural selection from genetic data, accounting for the confounding effects of demographic history. In the absence of natural selection, gene genealogies should all be sampled from the same underlying distribution, often approximated by a coalescent model. Selection at a particular locus will lead to a modified genealogy, and this motivates a number of recent approaches for detecting the effects of natural selection in the genome as "outliers'' under some models. The demographic history of a population affects the sampling distribution of genealogies, and therefore the observed genotypes and the classification of outliers. Since we cannot see genealogies directly, we have to infer them from the observed data under some model of mutation and demography. Thus the accuracy of an outlier-based approach depends to a greater or a lesser extent on the uncertainty about the demographic and mutational model. A natural modeling framework for this type of problem is provided by Bayesian hierarchical models, in which parameters, such as mutation rates and selection coefficients, are allowed to vary across loci. It has proved quite difficult computationally to implement fully probabilistic genealogical models with complex demographies, and this has motivated the development of approximations such as approximate Bayesian computation (ABC). In ABC the data are compressed into summary statistics, and computation of the likelihood function is replaced by simulation of data under the model. In a hierarchical setting one may be interested both in hyperparameters and parameters, and there may be very many of the latter-for example, in a genetic model, these may be parameters describing each of many loci or populations. This poses a problem for ABC in that one then requires summary statistics for each locus, which, if used naively, leads to a consequent difficulty in conditional density estimation. We develop a general method for applying ABC to Bayesian hierarchical models, and we apply it to detect microsatellite loci influenced by local selection. We demonstrate using receiver operating characteristic (ROC) analysis that this approach has comparable performance to a full-likelihood method and outperforms it when mutation rates are variable across loci.
引用
收藏
页码:587 / 602
页数:16
相关论文
共 61 条
[1]  
[Anonymous], 1961, Applied Statistical Decision Theory
[2]   DNA PROFILE MATCH PROBABILITY CALCULATION - HOW TO ALLOW FOR POPULATION STRATIFICATION, RELATEDNESS, DATABASE SELECTION AND SINGLE BANDS [J].
BALDING, DJ ;
NICHOLS, RA .
FORENSIC SCIENCE INTERNATIONAL, 1994, 64 (2-3) :125-140
[3]   Likelihood-based inference for genetic correlation coefficients [J].
Balding, DJ .
THEORETICAL POPULATION BIOLOGY, 2003, 63 (03) :221-230
[4]   THE BARRIER TO GENETIC EXCHANGE BETWEEN HYBRIDIZING POPULATIONS [J].
BARTON, N ;
BENGTSSON, BO .
HEREDITY, 1986, 57 :357-376
[5]   ELIMINATION OF NUISANCE PARAMETERS [J].
BASU, D .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1977, 72 (358) :355-366
[6]  
Beaumont MA, 2002, GENETICS, V162, P2025
[7]   Identifying adaptive genetic divergence among populations from genome scans [J].
Beaumont, MA ;
Balding, DJ .
MOLECULAR ECOLOGY, 2004, 13 (04) :969-980
[8]   Evaluating loci for use in the genetic analysis of population structure [J].
Beaumont, MA ;
Nichols, RA .
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 1996, 263 (1377) :1619-1626
[9]   Selection and sticklebacks [J].
Beaumont, Mark A. .
MOLECULAR ECOLOGY, 2008, 17 (15) :3425-3427
[10]   Adaptive approximate Bayesian computation [J].
Beaumont, Mark A. ;
Cornuet, Jean-Marie ;
Marin, Jean-Michel ;
Robert, Christian P. .
BIOMETRIKA, 2009, 96 (04) :983-990