Correcting for ascertainment bias in the inference of population structure

被引:23
作者
Guillot, Gilles [1 ,2 ,3 ]
Foll, Matthieu [4 ,5 ]
机构
[1] Univ Oslo, Dept Biol, Ctr Ecol & Evolutionary Synth, N-0316 Oslo, Norway
[2] Chalmers, Gothenburg Stochast Ctr, S-41296 Gothenburg, Sweden
[3] INRA Agro ParisTech, Dept Appl Math & Comp Sci, Paris, France
[4] Univ Bern, Inst Ecol & Evolut, Computat & Mol Populat Genet Lab, CH-3012 Bern, Switzerland
[5] Swiss Inst Bioinformat, Bern, Switzerland
关键词
CORRELATED ALLELE FREQUENCIES; DIFFERENTIATION; POLYMORPHISM; MARKERS; MODEL; LOCI;
D O I
10.1093/bioinformatics/btn665
中图分类号
Q5 [生物化学];
学科分类号
071010 ; 081704 ;
摘要
Background: The ascertainment process of molecular markers amounts to disregard loci carrying alleles with low frequencies. This can result in strong biases in inferences under population genetics models if not properly taken into account by the inference algorithm. Attempting to model this censoring process in view of making inference of population structure (i.e. identifying clusters of individuals) brings up challenging numerical difficulties. Method: These difficulties are related to the presence of intractable normalizing constants in Metropolis-Hastings acceptance ratios. This can be solved via an Markov chain Monte Carlo (MCMC) algorithm known as single variable exchange algorithm (SVEA). Result: We show how this general solution can be implemented for a class of clustering models of broad interest in population genetics that includes the models underlying the computer programs STRUCTURE, GENELAND and GESTE. We also implement the method proposed for a simple example and show that it allows us to reduce the bias substantially.
引用
收藏
页码:552 / 554
页数:3
相关论文
共 19 条
[1]  
ANDRIEU C, 2009, ANN STAT IN PRESS
[2]  
[Anonymous], 2006, P 22 ANN C UNC ART I
[3]  
ATCHADE Y, 2008, ARXIV08043152V1
[4]   Likelihood-based inference for genetic correlation coefficients [J].
Balding, DJ .
THEORETICAL POPULATION BIOLOGY, 2003, 63 (03) :221-230
[5]   A METHOD FOR QUANTIFYING DIFFERENTIATION BETWEEN POPULATIONS AT MULTI-ALLELIC LOCI AND ITS IMPLICATIONS FOR INVESTIGATING IDENTITY AND PATERNITY [J].
BALDING, DJ ;
NICHOLS, RA .
GENETICA, 1995, 96 (1-2) :3-12
[6]  
Beaumont MA, 2003, GENETICS, V164, P1139
[7]   Ascertainment bias in studies of human genome-wide polymorphism [J].
Clark, AG ;
Hubisz, MJ ;
Bustamante, CD ;
Williamson, SH ;
Nielsen, R .
GENOME RESEARCH, 2005, 15 (11) :1496-1502
[8]  
Falush D, 2003, GENETICS, V164, P1567
[9]   An approximate Bayesian computation approach to overcome biases that arise when using amplified fragment length polymorphism markers to study population structure [J].
Foll, Matthieu ;
Beaumont, Mark A. ;
Gaggiotti, Oscar .
GENETICS, 2008, 179 (02) :927-939
[10]   Identifying the environmental factors that determine the genetic structure of Populations [J].
Foll, Matthieu ;
Gaggiotti, Oscar .
GENETICS, 2006, 174 (02) :875-891