Accounting for Population Stratification in Practice: A Comparison of the Main Strategies Dedicated to Genome-Wide Association Studies

被引:47
作者
Bouaziz, Matthieu [1 ,2 ]
Ambroise, Christophe [2 ]
Guedj, Mickael [1 ]
机构
[1] Pharnext, Dept Biostat, Paris, France
[2] Univ Evry Val dEssonne, Stat & Genome Lab, UMR CNRS 8071, USC INRA, Evry, France
关键词
GENETIC ASSOCIATION; SIMULATION; SUBSTRUCTURE; INFERENCE;
D O I
10.1371/journal.pone.0028845
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
070301 [无机化学]; 070403 [天体物理学]; 070507 [自然资源与国土空间规划学]; 090105 [作物生产系统与生态工程];
摘要
Genome-Wide Association Studies are powerful tools to detect genetic variants associated with diseases. Their results have, however, been questioned, in part because of the bias induced by population stratification. This is a consequence of systematic differences in allele frequencies due to the difference in sample ancestries that can lead to both false positive or false negative findings. Many strategies are available to account for stratification but their performances differ, for instance according to the type of population structure, the disease susceptibility locus minor allele frequency, the degree of sampling imbalanced, or the sample size. We focus on the type of population structure and propose a comparison of the most commonly used methods to deal with stratification that are the Genomic Control, Principal Component based methods such as implemented in Eigenstrat, adjusted Regressions and Meta-Analyses strategies. Our assessment of the methods is based on a large simulation study, involving several scenarios corresponding to many types of population structures. We focused on both false positive rate and power to determine which methods perform the best. Our analysis showed that if there is no population structure, none of the tests led to a bias nor decreased the power except for the Meta-Analyses. When the population is stratified, adjusted Logistic Regressions and Eigenstrat are the best solutions to account for stratification even though only the Logistic Regressions are able to constantly maintain correct false positive rates. This study provides more details about these methods. Their advantages and limitations in different stratification scenarios are highlighted in order to propose practical guidelines to account for population stratification in Genome-Wide Association Studies.
引用
收藏
页数:13
相关论文
共 42 条
[1]
A tutorial on statistical methods for population association studies [J].
Balding, David J. .
NATURE REVIEWS GENETICS, 2006, 7 (10) :781-791
[2]
Ancestry estimation and correction for population stratification in molecular epidemiologic association studies [J].
Barnholtz-Sloan, Jill S. ;
McEvoy, Brian ;
Shriver, Mark D. ;
Rebbeck, Timothy R. .
CANCER EPIDEMIOLOGY BIOMARKERS & PREVENTION, 2008, 17 (03) :471-477
[3]
Population stratification and spurious allelic association [J].
Cardon, LR ;
Palmer, LJ .
LANCET, 2003, 361 (9357) :598-604
[4]
Fregene: Simulation of realistic sequence-level data in populations and ascertained samples [J].
Chadeau-Hyam, Marc ;
Hoggart, Clive J. ;
O'Reilly, Paul F. ;
Whittaker, John C. ;
De Iorio, Maria ;
Balding, David J. .
BMC BIOINFORMATICS, 2008, 9 (1)
[5]
Qualitative semi-parametric test for genetic associations in case-control designs under structured populations [J].
Chen, HS ;
Zhu, X ;
Zhao, H ;
Zhang, S .
ANNALS OF HUMAN GENETICS, 2003, 67 :250-264
[6]
Simultaneously correcting for population stratification and for genotyping error in case-control association studies [J].
Cheng, K. F. ;
Lin, W. J. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (04) :726-743
[7]
A Critical Evaluation of Genomic Control Methods for Genetic Association Studies [J].
Dadd, Tony ;
Weale, Michael E. ;
Lewis, Cathryn M. .
GENETIC EPIDEMIOLOGY, 2009, 33 (04) :290-298
[8]
Deng HW, 2001, GENETICS, V159, P1319
[9]
Genomic control for association studies [J].
Devlin, B ;
Roeder, K .
BIOMETRICS, 1999, 55 (04) :997-1004
[10]
A simple and improved correction for population stratification in case-control studies [J].
Epstein, Michael P. ;
Allen, Andrew S. ;
Satten, Glen A. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 80 (05) :921-930