Rare and Low Frequency Variant Stratification in the UK Population: Description and Impact on Association Tests

被引:17
作者
Babron, Marie-Claude [1 ,2 ]
de Tayrac, Marie [1 ,2 ]
Rutledge, Douglas N. [3 ]
Zeggini, Eleftheria [4 ]
Genin, Emmanuelle [1 ,2 ]
机构
[1] INSERM, UMRS 946, Paris, France
[2] Univ Paris Diderot, Inst Univ Hematol, Paris, France
[3] AgroParisTech, Paris, France
[4] Wellcome Trust Sanger Inst, Cambridge, England
来源
PLOS ONE | 2012年 / 7卷 / 10期
基金
英国惠康基金;
关键词
GENOME-WIDE ASSOCIATION; GENETIC ASSOCIATION; LARGE-SCALE; SUSCEPTIBILITY; MODEL; STRATEGIES; ANCESTRY; DISEASE; MARKERS; SAMPLES;
D O I
10.1371/journal.pone.0046519
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Although variations in allele frequencies at common SNPs have been extensively studied in different populations, little is known about the stratification of rare variants and its impact on association tests. In this paper, we used Affymetrix 500K genotype data from the WTCCC to investigate if variants in three different frequency categories (below 1%, between 1 and 5%, above 5%) show different stratification patterns in the UK population. We found that these patterns are indeed different. The top principal component extracted from the rare variant category shows poor correlations with any principal component or combination of principal components from the low frequency or common variant categories. These results could suggest that a suitable solution to avoid false positive association due to population stratification would involve adjusting for the respective PCs when testing for variants in different allele frequency categories. However, we found this was not the case both on type 2 diabetes data and on simulated data. Indeed, adjusting rare variant association tests on PCs derived from rare variants does no better to correct for population stratification than adjusting on PCs derived from more common variants. Mixed models perform slightly better for low frequency variants than PC based adjustments but less well for the rarest variants. These results call for the need of new methodological developments specifically devoted to address rare variant stratification issues in association tests.
引用
收藏
页数:9
相关论文
共 49 条
[31]  
Mathieson I., 2012, Nature Genetics
[32]   Exploring the unknown: assumptions about allelic architecture and strategies for susceptibility variant discovery [J].
McCarthy, Mark I. .
GENOME MEDICINE, 2009, 1
[33]   Genes mirror geography within Europe [J].
Novembre, John ;
Johnson, Toby ;
Bryc, Katarzyna ;
Kutalik, Zoltan ;
Boyko, Adam R. ;
Auton, Adam ;
Indap, Amit ;
King, Karen S. ;
Bergmann, Sven ;
Nelson, Matthew R. ;
Stephens, Matthew ;
Bustamante, Carlos D. .
NATURE, 2008, 456 (7218) :98-U5
[34]   Tracing Sub-Structure in the European American Population with PCA-Informative Markers [J].
Paschou, Peristera ;
Drineas, Petros ;
Lewis, Jamey ;
Nievergelt, Caroline M. ;
Nickerson, Deborah A. ;
Smith, Joshua D. ;
Ridker, Paul M. ;
Chasman, Daniel I. ;
Krauss, Ronald M. ;
Ziv, Elad .
PLOS GENETICS, 2008, 4 (07)
[35]   Population structure and eigenanalysis [J].
Patterson, Nick ;
Price, Alkes L. ;
Reich, David .
PLOS GENETICS, 2006, 2 (12) :2074-2093
[36]   Principal components analysis corrects for stratification in genome-wide association studies [J].
Price, Alkes L. ;
Patterson, Nick J. ;
Plenge, Robert M. ;
Weinblatt, Michael E. ;
Shadick, Nancy A. ;
Reich, David .
NATURE GENETICS, 2006, 38 (08) :904-909
[37]   Association mapping in structured populations [J].
Pritchard, JK ;
Stephens, M ;
Rosenberg, NA ;
Donnelly, P .
AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 67 (01) :170-181
[38]   Use of unlinked genetic markers to detect population stratification in association studies [J].
Pritchard, JK ;
Rosenberg, NA .
AMERICAN JOURNAL OF HUMAN GENETICS, 1999, 65 (01) :220-228
[39]   PLINK: A tool set for whole-genome association and population-based linkage analyses [J].
Purcell, Shaun ;
Neale, Benjamin ;
Todd-Brown, Kathe ;
Thomas, Lori ;
Ferreira, Manuel A. R. ;
Bender, David ;
Maller, Julian ;
Sklar, Pamela ;
de Bakker, Paul I. W. ;
Daly, Mark J. ;
Sham, Pak C. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (03) :559-575
[40]   On the allelic spectrum of human disease [J].
Reich, DE ;
Lander, ES .
TRENDS IN GENETICS, 2001, 17 (09) :502-510