Detect and adjust for population stratification in population-based association study using genomic control markers:: an application of Affymetrix Genechip® Human Mapping 10K array

被引:38
作者
Hao, K
Li, C
Rosenow, C
Wong, WH
机构
[1] Harvard Univ, Sch Publ Hlth, Dept Biostat, Boston, MA 02115 USA
[2] Dana Farber Canc Inst, Dept Biostat, Boston, MA 02115 USA
[3] Affymetrix, Gen Collaborat, Santa Clara, CA USA
[4] Harvard Univ, Dept Stat, Cambridge, MA 02138 USA
关键词
population stratification; population-based study; association test; genomic control;
D O I
10.1038/sj.ejhg.5201273
中图分类号
Q5 [生物化学]; Q7 [分子生物学];
学科分类号
071010 ; 081704 ;
摘要
Population-based association design is often compromised by false or nonreplicable findings, partially due to population stratification. Genomic control (GC) approaches were proposed to detect and adjust for this confounder. To date, the performance of this strategy has not been extensively evaluated on real data. More than 10000 single-nucleotide polymorphisms (SNPs) were genotyped on subjects from four populations ( including an Asian, an African-American and two Caucasian populations) using GeneChip(R) Mapping 10 K array. On these data, we tested the performance of two GC approaches in different scenarios including various numbers of GC markers and different degrees of population stratification. In the scenario of substantial population stratification, both GC approaches are sensitive using only 20-50 random SNPs, and the mixed subjects can be separated into homogeneous subgroups. In the scenario of moderate stratification, both GC approaches have poor sensitivities. However, the bias in association test can still be corrected even when no statistical significant population stratification is detected. We conducted extensive benchmark analyses on GC approaches using SNPs over the whole human genome. We found GC method can cluster subjects to homogeneous subgroups if there is a substantial difference in genetic background. The inflation factor, estimated by GC markers, can effectively adjust for the confounding effect of population stratification regardless of its extent. We also suggest that as low as 50 random SNPs with heterozygosity >40% should be sufficient as genomic controls.
引用
收藏
页码:1001 / 1006
页数:6
相关论文
共 15 条
[1]   The power of genomic control [J].
Bacanu, SA ;
Devlin, B ;
Roeder, K .
AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 66 (06) :1933-1944
[2]   Genomic control for association studies [J].
Devlin, B ;
Roeder, K .
BIOMETRICS, 1999, 55 (04) :997-1004
[3]  
EWENS WJ, 1995, AM J HUM GENET, V57, P455
[4]   Assessing the impact of population stratification on genetic association studies [J].
Freedman, ML ;
Reich, D ;
Penney, KL ;
McDonald, GJ ;
Mignault, AA ;
Patterson, N ;
Gabriel, SB ;
Topol, EJ ;
Smoller, JW ;
Pato, CN ;
Pato, MT ;
Petryshen, TYL ;
Kolonel, LN ;
Lander, ES ;
Sklar, P ;
Henderson, B ;
Hirschhorn, JN ;
Altshuler, D .
NATURE GENETICS, 2004, 36 (04) :388-393
[5]   Large-scale genotyping of complex DNA [J].
Kennedy, GC ;
Matsuzaki, H ;
Dong, SL ;
Liu, WM ;
Huang, J ;
Liu, GY ;
Xu, X ;
Cao, MQ ;
Chen, WW ;
Zhang, J ;
Liu, WW ;
Yang, G ;
Di, XJ ;
Ryder, T ;
He, ZJ ;
Surti, U ;
Phillips, MS ;
Boyce-Jacino, MT ;
Fodor, SPA ;
Jones, KW .
NATURE BIOTECHNOLOGY, 2003, 21 (10) :1233-1237
[6]  
KNOWLER WC, 1988, AM J HUM GENET, V43, P520
[7]   GENETIC DISSECTION OF COMPLEX TRAITS [J].
LANDER, ES ;
SCHORK, NJ .
SCIENCE, 1994, 265 (5181) :2037-2048
[8]   The effects of human population structure on large genetic association studies [J].
Marchini, J ;
Cardon, LR ;
Phillips, MS ;
Donnelly, P .
NATURE GENETICS, 2004, 36 (05) :512-517
[9]   Tests and estimates of allelic association in complex inheritance [J].
Morton, NE ;
Collins, A .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (19) :11389-11393
[10]  
Pritchard JK, 2000, GENETICS, V155, P945