Genotype-Based Matching to Correct for Population Stratification in Large-Scale Case-Control Genetic Association Studies

被引:36
作者
Guan, Weihua
Liang, Liming
Boehnke, Michael
Abecasis, Goncalo R. [1 ]
机构
[1] Univ Michigan, Sch Publ Hlth, Dept Biostat, Ann Arbor, MI 48109 USA
基金
美国国家卫生研究院;
关键词
population stratification; genome-wide association; genetic similarity; GENOME-WIDE ASSOCIATION; RISK LOCI; VARIANTS; DESIGNS; SAMPLES; TESTS;
D O I
10.1002/gepi.20403
中图分类号
Q3 [遗传学];
学科分类号
071007 [遗传学];
摘要
Genome-wide association studies are helping to dissect the etiology of complex diseases. Although case-control association tests are generally more powerful than family-based association tests, population stratification can lead to spurious disease-marker association or mask a true association. Several methods have been proposed to match cases and controls prior to genotyping, using family information or epidemiological data, or using genotype data for a modest number of genetic markers. Here, we describe a genetic similarity score matching (GSM) method for efficient matched analysis of cases and controls in a genome-wide or large-scale candidate gene association study. GSM comprises three steps: (1) calculating similarity scores for pairs of individuals using the genotype data; (2) matching sets of cases and controls based on the similarity scores so that matched cases and controls have similar genetic background; and (3) using conditional logistic regression to perform association tests. Through computer simulation we show that GSM correctly controls false-positive rates and improves power to detect true disease predisposing variants. We compare GSM to genomic control using computer simulations, and find improved power using GSM. We suggest that initial matching of cases and controls prior to genotyping combined with careful re-matching after genotyping is a method of choice for genome-wide association studies. Genet. Epidemiol. 33:508-517, 2009. (C) 2009 Wiley-Liss, Inc.
引用
收藏
页码:508 / 517
页数:10
相关论文
共 35 条
[1]
BERTSEKAS DP, 1994, LIDSP2276 MIT DEP EL
[2]
CHASE GR, 1968, BIOMETRIKA, V55, P365, DOI 10.1093/biomet/55.2.365
[3]
Mapping determinants of human gene expression by regional and genome-wide association [J].
Cheung, VG ;
Spielman, RS ;
Ewens, KG ;
Weber, TM ;
Morley, M ;
Burdick, JT .
NATURE, 2005, 437 (7063) :1365-1369
[4]
Genomic control for association studies [J].
Devlin, B ;
Roeder, K .
BIOMETRICS, 1999, 55 (04) :997-1004
[5]
Ding C., 2004, Proc. of Int'l Conf. Machine Learning, P225
[6]
Coalescents and genealogical structure under neutrality [J].
Donnelly, P ;
Tavare, S .
ANNUAL REVIEW OF GENETICS, 1995, 29 :401-421
[7]
A simple and improved correction for population stratification in case-control studies [J].
Epstein, Michael P. ;
Allen, Andrew S. ;
Satten, Glen A. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 80 (05) :921-930
[8]
A computational study of cost reoptimization for min-cost flow problems [J].
Frangioni, A ;
Manca, A .
INFORMS JOURNAL ON COMPUTING, 2006, 18 (01) :61-70
[9]
A second generation human haplotype map of over 3.1 million SNPs [J].
Frazer, Kelly A. ;
Ballinger, Dennis G. ;
Cox, David R. ;
Hinds, David A. ;
Stuve, Laura L. ;
Gibbs, Richard A. ;
Belmont, John W. ;
Boudreau, Andrew ;
Hardenbol, Paul ;
Leal, Suzanne M. ;
Pasternak, Shiran ;
Wheeler, David A. ;
Willis, Thomas D. ;
Yu, Fuli ;
Yang, Huanming ;
Zeng, Changqing ;
Gao, Yang ;
Hu, Haoran ;
Hu, Weitao ;
Li, Chaohua ;
Lin, Wei ;
Liu, Siqi ;
Pan, Hao ;
Tang, Xiaoli ;
Wang, Jian ;
Wang, Wei ;
Yu, Jun ;
Zhang, Bo ;
Zhang, Qingrun ;
Zhao, Hongbin ;
Zhao, Hui ;
Zhou, Jun ;
Gabriel, Stacey B. ;
Barry, Rachel ;
Blumenstiel, Brendan ;
Camargo, Amy ;
Defelice, Matthew ;
Faggart, Maura ;
Goyette, Mary ;
Gupta, Supriya ;
Moore, Jamie ;
Nguyen, Huy ;
Onofrio, Robert C. ;
Parkin, Melissa ;
Roy, Jessica ;
Stahl, Erich ;
Winchester, Ellen ;
Ziaugra, Liuda ;
Altshuler, David ;
Shen, Yan .
NATURE, 2007, 449 (7164) :851-U3
[10]
Assessing the impact of population stratification on genetic association studies [J].
Freedman, ML ;
Reich, D ;
Penney, KL ;
McDonald, GJ ;
Mignault, AA ;
Patterson, N ;
Gabriel, SB ;
Topol, EJ ;
Smoller, JW ;
Pato, CN ;
Pato, MT ;
Petryshen, TYL ;
Kolonel, LN ;
Lander, ES ;
Sklar, P ;
Henderson, B ;
Hirschhorn, JN ;
Altshuler, D .
NATURE GENETICS, 2004, 36 (04) :388-393