Simple and efficient analysis of disease association with missing genotype data

被引:71
作者
Lin, D. Y. [1 ]
Hu, Y. [1 ]
Huang, Be [1 ]
机构
[1] Univ N Carolina, Dept Biostat, Chapel Hill, NC 27599 USA
关键词
D O I
10.1016/j.ajhg.2007.11.004
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Missing genotype data arise in association studies when the single-nucleotide polymorphisms (SNPs) on the genotying platform are not assayed successfully, when the SNPs of interest are not on the platform, or when total sequence variation is determined only on a small fraction of individuals. We present a simple and flexible likelihood framework to study SNP-disease associations with such missing genotype data. Our likelihood makes full use of all available data in case-control studies and reference panels (e.g., the HapMap), and it properly accounts for the biased nature of the case-control sampling as well as the uncertainty in inferring unknown variants. The corresponding maximum-likelihood estimators for genetic effects and gene-environment interactions are unbiased and statistically efficient. We developed fast and stable numerical algorithms to calculate the maximum-likelihood estimators and their variances, and we implemented these algorithms in a freely available computer program. Simulation studies demonstrated that the new approach is more powerful than existing methods while providing accurate control of the type I error. An application to a case-control study on rheumatoid arthritis revealed several loci that deserve further investigations.
引用
收藏
页码:444 / 452
页数:9
相关论文
共 21 条
[1]   A haplotype map of the human genome [J].
Altshuler, D ;
Brooks, LD ;
Chakravarti, A ;
Collins, FS ;
Daly, MJ ;
Donnelly, P ;
Gibbs, RA ;
Belmont, JW ;
Boudreau, A ;
Leal, SM ;
Hardenbol, P ;
Pasternak, S ;
Wheeler, DA ;
Willis, TD ;
Yu, FL ;
Yang, HM ;
Zeng, CQ ;
Gao, Y ;
Hu, HR ;
Hu, WT ;
Li, CH ;
Lin, W ;
Liu, SQ ;
Pan, H ;
Tang, XL ;
Wang, J ;
Wang, W ;
Yu, J ;
Zhang, B ;
Zhang, QR ;
Zhao, HB ;
Zhao, H ;
Zhou, J ;
Gabriel, SB ;
Barry, R ;
Blumenstiel, B ;
Camargo, A ;
Defelice, M ;
Faggart, M ;
Goyette, M ;
Gupta, S ;
Moore, J ;
Nguyen, H ;
Onofrio, RC ;
Parkin, M ;
Roy, J ;
Stahl, E ;
Winchester, E ;
Ziaugra, L ;
Shen, Y .
NATURE, 2005, 437 (7063) :1299-1320
[2]   A tutorial on statistical methods for population association studies [J].
Balding, David J. .
NATURE REVIEWS GENETICS, 2006, 7 (10) :781-791
[3]   Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering [J].
Browning, Sharon R. ;
Browning, Brian L. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 81 (05) :1084-1097
[4]   Efficiency and power in genetic association studies [J].
de Bakker, PIW ;
Yelensky, R ;
Pe'er, I ;
Gabriel, SB ;
Daly, MJ ;
Altshuler, D .
NATURE GENETICS, 2005, 37 (11) :1217-1223
[5]  
EXCOFFIER L, 1995, MOL BIOL EVOL, V12, P921
[6]   Whole-genome patterns of common DNA variation in three human populations [J].
Hinds, DA ;
Stuve, LL ;
Nilsen, GB ;
Halperin, E ;
Eskin, E ;
Ballinger, DG ;
Frazer, KA ;
Cox, DR .
SCIENCE, 2005, 307 (5712) :1072-1079
[7]   The use of inferred haplotypes in downstream analyses [J].
Lin, D. Y. ;
Huang, B. E. .
AMERICAN JOURNAL OF HUMAN GENETICS, 2007, 80 (03) :577-579
[8]   Evaluating statistical significance in two-stage genomewide association studies [J].
Lin, DY .
AMERICAN JOURNAL OF HUMAN GENETICS, 2006, 78 (03) :505-509
[9]   Likelihood-based inference on haplotype effects in genetic association studies [J].
Lin, DY ;
Zeng, D .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2006, 101 (473) :89-104
[10]   New models of collaboration in genome-wide association studies: the Genetic Association Information Network [J].
Manolio, Teri A. ;
Rodriguez, Laura Lyman ;
Brooks, Lisa ;
Abecasis, Goncalo ;
Ballinger, Dennis ;
Daly, Mark ;
Donnelly, Peter ;
Faraone, Stephen V. ;
Frazer, Kelly ;
Gabriel, Stacey ;
Gejman, Pablo ;
Guttmacher, Alan ;
Harris, Emily L. ;
Insel, Thomas ;
Kelsoe, John R. ;
Lander, Eric ;
McCowin, Norma ;
Mailman, Matthew D. ;
Nabel, Elizabeth ;
Ostell, James ;
Pugh, Elizabeth ;
Sherry, Stephen ;
Sullivan, Patrick F. ;
Thompson, John F. ;
Warram, James ;
Study, GoKinD ;
Wholley, David ;
Milos, Patrice M. ;
Collins, Francis S. .
NATURE GENETICS, 2007, 39 (09) :1045-1051