Simple estimates of haplotype relative risks in case-control data

被引:39
作者
French, Benjamin
Lumley, Thomas
Monks, Stephanie A.
Rice, Kenneth M.
Hindorff, Lucia A.
Reiner, Alexander P.
Psaty, Bruce M.
机构
[1] Univ Washington, Dept Biostat, Seattle, WA 98195 USA
[2] Univ Washington, Dept Epidemiol, Seattle, WA 98195 USA
[3] Univ Washington, Dept Med, Seattle, WA 98195 USA
[4] Univ Washington, Cardiovasc Hlth Res Unit, Seattle, WA 98195 USA
关键词
linkage disequilibrium; unphased genotypes; phase ambiguity; imputation; weighted logistic regression; gene-environment interaction;
D O I
10.1002/gepi.20161
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Methods of varying complexity have been proposed to efficiently estimate haplotype relative risks in case-control data. Our goal was to compare methods that estimate associations between disease conditions and common haplotypes in large case-control studies such that haplotype imputation is done once as a simple data-processing step. We performed a simulation study based on haplotype frequencies for two renin-angiotensin system genes. The iterative and noniterative methods we compared involved fitting a weighted logistic regression, but differed in how the probability weights were specified. We also quantified the amount of ambiguity in the simulated genes. For one gene, there was essentially no uncertainty in the imputed diplotypes and every method performed well. For the other, similar to 60% of individuals had an unambiguous diplotype, and similar to 90% had a highest posterior probability greater than 0.75. For this gene, all methods performed well under no genetic effects, moderate effects, and strong effects tagged by a single nucleotide polymorphism (SNP). Noniterative methods produced biased estimates under strong effects not tagged by an SNP. For the most likely diplotype, median bias of the logrelative risks ranged between -0.49 and 0.22 over all haplotypes. For all possible diplotypes, median bias ranged between -0.73 and 0.08. Results were similar under interaction with a binary covariate. Noniterative weighted logistic regression provides valid tests for genetic associations and reliable estimates of modest effects of common haplotypes, and can be implemented in standard software. The potential for phase ambiguity does not necessarily imply uncertainty in imputed diplotypes, especially in large studies of common haplotypes. Genet. Epidemiol. 30:485-494, 2006. (c) 2006 Wiley-Liss, Inc.
引用
收藏
页码:485 / 494
页数:10
相关论文
共 16 条
[1]   Serniparametric maximum likelihood estimation exploiting gene-environment independence in case-control studies [J].
Chatterjee, N ;
Carroll, RJ .
BIOMETRIKA, 2005, 92 (02) :399-418
[2]   Inference on haplotype effects in case-control studies using unphased genotype data [J].
Epstein, MP ;
Satten, GA .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (06) :1316-1329
[3]  
FRENCH B, 2005, HAPLO CCS ESTIMATE H
[4]   Accounting for haplotype uncertainty in matched association studies: A comparison of simple and flexible techniques [J].
Kraft, P ;
Cox, DG ;
Paynter, RA ;
Hunter, D ;
De Vivo, I .
GENETIC EPIDEMIOLOGY, 2005, 28 (03) :261-272
[5]   Estimation and tests of haplotype-environment interaction when linkage phase is ambiguous [J].
Lake, SL ;
Lyon, H ;
Tantisira, K ;
Silverman, EK ;
Weiss, ST ;
Laird, NM ;
Schaid, DJ .
HUMAN HEREDITY, 2003, 55 (01) :56-65
[6]  
LIANG KY, 1986, BIOMETRIKA, V73, P13, DOI 10.1093/biomet/73.1.13
[7]  
Little R.J., 1987, Statistical Analysis With Missing Data
[8]  
LOUIS TA, 1982, J ROY STAT SOC B MET, V44, P226
[9]   Score tests for association between traits and haplotypes when linkage phase is ambiguous [J].
Schaid, DJ ;
Rowland, CM ;
Tines, DE ;
Jacobson, RM ;
Poland, GA .
AMERICAN JOURNAL OF HUMAN GENETICS, 2002, 70 (02) :425-434
[10]  
Sinnwell J.P., 2005, HAPLO STATS STAT ANA