Exploiting gene-environment independence for analysis of case-control studies: An empirical bayes-type shrinkage estimator to trade-off between bias and efficiency

被引:146
作者
Mukherjee, Bhramar [1 ]
Chatterjee, Nilanjan [2 ]
机构
[1] Univ Michigan, Dept Biostat, Ann Arbor, MI 48109 USA
[2] NCI, Div Canc Epidemiol & Genet, NIH, Dept Hlth & Human Serv, Rockville, MD 20852 USA
关键词
case-only designs; gene-environment interaction; profile likelihood; retrospective analysis; semiparametrics;
D O I
10.1111/j.1541-0420.2007.00953.x
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Standard prospective logistic regression analysis of case-control data often leads to very imprecise estimates of gene-environment interactions due to small numbers of cases or controls in cells of crossing genotype and exposure. In contrast, under the assumption of gene-environment independence, modern "retrospective" methods, including the "case-only" approach, can estimate the interaction parameters much more precisely, but they can be seriously biased when the underlying assumption of gene-environment independence is violated. In this article, we propose a novel empirical Bayes-type shrinkage estimator to analyze case-control data that can relax the gene-environment independence assumption in a data-adaptive fashion. In the special case, involving a binary gene and a binary exposure, the method leads to an estimator of the interaction log odds ratio parameter in a simple closed form that corresponds to an weighted average of the standard case-only and case-control estimators. We also describe a general approach for deriving the new shrinkage estimator and its variance within the retrospective maximum-likelihood framework developed by Chatterjee and Carroll (2005, Biometrika 92, 399-418). Both simulated and real data examples suggest that the proposed estimator strikes a balance between bias and efficiency depending on the true nature of the gene-environment association and the sample size for a given study.
引用
收藏
页码:685 / 694
页数:10
相关论文
共 28 条
[1]   Limitations of the case-only design for identifying gene-environment interactions [J].
Albert, PS ;
Ratnasinghe, D ;
Tangrea, J ;
Wacholder, S .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2001, 154 (08) :687-693
[2]  
ANDERSEN EB, 1970, J ROY STAT SOC B, V32, P283
[3]  
[Anonymous], 2000, C&H TEXT STAT SCI
[4]  
Berger JO., 1985, STAT DECISION THEORY, DOI DOI 10.1007/978-1-4757-4286-2
[5]   Serniparametric maximum likelihood estimation exploiting gene-environment independence in case-control studies [J].
Chatterjee, N ;
Carroll, RJ .
BIOMETRIKA, 2005, 92 (02) :399-418
[6]   Exploiting Hardy-Weinberg equilibrium for efficient screening of single SNP associations from case-control studies [J].
Chen, Jinbo ;
Chatterjee, Nilanjan .
HUMAN HEREDITY, 2007, 63 (3-4) :196-204
[7]   NOTE ON PARTIALLY BAYES INFERENCE AND LINEAR-MODEL [J].
COX, DR .
BIOMETRIKA, 1975, 62 (03) :651-654
[8]   EMPIRICAL BAYES ON VECTOR OBSERVATIONS - EXTENSION OF STEINS METHOD [J].
EFRON, B ;
MORRIS, C .
BIOMETRIKA, 1972, 59 (02) :335-347
[9]   BAYES AND LIKELIHOOD CALCULATIONS FROM CONFIDENCE-INTERVALS [J].
EFRON, B .
BIOMETRIKA, 1993, 80 (01) :3-26
[10]   Inference on haplotype effects in case-control studies using unphased genotype data [J].
Epstein, MP ;
Satten, GA .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (06) :1316-1329