Analysis of case-control studies of genetic and environmental factors with missing genetic information and haplotype-phase ambiguity

被引:51
作者
Spinka, C
Carroll, RJ
Chatterjee, N
机构
[1] NCI, Div Canc Epidemiol & Genet, Biostat Branch, Rockville, MD 20852 USA
[2] Univ Missouri, Dept Stat, Columbia, MO 65211 USA
[3] Texas A&M Univ, Dept Stat, College Stn, TX 77843 USA
关键词
case-control studies; gene-environment interactions; EM algorithm; haplotype; semiparametric methods;
D O I
10.1002/gepi.20085
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
Case-control studies of unrelated subjects are now widely used to study the role of genetic susceptibility and gene-environment interactions in the etiology of complex diseases. Exploiting an assumption of gene-environment independence, and treating the distribution of environmental exposures as completely nonparametric, Chatterjee and Carroll [2005] (Biometrika 92:399-418) recently developed an efficient retrospective maximum-likelihood method for analysis of case-control studies. In this article, we develop an extension of the retrospective maximum-likelihood approach to studies where genetic information may be missing on some study subjects. In particular, special emphasis is given to haplotype-based studies where missing data arise due to linkage-phase ambiguity of genotype data. We use a profile likelihood technique and an appropriate expectation-maximization (EM) algorithm to derive a relatively simple procedure for parameter estimation, with or without a rare disease assumption, and possibly incorporating information on the marginal probability of the disease for the underlying population. We also describe two alternative robust approaches that are less sensitive to the underlying gene-environment independence and Hardy-Weinberg-equilibrium assumptions. The performance of the proposed methods is studied using simulation studies in the context of haplotype-based studies of gene-environment interactions. An application of the proposed method is illustrated using a case-control study of ovarian cancer designed to investigate the interaction between BRCA1/2 mutations and reproductive risk factors in the etiology of ovarian cancer.
引用
收藏
页码:108 / 127
页数:20
相关论文
共 15 条
[1]   Limitations of the case-only design for identifying gene-environment interactions [J].
Albert, PS ;
Ratnasinghe, D ;
Tangrea, J ;
Wacholder, S .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2001, 154 (08) :687-693
[2]  
ANDERSEN EB, 1970, J ROY STAT SOC B, V32, P283
[3]   Serniparametric maximum likelihood estimation exploiting gene-environment independence in case-control studies [J].
Chatterjee, N ;
Carroll, RJ .
BIOMETRIKA, 2005, 92 (02) :399-418
[4]   Inference on haplotype effects in case-control studies using unphased genotype data [J].
Epstein, MP ;
Satten, GA .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 73 (06) :1316-1329
[5]  
EXCOFFIER L, 1995, MOL BIOL EVOL, V12, P921
[6]   Estimation and tests of haplotype-environment interaction when linkage phase is ambiguous [J].
Lake, SL ;
Lyon, H ;
Tantisira, K ;
Silverman, EK ;
Weiss, ST ;
Laird, NM ;
Schaid, DJ .
HUMAN HEREDITY, 2003, 55 (01) :56-65
[7]   Parity, oral contraceptives, and the risk of ovarian cancer among carriers and noncarriers of a BRCA1 or BRCA2 mutation [J].
Modan, B ;
Hartge, P ;
Hirsh-Yechezkel, G ;
Chetrit, A ;
Lubin, F ;
Beller, U ;
Ben-Baruch, G ;
Fishman, A ;
Menczer, J ;
Struewing, JP ;
Tucker, MA ;
Wacholder, S ;
Ebbers, SM ;
Friedman, E ;
Piura, B .
NEW ENGLAND JOURNAL OF MEDICINE, 2001, 345 (04) :235-240
[8]   LOGISTIC DISEASE INCIDENCE MODELS AND CASE-CONTROL STUDIES [J].
PRENTICE, RL ;
PYKE, R .
BIOMETRIKA, 1979, 66 (03) :403-411
[9]   A semiparametric mixture approach to case-control studies with errors in covariables [J].
Roeder, K ;
Carroll, RJ ;
Lindsay, BG .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (434) :722-732
[10]   Comparison of prospective and retrospective methods for haplotype inference in case-control studies [J].
Satten, GA ;
Epstein, MP .
GENETIC EPIDEMIOLOGY, 2004, 27 (03) :192-201