Hierarchical Modeling for Estimating Relative Risks of Rare Genetic Variants: Properties of the Pseudo-Likelihood Method

被引:16
作者
Capanu, Marinela [1 ]
Begg, Colin B. [1 ]
机构
[1] Mem Sloan Kettering Canc Ctr, New York, NY 10021 USA
关键词
Bayesian; Genetic risk; Hierarchical models; Pseudo-likelihood; LINEAR MIXED MODELS; UNKNOWN CLINICAL-SIGNIFICANCE; BREAST-CANCER; ENVIRONMENT INTERACTIONS; BIAS CORRECTION; MELANOMA; POLYMORPHISMS; DISPERSION; MUTATIONS; DESIGN;
D O I
10.1111/j.1541-0420.2010.01469.x
中图分类号
Q [生物科学];
学科分类号
07 ; 0710 ; 09 ;
摘要
Many major genes have been identified that strongly influence the risk of cancer. However, there are typically many different mutations that can occur in the gene, each of which may or may not confer increased risk. It is critical to identify which specific mutations are harmful, and which ones are harmless, so that individuals who learn from genetic testing that they have a mutation can be appropriately counseled. This is a challenging task, since new mutations are continually being identified, and there is typically relatively little evidence available about each individual mutation. In an earlier article, we employed hierarchical modeling (Capanu et al., 2008, Statistics in Medicine 27, 1973-1992) using the pseudo-likelihood and Gibbs sampling methods to estimate the relative risks of individual rare variants using data from a case-control study and showed that one can draw strength from the aggregating power of hierarchical models to distinguish the variants that contribute to cancer risk. However, further research is needed to validate the application of asymptotic methods to such sparse data. In this article, we use simulations to study in detail the properties of the pseudo-likelihood method for this purpose. We also explore two alternative approaches: pseudo-likelihood with correction for the variance component estimate as proposed by Lin and Breslow (1996, Journal of the American Statistical Association 91, 1007-1016) and a hybrid pseudo-likelihood approach with Bayesian estimation of the variance component. We investigate the validity of these hierarchical modeling techniques by looking at the bias and coverage properties of the estimators as well as at the efficiency of the hierarchical modeling estimates relative to that of the maximum likelihood estimates. The results indicate that the estimates of the relative risks of very sparse variants have small bias, and that the estimated 95% confidence intervals are typically anti-conservative, though the actual coverage rates are generally above 90%. The widths of the confidence intervals narrow as the residual variance in the second-stage model is reduced. The results also show that the hierarchical modeling estimates have shorter confidence intervals relative to estimates obtained from conventional logistic regression, and that these relative improvements increase as the variants become more rare.
引用
收藏
页码:371 / 380
页数:10
相关论文
共 25 条
[1]  
Aragaki CC, 1997, CANCER EPIDEM BIOMAR, V6, P307
[2]   A design for cancer case-control studies using only incident cases: experience with the GEM study of melanoma [J].
Begg, Colin B. ;
Hummer, Amanda J. ;
Mujumdar, Urvi ;
Armstrong, Bruce K. ;
Kricker, Anne ;
Marrett, Loraine D. ;
Millikan, Robert C. ;
Gruber, Stephen B. ;
Culver, Hoda Anton ;
Zanetti, Roberto ;
Gallagher, Richard P. ;
Dwyer, Terrence ;
Rebbeck, Timothy R. ;
Busam, Klaus ;
From, Lynn ;
Berwick, Marianne .
INTERNATIONAL JOURNAL OF EPIDEMIOLOGY, 2006, 35 (03) :756-764
[3]   Study design: Evaluating gene-environment interactions in the etiology of breast cancer - the WECARE study [J].
Bernstein, JL ;
Langholz, B ;
Haile, RW ;
Bernstein, L ;
Thomas, DC ;
Stovall, M ;
Malone, KE ;
Lynch, CF ;
Olsen, JH ;
Anton-Culver, H ;
Shore, RE ;
Boice, JD ;
Berkowitz, GS ;
Gatti, RA ;
Teitelbaum, SL ;
Smith, SA ;
Rosenstein, BS ;
Borresen-Dale, AL ;
Concannon, P .
BREAST CANCER RESEARCH, 2004, 6 (03) :R199-R214
[4]  
Bishop JN, 2006, BRIT J HOSP MED, V67, P299
[5]   Characterization of BRCA1 and BRCA2 Deleterious Mutations and Variants of Unknown Clinical Significance in Unilateral and Bilateral Breast Cancer: The WECARE Study [J].
Borg, Ake ;
Haile, Robert W. ;
Malone, Kathleen E. ;
Capanu, Marinela ;
Diep, Ahn ;
Torngren, Therese ;
Teraoka, Sharon ;
Begg, Colin B. ;
Thomas, Duncan C. ;
Concannon, Patrick ;
Mellemkjaer, Lene ;
Bernstein, Leslie ;
Tellhed, Lina ;
Xue, Shanyan ;
Olson, Eric R. ;
Liang, Xiaolin ;
Dolle, Jessica ;
Borresen-Dale, Anne-Lise ;
Bernstein, Jonine L. .
HUMAN MUTATION, 2010, 31 (03) :E1200-E1240
[6]  
BRESLOW NE, 1995, BIOMETRIKA, V82, P81
[7]   APPROXIMATE INFERENCE IN GENERALIZED LINEAR MIXED MODELS [J].
BRESLOW, NE ;
CLAYTON, DG .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (421) :9-25
[8]   The use of hierarchical models for estimating relative risks of individual genetic variants: An application to a study of melanoma [J].
Capanu, Marinela ;
Orlow, Irene ;
Berwick, Marianne ;
Hummer, Amanda J. ;
Thomas, Duncan C. ;
Begg, Colin B. .
STATISTICS IN MEDICINE, 2008, 27 (11) :1973-1992
[9]   Hierarchical modeling of linkage disequilibrum: Genetic structure and spatial relations [J].
Conti, DV ;
Witte, JS .
AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 72 (02) :351-363
[10]  
De Roos AJ, 2003, CANCER EPIDEM BIOMAR, V12, P14