Estimation and testing of genotype and haplotype effects in case-control studies: Comparison of weighted regression and multiple imputation procedures

被引:44
作者
Cordell, HJ [1 ]
机构
[1] Univ Cambridge, Dept Med Genet, Cambridge Inst Med Res, Addenbrookes Hosp, Cambridge CB2 2XY, England
基金
英国惠康基金;
关键词
association; odds ratio; genotype relative risk;
D O I
10.1002/gepi.20142
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
A popular approach for testing and estimating genotype and haplotype effects associated with a disease outcome is to conduct a population-based case/control study, in which haplotypes are not directly observed but may be inferred probabilistically from unphased genotype data. A variety of methods exist to analyse the resulting data while accounting for the uncertainty in haplotype assignment, but most focus on the issue of testing the global null hypothesis that no genotype or haplotype effects exist. A more interesting question, once a region of disease association has been identified, is to estimate the relevant genotypic or haplotypic effects and to perform tests of complex null hypotheses such as the hypothesis that some loci, but not others, are associated with disease. Here I examine the assumptions behind, and the performance of, two classes of methods for addressing this question. The first is a weighted regression approach in which posterior probabilities of haplotype assignments are used as weights in a logistic regression analysis, generating a test based on either a weighted pseudo-likelihood, or a weighted log-likelihood. The second is a multiple imputation approach using either an improper procedure in which the posterior probabilities are used to generate replicate imputed data sets, or a proper data augmentation procedure. I compare these approaches to a simple expectation substitution (haplotype trend regression) approach. In simulations, all methods gave unbiased parameter estimation but the weighted pseudo-likelihood, expectation substitution and multiple imputation methods had superior confidence interval coverage. For the weighted pseudo-likelihood and expectation substitution methods it was necessary to estimate posterior haplotype assignment probabilities using the combined case/control data, whereas for the multiple imputation approaches it was necessary to estimate these probabilities in the case and control groups separately. Overall, multiple imputation was easiest approach to implement in standard statistical software and to extend to more complex models such as those that include gene-gene or gene-environment interactions.
引用
收藏
页码:259 / 275
页数:17
相关论文
共 51 条
  • [1] Baker SG, 2005, STAT APPL GENET MOL, V4
  • [2] Breslow NE, 1980, STAT METHODS CANC RE, V1, DOI DOI 10.1097/00002030-199912240-00009
  • [3] Breslow NE, 1987, STAT METHODS CANC RE, VII
  • [4] A note on inference of trait associations with SNP haplotypes and other attributes in generalized linear models
    Burkett, K
    McNeney, B
    Graham, J
    [J]. HUMAN HEREDITY, 2004, 57 (04) : 200 - 206
  • [5] Tools for analyzing multiple imputed datasets
    Carlin, John B.
    Li, Ning
    Greenwood, Philip
    Coffey, Carolyn
    [J]. STATA JOURNAL, 2003, 3 (03) : 226 - 244
  • [6] Detecting disease associations due to linkage disequilibrium using haplotype tags: A class of tests and the determinants of statistical power
    Chapman, JM
    Cooper, JD
    Todd, JA
    Clayton, DG
    [J]. HUMAN HEREDITY, 2003, 56 (1-3) : 18 - 31
  • [7] A generalization of the transmission/disequilibrium test for uncertain-haplotype transmission
    Clayton, D
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 1999, 65 (04) : 1170 - 1177
  • [8] Use of unphased multilocus genotype data in indirect association studies
    Clayton, D
    Chapman, J
    Cooper, J
    [J]. GENETIC EPIDEMIOLOGY, 2004, 27 (04) : 415 - 428
  • [9] Clayton D., 1993, STAT MODELS EPIDEMIO
  • [10] Hierarchical modeling of linkage disequilibrum: Genetic structure and spatial relations
    Conti, DV
    Witte, JS
    [J]. AMERICAN JOURNAL OF HUMAN GENETICS, 2003, 72 (02) : 351 - 363