Dealing with missing data in family-based association studies:: A multiple imputation approach

被引:19
作者
Croiseau, Pascal
Genin, Emmanuelle
Cordell, Heather J.
机构
[1] INSERM, U535, F-94817 Villejuif, France
[2] Univ Paris Sud, UMR S535, Paris, France
[3] Univ Newcastle, Inst Human Genet, Newcastle Upon Tyne, Tyne & Wear, England
基金
英国惠康基金;
关键词
case-parent trio; conditional logistic regression; haplotype;
D O I
10.1159/000100481
中图分类号
Q3 [遗传学];
学科分类号
071007 ; 090102 ;
摘要
To test for association between a disease and a set of linked markers, or to estimate relative risks of disease, several different methods have been developed. Many methods for family data require that individuals be genotyped at the full set of markers and that phase can be reconstructed. Individuals with missing data are excluded from the analysis. This can result in an important decrease in sample size and a loss of information. A possible solution to this problem is to use missing-data likelihood methods. We propose an alternative approach, namely the use of multiple imputation. Briefly, this method consists in estimating from the available data all possible phased genotypes and their respective posterior probabilities. These posterior probabilities are then used to generate replicate imputed data sets via a data augmentation algorithm. We performed simulations to test the efficiency of this approach for case/parent trio data and we found that the multiple imputation procedure generally gave unbiased parameter estimates with correct type 1 error and confidence interval coverage. Multiple imputation had some advantages over missing data likelihood methods with regards to ease of use and model flexibility. Multiple imputation methods represent promising tools in the search for disease susceptibility variants.
引用
收藏
页码:229 / 238
页数:10
相关论文
共 25 条
[1]   Genetic interaction of CTLA-4 with HLA-DR15 in multiple sclerosis patients [J].
Alizadeh, M ;
Babron, MC ;
Birebent, B ;
Matsuda, F ;
Quelvennec, E ;
Liblau, R ;
Cournu-Rebeix, I ;
Momigliano-Richiardi, P ;
Sequeiros, J ;
Yaouanq, J ;
Genin, E ;
Vasilescu, A ;
Bougerie, H ;
Trojano, M ;
Silva, BM ;
Maciel, P ;
Clerget-Darpoux, F ;
Clanet, M ;
Edan, G ;
Fontaine, B ;
Semana, G .
ANNALS OF NEUROLOGY, 2003, 54 (01) :119-122
[2]   A generalization of the transmission/disequilibrium test for uncertain-haplotype transmission [J].
Clayton, D .
AMERICAN JOURNAL OF HUMAN GENETICS, 1999, 65 (04) :1170-1177
[3]   Estimation and testing of genotype and haplotype effects in case-control studies: Comparison of weighted regression and multiple imputation procedures [J].
Cordell, HJ .
GENETIC EPIDEMIOLOGY, 2006, 30 (03) :259-275
[4]   Case/pseudocontrol analysis in genetic association studies: A unified framework for detection of genotype and haplotype associations, gene-gene and gene-environment interactions, and parent-of-origin effects [J].
Cordell, HJ ;
Barratt, BJ ;
Clayton, DG .
GENETIC EPIDEMIOLOGY, 2004, 26 (03) :167-185
[5]   A unified stepwise regression procedure for evaluating the relative effects of polymorphisms within a gene using case/control or family data:: Application to HLA in type 1 diabetes [J].
Cordell, HJ ;
Clayton, DG .
AMERICAN JOURNAL OF HUMAN GENETICS, 2002, 70 (01) :124-141
[6]   Stochastic algorithms for Markov models estimation with intermittent missing data [J].
Deltour, I ;
Richardson, S ;
Le Hesran, JY .
BIOMETRICS, 1999, 55 (02) :565-573
[7]   Unbiased application of the transmission/disequilibrium test to multilocus haplotypes [J].
Dudbridge, F ;
Koeleman, BPC ;
Todd, JA ;
Clayton, DG .
AMERICAN JOURNAL OF HUMAN GENETICS, 2000, 66 (06) :2009-2012
[8]   Pedigree disequilibrium tests for multilocus haplotypes [J].
Dudbridge, F .
GENETIC EPIDEMIOLOGY, 2003, 25 (02) :115-121
[9]   Family-based tests for associating haplotypes with general phenotype data: Application to asthma genetics [J].
Horvath, S ;
Xu, X ;
Lake, SL ;
Silverman, EK ;
Weiss, ST ;
Laird, NM .
GENETIC EPIDEMIOLOGY, 2004, 26 (01) :61-69
[10]   A method for identifying genes related to a quantitative trait, incorporating multiple siblings and missing parents [J].
Kistner, EO ;
Weinberg, CR .
GENETIC EPIDEMIOLOGY, 2005, 29 (02) :155-165