共 25 条
Dealing with missing data in family-based association studies:: A multiple imputation approach
被引:19
作者:
Croiseau, Pascal
Genin, Emmanuelle
Cordell, Heather J.
机构:
[1] INSERM, U535, F-94817 Villejuif, France
[2] Univ Paris Sud, UMR S535, Paris, France
[3] Univ Newcastle, Inst Human Genet, Newcastle Upon Tyne, Tyne & Wear, England
基金:
英国惠康基金;
关键词:
case-parent trio;
conditional logistic regression;
haplotype;
D O I:
10.1159/000100481
中图分类号:
Q3 [遗传学];
学科分类号:
071007 ;
090102 ;
摘要:
To test for association between a disease and a set of linked markers, or to estimate relative risks of disease, several different methods have been developed. Many methods for family data require that individuals be genotyped at the full set of markers and that phase can be reconstructed. Individuals with missing data are excluded from the analysis. This can result in an important decrease in sample size and a loss of information. A possible solution to this problem is to use missing-data likelihood methods. We propose an alternative approach, namely the use of multiple imputation. Briefly, this method consists in estimating from the available data all possible phased genotypes and their respective posterior probabilities. These posterior probabilities are then used to generate replicate imputed data sets via a data augmentation algorithm. We performed simulations to test the efficiency of this approach for case/parent trio data and we found that the multiple imputation procedure generally gave unbiased parameter estimates with correct type 1 error and confidence interval coverage. Multiple imputation had some advantages over missing data likelihood methods with regards to ease of use and model flexibility. Multiple imputation methods represent promising tools in the search for disease susceptibility variants.
引用
收藏
页码:229 / 238
页数:10
相关论文