Imputations of missing values in practice: Results from imputations of serum cholesterol in 28 cohort studies

被引:215
作者
Barzi, F [1 ]
Woodward, M [1 ]
机构
[1] George Inst Int Hlth, Asia Pacific Cohort Studies Collaborat Secretaria, Camperdown, NSW 2050, Australia
关键词
bias; cholesterol; coronary disease; hazard rate; imputation; meta-analysis; missing data; mortality;
D O I
10.1093/aje/kwh175
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 [公共卫生与预防医学]; 120402 [社会医学与卫生事业管理];
摘要
Missing values, common in epidemiologic studies, are a major issue in obtaining valid estimates. Simulation studies have suggested that multiple imputation is an attractive method for imputing missing values, but it is relatively complex and requires specialized software. For each of 28 studies in the Asia Pacific Cohort Studies Collaboration, a comparison of eight imputation procedures (unconditional and conditional mean, multiple hot deck, expectation maximization, and four different approaches to multiple imputation) and the naive, complete participant analysis are presented in this paper. Criteria used for comparison were the mean and standard deviation of total cholesterol and the estimated coronary mortality hazard ratio for a one-unit increase in cholesterol. Further sensitivity analyses allowed for systematic over- or underestimation of cholesterol. For 22 studies for which less than 10% of the values for cholesterol were missing, and for the pooled Asia Pacific Cohort Studies Collaboration, all methods gave similar results. For studies with roughly 10-60% missing values, clear differences existed between the methods, in which case past research suggests that multiple imputation is the method of choice. For two studies with over 60% missing values, no imputation method seemed to be satisfactory.
引用
收藏
页码:34 / 45
页数:12
相关论文
共 28 条
[1]
Allison P.D., 2001, SERIES QUANTITATIVE, V136
[2]
[Anonymous], 1997, Analysis of incomplete multivariate data
[3]
Multiple imputation of baseline data in the cardiovascular health study [J].
Arnold, AM ;
Kronmal, RA .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 2003, 157 (01) :74-84
[4]
*AS PAC COH STUD C, 1999, CVD PREVENTION, V0002
[5]
Applications of multiple imputation in medical studies: from AIDS as NHANES [J].
Barnard, J ;
Meng, XL .
STATISTICAL METHODS IN MEDICAL RESEARCH, 1999, 8 (01) :17-36
[6]
Clayton, 1999, HOTDECK STATA MODULE
[7]
Bias due to missing exposure data using complete-case analysis in the proportional hazards regression model [J].
Demissie, S ;
LaValley, MP ;
Horton, NJ ;
Glynn, RJ ;
Cupples, LA .
STATISTICS IN MEDICINE, 2003, 22 (04) :545-557
[8]
Imputation of missing values in the case of a multiple item instrument measuring alcohol consumption [J].
Gmel, G .
STATISTICS IN MEDICINE, 2001, 20 (15) :2369-2381
[9]
A critical look at methods for handling missing covariates in epidemiologic regression analyses [J].
Greenland, S ;
Finkle, WD .
AMERICAN JOURNAL OF EPIDEMIOLOGY, 1995, 142 (12) :1255-1264
[10]
HAITOVSKY Y, 1968, J R STAT SOC B, V30, P67