Tools for analyzing multiple imputed datasets

被引:65
作者
Carlin, John B. [1 ]
Li, Ning
Greenwood, Philip [1 ]
Coffey, Carolyn
机构
[1] Murdoch Childrens Res Inst, Clin Epidemiol & Biostat Unit, Parkville, Vic, Australia
关键词
st0042; missing data; multiple imputation; Rubin's rule of combination; overall estimates;
D O I
10.1177/1536867X0300300302
中图分类号
O1 [数学]; C [社会科学总论];
学科分类号
03 ; 0303 ; 0701 ; 070101 ;
摘要
The method of multiple imputation (MI) is used increasingly for analyzing datasets with missing observations. Two sets of tasks are required in order to implement the method: (a) generating multiple complete datasets in which missing values have been imputed by simulating from an appropriate probability distribution and (b) analyzing the multiple imputed datasets and combining complete data inferences from them to form an overall inference for parameters of interest. An increasing number of software tools are available for task (a), although this is difficult to automate, because the method of imputation should depend on the context and available covariate data. When the quantity of missing data is not great, the sensitivity of results to the imputation model may be relatively low. In this context, software tools that enable task (b) to be performed with similar ease to the analysis of a single dataset should facilitate the wider use of multiple imputation. Such tools need not only to implement techniques for inference from multiple imputed datasets but also to allow standard manipulations such as transformation and recoding of variables. In this article, we describe a set of Stata commands that we have developed for manipulating and analyzing multiple datasets.
引用
收藏
页码:226 / 244
页数:19
相关论文
共 6 条
[1]  
[Anonymous], 1999, STATA TECH B
[2]   LARGE-SAMPLE SIGNIFICANCE LEVELS FROM MULTIPLY IMPUTED DATA USING MOMENT-BASED STATISTICS AND AN F-REFERENCE DISTRIBUTION [J].
LI, KH ;
RAGHUNATHAN, TE ;
RUBIN, DB .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1991, 86 (416) :1065-1073
[3]  
Little RJA, 1987, Statistical Analysis With Missing Data
[4]  
Raghunathan TE., 2001, SURV METHODOL, V27, P85, DOI DOI 10.1037/A0029315
[5]  
RUBIN DB, 1976, BIOMETRIKA, V63, P581, DOI 10.1093/biomet/63.3.581
[6]  
Schafer J.L, 1997, ANAL INCOMPLETE MULT