Multiple imputation after 18+ years

被引:2418
作者
Rubin, DB
机构
[1] Department of Statistics, Harvard University, Cambridge, MA
关键词
confidence validity; missing data; nonresponse in surveys; public-use files; sample surveys; superefficient procedures;
D O I
10.1080/01621459.1996.10476908
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Multiple imputation was designed to handle the problem of missing data in public-use data bases where the data-base constructor and the ultimate user are distinct entities. The objective is valid frequency inference for ultimate users who in general have access only to complete-data software and possess limited knowledge of specific reasons and models for nonresponse. For this situation and objective, I believe that multiple imputation by the data-base constructor is the method of choice. This article first provides a description of the assumed context and objectives, and second, reviews the multiple imputation framework and its standard results. These preliminary discussions are especially important because some recent commentaries on multiple imputation have reflected either misunderstandings of the practical objectives of multiple imputation or misunderstandings of fundamental theoretical results. Then, criticisms of multiple imputation are considered, and, finally, comparisons are made to alternative strategies.
引用
收藏
页码:473 / 489
页数:17
相关论文
共 139 条
[1]  
BARNARD J, 1995, THESIS U CHICAGO
[2]  
BELIN TR, 1993, J AM STAT ASSOC, V88, P1149, DOI 10.2307/2290812
[3]  
BELIN TR, 1990, P GOV STAT SECT AM S, P124
[4]  
BLOXOM B, 1995, J EDUC BEHAV STAT, V20, P1
[5]  
BOSHUIZEN HC, 1995, C95014 TNO
[6]  
BRAND J, 1994, J AM MED INFORM ASSN, P303
[7]  
BRAND J, 1994, UNPUB SCAMC 95
[8]  
BROWNSTONE D, 1991, 9137 MBS U CALIFORNI
[9]  
BROWNSTONE D, 1996, IN PRESS REV ECON
[10]  
BURNS EM, 1993, DOEEIA0555931