Diagnostics for multivariate imputations

被引:129
作者
Abayomi, Kobi [1 ]
Gelman, Andrew [1 ]
Levy, Marc [1 ]
机构
[1] Columbia Univ, Dept Stat, New York, NY 10027 USA
关键词
environmental statistics; missing values; multiple imputation; multivariate statistics; sustainability;
D O I
10.1111/j.1467-9876.2007.00613.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider three sorts of diagnostics for random imputations: displays of the completed data, which are intended to reveal unusual patterns that might suggest problems with the imputations, comparisons of the distributions of observed and imputed data values and checks of the fit of observed data to the model that is used to create the imputations. We formulate these methods in terms of sequential regression multivariate imputation, which is an iterative procedure in which the missing values of each variable are randomly imputed conditionally on all the other variables in the completed data matrix. We also consider a recalibration procedure for sequential regression imputations. We apply these methods to the 2002 environmental sustainability index, which is a linear aggregation of 64 environmental variables on 142 countries.
引用
收藏
页码:273 / 291
页数:19
相关论文
共 19 条
[1]  
Besag J, 2001, STAT SCI, V16, P265
[2]   ROBUST LOCALLY WEIGHTED REGRESSION AND SMOOTHING SCATTERPLOTS [J].
CLEVELAND, WS .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1979, 74 (368) :829-836
[3]  
HECKMAN JJ, 1976, ANN ECON SOC MEAS, V5, P475
[4]  
JOHNSON R, 1998, APPL MULTIVARIATE DA
[5]  
Little R. J., 2019, STAT ANAL MISSING DA, V793, DOI DOI 10.1002
[6]  
LIU C, 1995, J MULTIVARIATE ANAL, V48, P198
[7]  
PRESCOTTALLEN R, 2001, WELBEING NATIONS
[8]  
Raghunathan TE., 2001, SURV METHODOL, V27, P85, DOI DOI 10.1037/A0029315
[9]  
RAGHUNATHAN TE, 2002, IVEWARE
[10]  
Rubin D. B., 1978, P SURVEY RES METHODS, V1, P20, DOI DOI 10.1631/JZUS.C10B0359