Standard errors of prediction in generalized linear mixed models

被引:97
作者
Booth, JG [1 ]
Hobert, JP [1 ]
机构
[1] Univ Florida, Dept Stat, Gainesville, FL 32611 USA
关键词
best linear unbiased predictor; bias correction; bootstrap; conditional inference; laplace approximation; small-area estimation; Taylor series;
D O I
10.2307/2669622
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The unconditional mean squared error of prediction (UMSEP) is widely used as a measure of prediction variance for inferences concerning linear combinations of fixed and random effects in the classical normal theory mixed model. But the UMSEP is inappropriate for generalized linear mixed models where the conditional variance of the random effects depends on the data. When the random effects describe variation between independent small domains and domain-specific prediction is of interest, we propose a conditional mean squared error of prediction (CMSEP) as a general measure of prediction variance. The CMSEP is shown to be the sum of the conditional variance and a positive correction that accounts for the sampling variability of parameter estimates. We derive a second-order-correct estimate of the CMSEP that consists of three components: (a) a plug-in estimate of the conditional variance, (b) a plug-in estimate of a Taylor series approximation to the correction term, and (c) a bootstrap estimate of the bias incurred in (a). In the normal case our formulas based on the CMSEP provide a conditional alternative to the unconditional expansions of Fuller and Harter, Kackar and Harville, and Prasad and Rao. In addition, we show that the prediction variance formula obtained by Wolfinger and O'Connell and suggested by Breslow and Clayton is in fact Laplace's approximation to the CMSEP based on the assumption that the variance components are known and ignoring the bias-correction term. Thus this formula has a conditional interpretation in the small-domain setting and should not be interpreted unconditionally. Finally, although use of the CMSEP is motivated using entirely frequentist arguments, our second-order approximation to the CMSEP closely resembles a corresponding expansion for the Bayesian posterior variance.
引用
收藏
页码:262 / 272
页数:11
相关论文
共 46 条
[1]  
[Anonymous], 1979, Multivariate analysis
[2]   AN ERROR-COMPONENTS MODEL FOR PREDICTION OF COUNTY CROP AREAS USING SURVEY AND SATELLITE DATA [J].
BATTESE, GE ;
HARTER, RM ;
FULLER, WA .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1988, 83 (401) :28-36
[3]   A MIXED-EFFECTS MODEL FOR CATEGORICAL-DATA [J].
BEITLER, PJ ;
LANDIS, JR .
BIOMETRICS, 1985, 41 (04) :991-1000
[4]   APPROXIMATE INFERENCE IN GENERALIZED LINEAR MIXED MODELS [J].
BRESLOW, NE ;
CLAYTON, DG .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (421) :9-25
[5]   APPROXIMATE PREDICTIVE PIVOTS AND DENSITIES [J].
BUTLER, RW .
BIOMETRIKA, 1989, 76 (03) :489-501
[6]  
BUTLER RW, 1986, J ROY STAT SOC B MET, V48, P1
[7]   APPROACHES FOR EMPIRICAL BAYES CONFIDENCE-INTERVALS [J].
CARLIN, BP ;
GELFAND, AE .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1990, 85 (409) :105-114
[8]  
CARLIN BP, 1991, J ROY STAT SOC B MET, V53, P189
[9]  
DAVISON AC, 1986, BIOMETRIKA, V73, P323
[10]  
DEBRUIJN NG, 1981, ASYMPTOTIC METHODS A