Improving predictive inference under covariate shift by weighting the log-likelihood function

被引:1051
作者
Shimodaira, H [1 ]
机构
[1] Inst Stat Math, Minato Ku, Tokyo 1068569, Japan
关键词
Akaike information criterion; design of experiments; importance sampling; Kullback-Leibler divergence; misspecification; sample surveys; weighted least squares;
D O I
10.1016/S0378-3758(00)00115-4
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
A class of predictive densities is derived by weighting the observed samples in maximizing the log-likelihood function. This approach is effective in cases such as sample surveys or design of experiments, where the observed covariate follows a different distribution than that in the whole population. Under misspecification of the parametric model, the optimal choice of the weight function is asymptotically shown to be the ratio of the density function of the covariate in the population to that in the observations. This is the pseudo-maximum likelihood estimation of sample surveys. The optimality is defined by the expected Kullback-Leibler loss, and the optimal weight is obtained by considering the importance sampling identity. Under correct specification or the model, however, the ordinary maximum likelihood estimate (i.e. the uniform weight) is shown to be optimal asymptotically. For moderate sample size, the situation is in between the two extreme cases, and the weight function is selected by minimizing a variant of the information criterion derived as an estimate of the expected loss. The method is also applied to a weighted version of the Bayesian predictive density. Numerical examples as well as Monte-Carlo simulations are shown for polynomial regression. A connection with the robust parametric estimation is discussed. (C) 2000 Elsevier Science B.V. All rights reserved. MSC: 62B10; 62D05.
引用
收藏
页码:227 / 244
页数:18
相关论文
共 24 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
Amari S., 1985, LECT NOTES STAT, V28, DOI DOI 10.1007/978-1-4612-5056-2
[3]  
[Anonymous], SELECTING MODELS DAT
[4]   MINIMUM DISPARITY ESTIMATION FOR CONTINUOUS MODELS - EFFICIENCY, DISTRIBUTIONS AND ROBUSTNESS [J].
BASU, A ;
LINDSAY, BG .
ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 1994, 46 (04) :683-705
[5]   An Akaike information criterion for model selection in the presence of incomplete data [J].
Cavanaugh, JE ;
Shumway, RH .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1998, 67 (01) :45-65
[6]  
CRESSIE N, 1984, J ROY STAT SOC B MET, V46, P440
[7]  
DAVISON AC, 1986, BIOMETRIKA, V73, P323
[8]  
DUNSMORE IR, 1976, BIOMETRIKA, V63, P627, DOI 10.1093/biomet/63.3.627
[9]   GEOMETRY OF EXPONENTIAL FAMILIES [J].
EFRON, B .
ANNALS OF STATISTICS, 1978, 6 (02) :362-376
[10]   ROBUST ESTIMATION - A WEIGHTED MAXIMUM-LIKELIHOOD APPROACH [J].
FIELD, C ;
SMITH, B .
INTERNATIONAL STATISTICAL REVIEW, 1994, 62 (03) :405-424