Using Bayesian model averaging to calibrate forecast ensembles

被引:1378
作者
Raftery, AE [1 ]
Gneiting, T [1 ]
Balabdaoui, F [1 ]
Polakowski, M [1 ]
机构
[1] Univ Washington, Dept Stat, Seattle, WA 98195 USA
关键词
D O I
10.1175/MWR2906.1
中图分类号
P4 [大气科学(气象学)];
学科分类号
0706 ; 070601 ;
摘要
Ensembles used for probabilistic weather forecasting often exhibit a spread-error correlation, but they tend to be underdispersive. This paper proposes a statistical method for postprocessing ensembles based on Bayesian model averaging (BMA), which is a standard method for combining predictive distributions from different sources. The BMA predictive probability density function (PDF) of any quantity of interest is a weighted average of PDFs centered on the individual bias-corrected forecasts, where the weights are equal to posterior probabilities of the models generating the forecasts and reflect the models' relative contributions to predictive skill over the training period. The BMA weights can be used to assess the usefulness of ensemble members, and this can be used as a basis for selecting ensemble members;, this can be useful given the cost of running large ensembles. The BMA PDF can be represented as an unweighted ensemble of any desired size, by simulating from the BMA predictive distribution. The BMA predictive variance can be decomposed into two components, one corresponding to the between-forecast variability, and the second to the within-forecast variability. Predictive PDFs or intervals based solely on the ensemble spread incorporate the first component but not the second. Thus BMA provides a theoretical explanation of the tendency of ensembles to exhibit a spread-error correlation but yet be underdispersive. The method was applied to 48-h forecasts of surface temperature in the Pacific Northwest in January-June 2000 using the University of Washington fifth-generation Pennsylvania State University-NCAR Mesoscale Model (MM5) ensemble. The predictive PDFs were much better calibrated than the raw ensemble, and the BMA forecasts were sharp in that 90% BMA prediction intervals were 66% shorter on average than those produced by sample climatology. As a by-product, BMA yields a deterministic point forecast, and this had root-mean-square errors 7% lower than the best of the ensemble members and 8% lower than the ensemble mean. Similar results were obtained for forecasts of sea level pressure. Simulation experiments show that BMA performs reasonably well when the underlying ensemble is calibrated, or even overdispersed.
引用
收藏
页码:1155 / 1174
页数:20
相关论文
共 79 条
[51]  
LEITH CE, 1974, MON WEATHER REV, V102, P409, DOI 10.1175/1520-0493(1974)102<0409:TSOMCF>2.0.CO
[52]  
2
[53]  
McCullagh P., 2018, Generalized Linear Models
[54]  
MCLACHLAN G., 2000, WILEY SER PROB STAT, DOI 10.1002/0471721182
[55]  
McLachlan G. J., 1997, EM ALGORITHM EXTENSI
[56]   The ECMWF ensemble prediction system: Methodology and validation [J].
Molteni, F ;
Buizza, R ;
Palmer, TN ;
Petroliagis, T .
QUARTERLY JOURNAL OF THE ROYAL METEOROLOGICAL SOCIETY, 1996, 122 (529) :73-119
[57]   Increasing the horizontal resolution of ensemble forecasts at CMC [J].
Pellerin, G ;
Lefaivre, L ;
Houtekamer, P ;
Girard, C .
NONLINEAR PROCESSES IN GEOPHYSICS, 2003, 10 (06) :463-468
[58]  
Raftery A. E., 1993, Testing structural equation models, Issue 5, V154, P163
[59]   Discussion: Performance of Bayesian model averaging [J].
Raftery, AE ;
Zheng, YY .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2003, 98 (464) :931-938
[60]  
Roulston MS, 2002, MON WEATHER REV, V130, P1653, DOI 10.1175/1520-0493(2002)130<1653:EPFUIT>2.0.CO