Generalized additive models for large data sets

Cited by: 209
Authors
Wood, Simon N. [1]
Goude, Yannig [2]
Shaw, Simon [1]
Affiliations
[1] Univ Bath, Bath BA2 7AY, Avon, England
[2] Elect France, F-92141 Clamart, France
Funding
UK Engineering and Physical Sciences Research Council (EPSRC)
Keywords
Correlated additive model; Electricity load prediction; Generalized additive model estimation; Smoothing parameter estimation; Regression; Likelihood
DOI
10.1111/rssc.12068
Chinese Library Classification
O21 [Probability and Mathematical Statistics]; C8 [Statistics]
Subject classification codes
020208; 070103; 0714
Abstract
We consider an application in electricity grid load prediction, where generalized additive models are appropriate, but where the data set's size can make their use practically intractable with existing methods. We therefore develop practical generalized additive model fitting methods for large data sets in the case in which the smooth terms in the model are represented by using penalized regression splines. The methods use iterative update schemes to obtain factors of the model matrix while requiring only subblocks of the model matrix to be computed at any one time. We show that efficient smoothing parameter estimation can be carried out in a well-justified manner. The grid load prediction problem requires updates of the model fit, as new data become available, and some means for dealing with residual auto-correlation in grid load. Methods are provided for these problems and parallel implementation is covered. The methods allow estimation of generalized additive models for large data sets by using modest computer hardware, and the grid load prediction problem illustrates the utility of reduced rank spline smoothing methods for dealing with complex modelling problems.
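The idea of building a factor of the model matrix from subblocks can be illustrated with a block-wise QR update. The following is a minimal sketch under stated assumptions, not the authors' implementation: the function names `blockwise_qr` and `fit_penalized`, the penalty matrix `S`, and the smoothing parameter `lam` are hypothetical, and a Gaussian (least-squares) working model is assumed. Only one block of rows is held in memory at a time, and the accumulated triangular factor `R` and vector `f` are all that a penalized fit then needs.

```python
import numpy as np


def blockwise_qr(blocks, p):
    """Accumulate R and f = Q'y from row blocks of (X, y), never forming all of X.

    blocks: iterable of (X_blk, y_blk) pairs, each X_blk having p columns.
    Returns R (upper triangular), f, and the unpenalized residual sum of squares.
    """
    R = np.zeros((0, p))
    f = np.zeros(0)
    rss = 0.0
    for X_blk, y_blk in blocks:
        # Stack the current factor on top of the new rows and re-factorize.
        stacked_X = np.vstack([R, X_blk])
        stacked_y = np.concatenate([f, y_blk])
        Q, R = np.linalg.qr(stacked_X, mode="reduced")
        f = Q.T @ stacked_y
        # The part of stacked_y outside the column space of Q adds to the RSS.
        rss += stacked_y @ stacked_y - f @ f
    return R, f, rss


def fit_penalized(R, f, S, lam):
    """Solve the penalized least-squares problem using only R and f.

    Minimizes ||y - X b||^2 + lam * b' S b, using X'X = R'R and X'y = R'f.
    """
    return np.linalg.solve(R.T @ R + lam * S, R.T @ f)
```

Because `R.T @ R` equals `X.T @ X` and `R.T @ f` equals `X.T @ y`, different values of `lam` can be tried without revisiting the data, which is the property that makes smoothing parameter selection affordable in this kind of scheme.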
Pages: 139-155
Number of pages: 17