Forecasting histogram time series with k-nearest neighbours methods

Cited by: 71
Authors
Arroyo, Javier [1]
Mate, Carlos [2]
Affiliations
[1] Univ Complutense, Dept Ingn Software & Inteligencia Artificial, E-28040 Madrid, Spain
[2] Univ Pontificia Comillas, Inst Invest Tecnol, ETSI, ICAI, Madrid 28015, Spain
Keywords
Density forecast; Finance; Nonlinear time series models; Non-parametric forecasting; Symbolic data analysis; Weather forecast;
DOI
10.1016/j.ijforecast.2008.07.003
Chinese Library Classification
F [Economics];
Discipline classification code
02;
Abstract
Histogram time series (HTS) describe situations where a distribution of values is available for each instant of time. These situations usually arise when contemporaneous or temporal aggregation is required. In these cases, histograms provide a summary of the data that is more informative than those provided by other aggregates such as the mean. Some fields where HTS are useful include economy, official statistics and environmental science. This article adapts the k-Nearest Neighbours (k-NN) algorithm to forecast HTS and, more generally, to deal with histogram data. The proposed k-NN relies on the choice of a distance that is used to measure dissimilarities between sequences of histograms and to compute the forecasts. The Mallows distance and the Wasserstein distance are considered. The forecasting ability of the k-NN adaptation is illustrated with meteorological and financial data, and promising results are obtained. Finally, further research issues are discussed. © 2008 International Institute of Forecasters. Published by Elsevier B.V. All rights reserved.
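The k-NN idea described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation and rests on several assumptions: each histogram is represented by a vector of quantiles on a fixed, equally spaced probability grid; the Mallows and Wasserstein distances are approximated by the root-mean-square and mean absolute differences of those quantiles; and the forecast is the plain average of the neighbours' successor quantiles. The function names (`mallows`, `wasserstein`, `knn_forecast`) and the parameters `d` (sequence length) and `k` are hypothetical.

```python
import numpy as np

def mallows(q1, q2):
    """Approximate Mallows (L2) distance between two quantile vectors."""
    q1, q2 = np.asarray(q1, float), np.asarray(q2, float)
    return float(np.sqrt(np.mean((q1 - q2) ** 2)))

def wasserstein(q1, q2):
    """Approximate Wasserstein (L1) distance between two quantile vectors."""
    q1, q2 = np.asarray(q1, float), np.asarray(q2, float)
    return float(np.mean(np.abs(q1 - q2)))

def knn_forecast(series, d=2, k=2, dist=mallows):
    """One-step-ahead forecast of a histogram time series (illustrative only).

    series: array-like of shape (T, m), one quantile vector per time step.
    d:      length of the sequences of histograms being compared.
    k:      number of nearest neighbours to average.
    """
    series = np.asarray(series, dtype=float)
    T = len(series)
    if T - d < k:
        raise ValueError("not enough past sequences for the requested d and k")
    query = series[T - d:]  # the most recent d histograms
    scored = []
    # Each past window series[end-d:end] has a known successor series[end].
    for end in range(d, T):
        window = series[end - d:end]
        total = sum(dist(a, b) for a, b in zip(window, query))
        scored.append((total, end))
    scored.sort()
    successors = [end for _, end in scored[:k]]
    # Assumption: combine neighbours by averaging their quantiles.
    return series[successors].mean(axis=0)
```

Calling `knn_forecast(series, d=1, k=2, dist=wasserstein)` swaps the distance without changing anything else, which mirrors the abstract's point that the method depends on the chosen distance.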
Pages: 192-207
Number of pages: 16