Time-Series Data Mining

被引:683
作者
Esling, Philippe [1 ]
Agon, Carlos [1 ]
机构
[1] IRCAM, F-75004 Paris, France
关键词
Algorithms; Performance; Distance measures; data indexing; data mining; query by content; sequence matching; similarity measures; stream analysis; temporal analysis; time series; SIMILARITY SEARCH; ANOMALY DETECTION; DISTANCE MEASURE; CLASSIFICATION; PREDICTION; PATTERN; MOTIFS; REPRESENTATION; DISCOVERY; RETRIEVAL;
D O I
10.1145/2379776.2379788
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In almost every scientific field, measurements are performed over time. These observations lead to a collection of organized data called time series. The purpose of time-series data mining is to try to extract all meaningful knowledge from the shape of data. Even if humans have a natural capacity to perform these tasks, it remains a complex problem for computers. In this article we intend to provide a survey of the techniques applied for time-series data mining. The first part is devoted to an overview of the tasks that have captured most of the interest of researchers. Considering that in most cases, time-series task relies on the same components for implementation, we divide the literature depending on these common aspects, namely representation techniques, distance measures, and indexing methods. The study of the relevant literature has been categorized for each individual aspects. Four types of robustness could then be formalized and any kind of distance could then be classified. Finally, the study submits various research trends and avenues that can be explored in the near future. We hope that this article can provide a broad and deep understanding of the time-series data mining research field.
引用
收藏
页数:34
相关论文
共 208 条
[1]  
Abonyi J, 2003, LECT NOTES COMPUT SC, V2810, P275, DOI 10.1007/978-3-540-45231-7_26
[2]  
Agrawal R., 1993, Foundations of Data Organization and Algorithms. 4th International Conference. FODO '93 Proceedings, P69
[3]  
Agrawal R., 1995, VLDB '95. Proceedings of the 21st International Conference on Very Large Data Bases, P490
[4]   An Empirical Comparison of Machine Learning Models for Time Series Forecasting [J].
Ahmed, Nesreen K. ;
Atiya, Amir F. ;
El Gayar, Neamat ;
El-Shishiny, Hisham .
ECONOMETRIC REVIEWS, 2010, 29 (5-6) :594-621
[5]  
Ahmed Tarem., 2007, P 2 USENIX WORKSH TA, P1
[6]  
An JY, 2003, LECT NOTES COMPUT SC, V2690, P614
[7]  
[Anonymous], SYMB DAT AN 4 EUR C
[8]  
[Anonymous], LECT NOTES COMPUTER
[9]  
[Anonymous], P 11 INT C EXT DAT T
[10]  
[Anonymous], LECT NOTES COMPUTER