Time-series clustering - A decade review

被引:1121
作者
Aghabozorgi, Saeed [1 ]
Shirkhorshidi, Ali Seyed [1 ]
Teh Ying Wah [1 ]
机构
[1] Univ Malaya, Dept Informat Syst, Fac Comp Sci & Informat Technol, Kuala Lumpur 50603, Malaysia
关键词
Clustering; Time-series; Distance measure; Evaluation measure; Representations; GENE-EXPRESSION DATA; SIMILARITY SEARCH; DIMENSIONALITY REDUCTION; AVERAGING METHOD; REPRESENTATION; ALGORITHMS; MODEL; CLASSIFICATION; COMPRESSION; RECOGNITION;
D O I
10.1016/j.is.2015.04.007
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering is a solution for classifying enormous data when there is not any early knowledge about classes. With emerging new concepts like cloud computing and big data and their vast applications in recent years, research works have been increased on unsupervised solutions like clustering algorithms to extract knowledge from this avalanche of data. Clustering time-series data has been used in diverse scientific areas to discover patterns which empower data analysts to extract valuable information from complex and massive datasets. In case of huge datasets, using supervised classification solutions is almost impossible, while clustering can solve this problem using unsupervised approaches. In this research work, the focus is on time-series data, which is one of the popular data types in clustering problems and is broadly used from gene expression data in biology to stock market analysis in finance. This review will expose four main components of time-series clustering and is aimed to represent an updated investigation on the trend of improvements in efficiency, quality and complexity of clustering time-series approaches during the last decade and enlighten new paths for future works. (C) 2015 Elsevier Ltd. All rights reserved.
引用
收藏
页码:16 / 38
页数:23
相关论文
共 242 条
[31]  
[Anonymous], VASA, DOI DOI 10.1002/1521-3773(20010316)40:63.3.CO
[32]  
2-C
[33]  
[Anonymous], 2001, P WORKSH LEARN TEMP
[34]  
[Anonymous], TEMPORAL DATA MINING
[35]  
[Anonymous], 2007, Proceedings of the 33rd International Conference on Very Large Data Bases. VLDB'07
[36]  
[Anonymous], SIGKDD
[37]  
[Anonymous], 2014, RECENT ADV INTELLIGE
[38]  
[Anonymous], 2004, P 21 INT C MACH LEAR
[39]  
[Anonymous], 2015, Retriev Technologies
[40]  
[Anonymous], INT J COMPUT SCI INF