Visually driven analysis of movement data by progressive clustering

被引:103
作者
Rinzivillo, Salvatore [1 ]
Pedreschi, Dino [1 ]
Nanni, Mirco [2 ]
Giannotti, Fosca [2 ]
Andrienko, Natalia [3 ]
Andrienko, Gennady [3 ]
机构
[1] Univ Pisa, KDD Lab, Pisa, Italy
[2] CNR, ISTI, KDD Lab, I-56100 Pisa, Italy
[3] Fraunhofer Inst IAIS, St Augustin, Germany
关键词
Trajectories; spatio-temporal data; visual analytics; geovisualization; exploratory data analysis; scalable methods;
D O I
10.1057/palgrave.ivs.9500183
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
The paper investigates the possibilities of using clustering techniques in visual exploration and analysis of large numbers of trajectories, that is, sequences of time-stamped locations of some moving entities. Trajectories are complex spatio-temporal constructs characterized by diverse non-trivial properties. To assess the degree of (dis) similarity between trajectories, specific methods (distance functions) are required. A single distance function accounting for all properties of trajectories, (1) is difficult to build, (2) would require much time to compute, and (3) might be difficult to understand and to use. We suggest the procedure of progressive clustering where a simple distance function with a clear meaning is applied on each step, which leads to easily interpretable outcomes. Successive application of several different functions enables sophisticated analyses through gradual refinement of earlier obtained results. Besides the advantages from the sense-making perspective, progressive clustering enables a rational work organization where time-consuming computations are applied to relatively small potentially interesting subsets obtained by means of 'cheap' distance functions producing quick results. We introduce the concept of progressive clustering by an example of analyzing a large real data set. We also review the existing clustering methods, describe the method OPTICS suitable for progressive clustering of trajectories, and briefly present several distance functions for trajectories. Information Visualization (2008) 7, 225-239. doi: 10.1057/palgrave.ivs.9500183
引用
收藏
页码:225 / 239
页数:15
相关论文
共 25 条
[1]  
Andrienko G., 2007, ACM SIGKDD EXPLORATI, V9, P38, DOI DOI 10.1145/1345448.1345455
[2]  
Ankerst M, 1999, SIGMOD RECORD, VOL 28, NO 2 - JUNE 1999, P49
[3]  
[Anonymous], 1994, P 20 INT C VER LARG
[4]  
ASSENT I, 2007, ACM SIGKDD EXPLORATI, V9, P5
[5]   Multi-step density-based clustering [J].
Brecheisen, S ;
Kriegel, HP ;
Pfeifle, M .
KNOWLEDGE AND INFORMATION SYSTEMS, 2006, 9 (03) :284-308
[6]  
BREUNIG M, 2001, P INT C MAN DAT ACM, P79
[7]  
Ester M., 1996, Proc. Second Int. Conf. Knowl. Discov. Data Min, P226, DOI DOI 10.5555/3001460.3001507
[8]  
Giannotti F., 2007, MOBILITY DATA MINING
[9]  
Goodell H, 2006, INFORMATION VISUALIZATION-BOOK, P117
[10]  
GTING RH, 2005, MOVING OBJECTS DATAB