On the relationships between clustering and spatial co-location pattern mining

被引:18
作者
Huang, Yan [1 ]
Zhang, Pusheng [2 ]
Zhang, Chengyang [1 ]
机构
[1] Univ N Texas, Dept Comp Sci & Engn, Denton, TX 76203 USA
[2] Microsoft Corp, Redmond, WA 98052 USA
关键词
co-location; clustering; proximity function;
D O I
10.1142/S0218213008003777
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The goal of spatial co-location pattern mining is to find subsets of spatial features frequently located together in spatial proximity. Example co-location patterns include services requested frequently and located together from mobile devices (e. g., PDAs and cellular phones) and symbiotic species in ecology (e. g., Nile crocodile and Egyptian plover). Spatial clustering groups similar spatial objects together. Reusing research results in clustering, e. g. algorithms and visualization techniques, by mapping co-location mining problem into a clustering problem would be very useful. However, directly clustering spatial objects from various spatial features may not yield well-defined co-location patterns. Clustering spatial objects in each layer followed by overlaying the layers of clusters may not applicable to many application domains where the spatial objects in some layers are not clustered. In this paper, we propose a new approach to the problem of mining co-location patterns using clustering techniques. First, we propose a novel framework for co-location mining using clustering techniques. We show that the proximity of two spatial features can be captured by summarizing their spatial objects embedded in a continuous space via various techniques. We de. ne the desired properties of proximity functions compared to similarity functions in clustering. Furthermore, we summarize the properties of a list of popular spatial statistical measures as the proximity functions. Finally, we show that clustering techniques can be applied to reveal the rich structure formed by co-located spatial features. A case study on real datasets shows that our method is effective for mining co-locations from large spatial datasets.
引用
收藏
页码:55 / 70
页数:16
相关论文
共 15 条
[1]  
Agarwal R., 1994, P 20 INT C VER LARG
[2]  
[Anonymous], 2005, [Data mining: concepts and techniques]
[3]  
Cressie NA, 1991, STAT SPATIAL DATA
[4]  
Ester M., 1996, P 2 INT C KNOWL DISC, P226, DOI DOI 10.5555/3001460.3001507
[5]  
Estivill-Castro V., 1998, P 2 PAC AS C KNOWL D
[6]  
ESTIVILLCASTRO V, 2001, P 6 INT C GEOC
[7]  
HUANG Y, 2004, IEEE T KNOWLEDGE DAT, V16
[8]  
Jain K, 1988, Algorithms for clustering data
[9]  
KOPERSKI K, 1995, P 4 INT S SPAT DAT
[10]  
MCMAHON, 2001, ENV MANAGEMENT, V28