Mining co-location patterns with rare events from spatial data sets

被引:109
作者
Huang, Yan [1 ]
Pei, Jian
Xiong, Hui
机构
[1] Univ N Texas, Dept Comp Sci & Engn, Denton, TX 76203 USA
[2] Simon Fraser Univ, Sch Comp Sci, Burnaby, BC V5A 1S6, Canada
[3] Rutgers State Univ, Management Sci & Informat Syst Dept, Newark, NJ 07102 USA
基金
加拿大自然科学与工程研究理事会;
关键词
spatial data mining; co-location patterns; spatial association rules;
D O I
10.1007/s10707-006-9827-8
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A co-location pattern is a group of spatial features/events that are frequently co-located in the same region. For example, human cases of West Nile Virus often occur in regions with poor mosquito control and the presence of birds. For co-location pattern mining, previous studies often emphasize the equal participation of every spatial feature. As a result, interesting patterns involving events with substantially different frequency cannot be captured. In this paper, we address the problem of mining co-location patterns with rare spatial features. Specifically, we first propose a new measure called the maximal participation ratio (maxPR) and show that a co-location pattern with a relatively high maxPR value corresponds to a co-location pattern containing rare spatial events. Furthermore, we identify a weak monotonicity property of the maxPR measure. This property can help to develop an efficient algorithm to mine patterns with high maxPR values. As demonstrated by our experiments, our approach is effective in identifying co-location patterns with rare events, and is efficient and scalable for large-scale data sets.
引用
收藏
页码:239 / 260
页数:22
相关论文
共 28 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
Agrawal R, 1994, P 20 INT C VER LARG, V1215, P487
[3]  
[Anonymous], P 7 INT C DAT MIN KD
[4]  
Arge L., 1998, Proceedings of the Twenty-Fourth International Conference on Very-Large Databases, P570
[5]  
Cheung D., 2004, P 10 ACM SIGKDD INT, P384
[6]  
Chou Y. H., 1997, EXPLORING SPATIAL AN
[7]  
Cressie NA, 1991, STAT SPATIAL DATA
[8]  
*ENV SYST RES I IN, ARCGIS FAM
[9]  
Ester M., 1998, Proceedings Fourth International Conference on Knowledge Discovery and Data Mining, P44
[10]  
Estivill-Castro V, 1998, LECT NOTES ARTIF INT, V1394, P110