Spatial data mining: Database primitives, algorithms and efficient DBMS support

被引:51
作者
Ester, M [1 ]
Frommelt, A [1 ]
Kriegel, HP [1 ]
Sander, J [1 ]
机构
[1] Univ Munich, Inst Comp Sci, D-80538 Munich, Germany
关键词
mining spatial data; database primitives for KDD;
D O I
10.1023/A:1009843930701
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spatial data mining algorithms heavily depend on the efficient processing of neighborhood relations since the neighbors of many objects have to be investigated in a single run of a typical algorithm. Therefore, providing general concepts for neighborhood relations as well as an efficient implementation of these concepts will allow a tight integration of spatial data mining algorithms with a spatial database management system. This will speed up both, the development and the execution of spatial data mining algorithms. In this paper, we define neighborhood graphs and paths and a small set of database primitives for their manipulation. We show that typical spatial data mining algorithms are well supported by the proposed basic operations. For finding significant spatial patterns, only certain classes of paths "leading away" from a starting object are relevant. We discuss filters allowing only such neighborhood paths which will significantly reduce the search space for spatial data mining algorithms. Furthermore, we introduce neighborhood indices to speed up the processing of our database primitives. We implemented the database primitives on top of a commercial spatial database management system. The effectiveness and efficiency of the proposed approach was evaluated by using an analytical cost model and an extensive experimental study on a geographic database.
引用
收藏
页码:193 / 216
页数:24
相关论文
共 16 条
[1]   DATABASE MINING - A PERFORMANCE PERSPECTIVE [J].
AGRAWAL, R ;
IMIELINSKI, T ;
SWAMI, A .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1993, 5 (06) :914-925
[2]  
[Anonymous], P 4 INT S LARG SPAT
[3]  
BILL F, 1909, FUNDAMENTALS GEOGRAP
[4]  
EGENHOFER MJ, 1991, P SSD 91, P143
[5]  
Ester M., 1998, Proceedings Fourth International Conference on Knowledge Discovery and Data Mining, P44
[6]  
Fayyad U. M., 1996, ADV KNOWLEDGE DISCOV, P1, DOI DOI 10.1609/AIMAG.V17I3.1230
[7]  
GUETING RH, 1994, VLDB J, V3
[8]  
Guttman A., 1984, P ACM SIGMOD INT C M, P47, DOI DOI 10.1145/602259.602266
[9]  
KOPERSKI K, 1996, P SIGMOD WORKSH RES
[10]  
KOPERSKI K, 1998, P S SPAT DAT HANDL S