A hypergraph based clustering algorithm for spatial data sets

被引:16
作者
Cherng, JS [1 ]
Lo, MJ [1 ]
机构
[1] Da Yeh Univ, Dept Elect Engn, Changhua, Taiwan
来源
2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS | 2001年
关键词
data mining; clustering; hypergraph;
D O I
10.1109/ICDM.2001.989504
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering is a discovery process in data mining and can he used to group together the objects of a database into meaningful subclasses which serve as the foundation for other data analysis techniques. In this paper, we focus on dealing with a set of spatial data. For the spatial data, the clustering problem becomes that of finding the densely populated regions of the space and thus grouping these regions into clusters such that the intracluster similarity is maximized and the intercluster similarity is minimized. Me develop a novel hierarchical clustering algorithm that uses a hypergraph to represent a set of spatial data. This hypergraph is initially, constructed from the Delaunay triangulation graph of the data set and can correctly capture the relationships among sets of data points. Two phases are developed for the proposed clustering algorithm to find the clusters in the data set. We evaluate our hierarchical clustering algorithm with some spatial data sets in which contain clusters of different sizes, shapes, densities, and noise. Experimental results on these data sets are very, encouraging.
引用
收藏
页码:83 / 90
页数:8
相关论文
共 14 条
[1]  
[Anonymous], 1996, P AAAI INT C KNOWL D
[2]  
Berge C, 1976, Graphs and Hypergraphs
[3]  
BUI T, 1989, P ACM IEEE DES AUT C, P775
[4]   Data mining: An overview from a database perspective [J].
Chen, MS ;
Han, JW ;
Yu, PS .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (06) :866-883
[5]   Efficient bipartitioning algorithm for size-constrained circuits [J].
Cherng, JS ;
Chen, SJ ;
Ho, JM .
IEE PROCEEDINGS-COMPUTERS AND DIGITAL TECHNIQUES, 1998, 145 (01) :37-45
[6]  
DUTT S, 1996, P IEEE INT C COMP AI, P194
[7]  
GUHA S, 1999, P 15 INT C DAT ENG
[8]  
GUHA S, 1998, P 1998 ACM SIGMOD IN
[9]  
Jain K, 1988, Algorithms for clustering data
[10]   Chameleon: Hierarchical clustering using dynamic modeling [J].
Karypis, G ;
Han, EH ;
Kumar, V .
COMPUTER, 1999, 32 (08) :68-+