Algorithm to determine ε-distance parameter in density based clustering

Cited by: 88
Authors
Jahirabadkar, Sunita [1]
Kulkarni, Parag [1]
Affiliation
[1] Coll Engn, Pune, Maharashtra, India
Keywords
Data mining; Clustering; Density based clustering; Subspace clustering; High dimensional data; Spatial databases
DOI
10.1016/j.eswa.2013.10.025
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification code
140502 [Artificial intelligence]
Abstract
The well-known clustering algorithm DBSCAN is founded on the density notion of clustering. However, its use of a global density parameter, the ε-distance, makes DBSCAN unsuitable for datasets of varying density, and guessing a value for this parameter is not straightforward. In this paper, we generalise the algorithm in two ways. First, we adaptively determine the key input parameter ε-distance, which makes DBSCAN independent of domain knowledge and thus satisfies the unsupervised notion of clustering. Second, deriving the ε-distance by checking the data distribution of each dimension makes the approach suitable for subspace clustering, which detects clusters enclosed in various subspaces of high dimensional data. Experimental results illustrate that our approach can efficiently find clusters of varying sizes, shapes, and densities. (C) 2013 Elsevier Ltd. All rights reserved.
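The abstract does not spell out how the ε-distance is derived, so the following is only an illustrative sketch and not the authors' algorithm. It substitutes a commonly used k-distance heuristic, applied separately to each dimension: compute every value's distance to its k-th nearest neighbour along that axis, sort those distances, and take the value just before the largest jump as a candidate ε. The function names, the parameter `k`, and the "largest jump" knee rule are all assumptions made for this example.

```python
def k_distances_1d(values, k):
    """Sorted distances from each value to its k-th nearest neighbour on one axis."""
    if not 1 <= k < len(values):
        raise ValueError("k must satisfy 1 <= k < number of values")
    result = []
    for i, v in enumerate(values):
        # Distances from v to every other value along this single dimension.
        dists = sorted(abs(v - w) for j, w in enumerate(values) if j != i)
        result.append(dists[k - 1])
    return sorted(result)


def knee_eps(sorted_kdist):
    """Assumed knee rule: return the k-distance just before the largest jump."""
    gaps = [b - a for a, b in zip(sorted_kdist, sorted_kdist[1:])]
    knee = max(range(len(gaps)), key=gaps.__getitem__)
    return sorted_kdist[knee]


def eps_per_dimension(points, k=2):
    """Candidate epsilon for every dimension of a list of equal-length tuples."""
    dims = len(points[0])
    return [
        knee_eps(k_distances_1d([p[d] for p in points], k))
        for d in range(dims)
    ]


# Four closely spaced points plus one far-away outlier (hypothetical data).
points = [(0, 0), (1, 5), (2, 10), (3, 15), (100, 500)]
print(eps_per_dimension(points, k=2))  # -> [2, 10]
```

In this toy data the outlier produces one very large k-distance per axis, so the knee falls at the spacing of the dense group. Real data rarely shows such a clean jump, and the choice of `k` would need tuning.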
Pages: 2939-2946
Number of pages: 8