DHeat: A Density Heat-Based Algorithm for Clustering With Effective Radius

被引:34
作者
Chen, Yewang [1 ]
Tang, Shengyu [1 ]
Pei, Songwen [3 ,4 ]
Wang, Cheng [1 ]
Du, Jixiang [2 ]
Xiong, Naixue [3 ]
机构
[1] Huaqiao Univ, Coll Comp Sci & Technol, Sch Comp Sci & Technol, Xiamen 362021, Peoples R China
[2] Huaqiao Univ, Coll Comp Sci & Technol, Xiamen 362021, Peoples R China
[3] Univ Shanghai Sci & Technol, Shanghai Key Lab Modern Opt Syst, Shanghai 200093, Peoples R China
[4] Fudan Univ, Sch Management, Shanghai 200433, Peoples R China
来源
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS | 2018年 / 48卷 / 04期
基金
中国国家自然科学基金;
关键词
Clustering; density deviation; density heat (DHeat); effective radius; r-uniform density; KERNEL; IMAGE;
D O I
10.1109/TSMC.2017.2745493
中图分类号
TP [自动化技术、计算机技术];
学科分类号
080201 [机械制造及其自动化];
摘要
Density-based clustering is one of the most popular paradigms of existing clustering approaches, most approaches of this kind, such as DBSCAN, recognize clusters of data characterized by a fixed scanning radius. However, some flaws are caused by the fixed scanning radius, e.g., the determination of a proper scanning radius is nontrivial. In order to solve these problems, we revise DBSCAN, Meanshift, DPeak, etc. based on two new features, i.e., effective radius and density heat (DHeat). Generally, we name these revised clustering algorithms as DHeat. The underlying idea is based on two assumptions: 1) the existence of clusters is raised by the nonuniformity of data distribution, and the density of one data point within its r-neighborhood is proportional to the volume of the neighborhood provided the density distribution is uniform and 2) each cluster can be divided into different density layers, such as edges, shallow inner, deep inner, etc.; the deeper inner of a point locates, the higher density of that point. The experiments conducted on various test cases show that the advantage of DHeat lies in its good performance and the self-adapting scanning radius.
引用
收藏
页码:649 / 660
页数:12
相关论文
共 44 条
[1]
[Anonymous], CLUST DAT
[2]
[Anonymous], CLUSTER ANAL
[3]
[Anonymous], J OPER RES SOC
[4]
[Anonymous], P ICML BANFF AB CAN
[5]
[Anonymous], 2007, ACM Transactions on Knowledge Discovery from Data, DOI DOI 10.1145/1217299.1217303
[6]
[Anonymous], 2009, Finding Groups in Data: An Introduction to Cluster Analysis
[7]
Fuzzy Based Scalable Clustering Algorithms for Handling Big Data Using Apache Spark [J].
Bharill, Neha ;
Tiwari, Aruna ;
Malviya, Aayushi .
IEEE Transactions on Big Data, 2016, 2 (04) :339-352
[8]
ST-DBSCAN: An algorithm for clustering spatial-temp oral data [J].
Birant, Derya ;
Kut, Alp .
DATA & KNOWLEDGE ENGINEERING, 2007, 60 (01) :208-221
[9]
VARIABLE KERNEL ESTIMATES OF MULTIVARIATE DENSITIES [J].
BREIMAN, L ;
MEISEL, W ;
PURCELL, E .
TECHNOMETRICS, 1977, 19 (02) :135-144
[10]
Robust path-based spectral clustering [J].
Chang, Hong ;
Yeung, Dit-Yan .
PATTERN RECOGNITION, 2008, 41 (01) :191-203