On mean shift-based clustering for circular data

被引:33
作者
Chang-Chien, Shou-Jen [1 ]
Hung, Wen-Liang [2 ]
Yang, Miin-Shen [1 ]
机构
[1] Chung Yuan Christian Univ, Dept Appl Math, Chungli 32023, Taiwan
[2] Natl Hsinchu Univ Educ, Dept Appl Math, Hsinchu, Taiwan
关键词
Circular data; Clustering algorithms; Mean shift; Kernel functions; MIXTURE; KERNEL; MODEL;
D O I
10.1007/s00500-012-0802-z
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Cluster analysis is a useful tool for data analysis. Clustering methods are used to partition a data set into clusters such that the data points in the same cluster are the most similar to each other and the data points in the different clusters are the most dissimilar. The mean shift was originally used as a kernel-type weighted mean procedure that had been proposed as a clustering algorithm. However, most mean shift-based clustering (MSBC) algorithms are used for numeric data. The circular data that are the directional data on the plane have been widely used in data analysis. In this paper, we propose a MSBC algorithm for circular data. Three types of mean shift implementation procedures with nonblurring, blurring and general methods are furthermore compared in which the blurring mean shift procedure is the best and recommended. The proposed MSBC for circular data is not necessary to give the number of cluster. It can automatically find a final cluster number with good clustering centers. Several numerical examples and comparisons with some existing clustering methods are used to demonstrate its effectiveness and superiority of the proposed method.
引用
收藏
页码:1043 / 1060
页数:18
相关论文
共 33 条
[1]  
[Anonymous], 1998, DENSITY ESTIMATION S, DOI DOI 10.1201/9781315140919
[2]  
Banerjee A, 2005, J MACH LEARN RES, V6, P1345
[3]   ESTIMATION IN A BIDIRECTIONAL MIXTURE OF VONMISES DISTRIBUTIONS [J].
BARTELS, R .
BIOMETRICS, 1984, 40 (03) :777-784
[4]   Statistical modelling of directional wind speeds using mixtures of von Mises distributions:: Case study [J].
Carta, Jose A. ;
Bueno, Celia ;
Ramirez, Penelope .
ENERGY CONVERSION AND MANAGEMENT, 2008, 49 (05) :897-907
[5]   MEAN SHIFT, MODE SEEKING, AND CLUSTERING [J].
CHENG, YZ .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1995, 17 (08) :790-799
[6]  
Christianini N., 2000, INTRO SUPPORT VECTOR, P189
[7]   Mean shift: A robust approach toward feature space analysis [J].
Comaniciu, D ;
Meer, P .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2002, 24 (05) :603-619
[8]   Using circular statistics to explore the geography of the journey to work [J].
Corcoran, Jonathan ;
Chhetri, Prem ;
Stimson, Robert .
PAPERS IN REGIONAL SCIENCE, 2009, 88 (01) :119-132
[9]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38
[10]   Model-based clustering on the unit sphere with an illustration using gene expression profiles [J].
Dortet-Bernadet, Jean-Luc ;
Wicker, Nicolas .
BIOSTATISTICS, 2008, 9 (01) :66-80