A modified rough c-means clustering algorithm based on hybrid imbalanced measure of distance and density

Cited by: 11
Authors
Zhang, Tengfei [1]
Chen, Long [1]
Ma, Fumin [2]
Affiliations
[1] Nanjing Univ Posts & Telecommun, Coll Automat, Nanjing 210023, Jiangsu, Peoples R China
[2] Nanjing Univ Finance & Econ, Coll Informat Engn, Nanjing, Jiangsu, Peoples R China
Funding
National Natural Science Foundation of China;
Keywords
Rough c-means clustering; Hybrid imbalanced measure; Rough set theory; FUZZY; SETS;
DOI
10.1016/j.ijar.2014.05.004
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Subject classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Traditional c-means clustering partitions a group of objects into a number of non-overlapping sets. For a given dataset, rough sets provide a more flexible and objective representation than classical sets with a hard partition or fuzzy sets with a subjective membership function. Rough c-means clustering and its extensions have been introduced and successfully applied in many real-life applications in recent years. Each cluster is represented by a reasonable pair of lower and upper approximations. However, most available algorithms pay no attention to the influence of the imbalanced spatial distribution within a cluster. The limitation of the iterative mean calculation function, which assigns the same weight to all the data objects in a lower or upper approximation, is analyzed. A hybrid imbalanced measure of distance and density for rough c-means clustering is defined, and a modified rough c-means clustering algorithm is presented in this paper. To evaluate the proposed algorithm, it has been applied to several real-world data sets from UCI. The validity of this algorithm is demonstrated by the results of comparative experiments. © 2014 Elsevier Inc. All rights reserved.
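As an illustration of the baseline the abstract criticizes, the following is a minimal sketch of one iteration of conventional rough c-means in the style of Lingras and West. It is not the authors' modified algorithm: the hybrid distance-and-density measure is not specified in this record and is not reproduced here. The function name `rough_c_means_step`, the parameters `threshold` and `w_lower`, and their default values are assumptions chosen for illustration. Note that every object inside a lower approximation or boundary region contributes to the mean with the same weight, which is the limitation the abstract refers to.

```python
from math import dist


def rough_c_means_step(points, centroids, threshold=0.5, w_lower=0.7):
    """One iteration of conventional rough c-means (illustrative sketch).

    threshold and w_lower are hypothetical tuning parameters.
    """
    k = len(centroids)
    lower = [[] for _ in range(k)]     # objects that certainly belong to cluster j
    boundary = [[] for _ in range(k)]  # objects that possibly belong to cluster j

    for p in points:
        d = [dist(p, c) for c in centroids]
        nearest = min(range(k), key=d.__getitem__)
        # Other clusters whose centroid is almost as close as the nearest one.
        close = [j for j in range(k)
                 if j != nearest and d[j] - d[nearest] <= threshold]
        if close:
            # Ambiguous object: goes to the boundary of every candidate cluster.
            for j in [nearest] + close:
                boundary[j].append(p)
        else:
            lower[nearest].append(p)

    def mean(group):
        # Plain arithmetic mean: every object carries the same weight.
        return [sum(col) / len(group) for col in zip(*group)]

    new_centroids = []
    for j in range(k):
        if lower[j] and boundary[j]:
            lo, bd = mean(lower[j]), mean(boundary[j])
            new_centroids.append(
                [w_lower * a + (1 - w_lower) * b for a, b in zip(lo, bd)])
        elif lower[j]:
            new_centroids.append(mean(lower[j]))
        elif boundary[j]:
            new_centroids.append(mean(boundary[j]))
        else:
            new_centroids.append(list(centroids[j]))  # empty cluster: keep centroid
    return lower, boundary, new_centroids
```

In this sketch, a point equidistant from two centroids lands in the boundary region of both clusters and pulls each centroid toward it by the fraction `1 - w_lower`, regardless of how the remaining objects are distributed around the centroid.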
Pages: 1805-1818
Number of pages: 14