Hierarchical initialization approach for K-Means clustering

被引:57
作者
Lu, J. F. [1 ]
Tang, J. B. [1 ]
Tang, Z. M. [1 ]
Yang, J. Y. [1 ]
机构
[1] Nanjing Univ Sci & Technol, Dept Comp Sci, Nanjing 210094, Peoples R China
关键词
K-Means algorithm; K-Means initialization; voronoi tessellation; hierarchical technique;
D O I
10.1016/j.patrec.2007.12.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A hierarchical initialization approach is proposed to the K-Means clustering problem. The core of the proposed method is to treat the clustering problem as a weighted clustering problem so as to find better initial cluster centers based on the hierarchical approach. The experimental results show that the proposed approach needs less iteration time compared with existing approaches and has better performance in terms of convergence speed and ability to reduce the impact of noises. (c) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:787 / 795
页数:9
相关论文
共 13 条
[1]   Centroidal Voronoi tessellations: Applications and algorithms [J].
Du, Q ;
Faber, V ;
Gunzburger, M .
SIAM REVIEW, 1999, 41 (04) :637-676
[2]  
Duda R. O., 1973, Pattern Classification
[3]  
Fayyad U., 1998, Proceedings Fourth International Conference on Knowledge Discovery and Data Mining, P194
[4]  
Fisher D. H., 1987, Machine Learning, V2, P139, DOI 10.1007/BF00114265
[5]  
FORGY EW, 1965, BIOMETRICS, V21, P768
[6]  
Hinneburg A., 1998, Proceedings Fourth International Conference on Knowledge Discovery and Data Mining, P58
[7]  
KARYPIS G, 1999, IEEE COMPUT, V99, P68
[8]  
Kaufman L., 1990, Finding Groups in Data: An Introduction to Cluster Analysis, DOI DOI 10.1002/9780470316801
[9]  
MacQueen J., 1967, Proc fifth Berkeley Symp Math Stat Probab, V1, P281
[10]   An empirical comparison of four initialization methods for the K-Means algorithm [J].
Peña, JM ;
Lozano, JA ;
Larrañaga, P .
PATTERN RECOGNITION LETTERS, 1999, 20 (10) :1027-1040