An efficient k′-means clustering algorithm

被引:263
作者
Zalik, Krista Rizman [1 ]
机构
[1] Univ Maribor, Fac Nat Sci & Math, Dept Math & Comp Sci, SLO-2000 Maribor, Slovenia
关键词
clustering analysis; k-means; cluster number; cost-function; rival penalized;
D O I
10.1016/j.patrec.2008.02.014
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper introduces k-means algorithm that performs correct clustering without pre-assigning the exact number of clusters. This is achieved by minimizing a suggested cost-function. The cost-function extends the mean-square-error cost-function of k-means. The algorithm consists of two separate steps. The first is a pre-processing procedure that performs initial clustering and assigns at least one seed point to each cluster. During the second step, the seed-points are adjusted to minimize the cost-function. The algorithm automatically penalizes any possible winning chances for all rival seed-points in subsequent iterations. When the cost-function reaches a global minimum, the correct number of clusters is determined and the remaining seed points are located near the centres of actual clusters. The simulated experiments described in this paper confirm good performance of the proposed algorithm. (c) 2008 Elsevier B.V. All rights reserved.
引用
收藏
页码:1385 / 1391
页数:7
相关论文
共 19 条