Stable Initialization Scheme for K-Means Clustering

被引：15

作者：

XU Junling XU Baowen ZHANG Weifeng ZHANG Wei HOU Jun School of Computer Science and Engineering Southeast University Nanjing Jiangsu China State Key Laboratory of Software Engineering Wuhan University Wuhan Hubei China Department of Computer Nanjing University of Posts and Telecommunications Nanjing Jiangsu China ^{[1
,1
,2
,3
,1
,1
,1
,211189
,2
,430072
,3
,210003
]}

机构：

来源：

Wuhan University Journal of Natural Sciences | 2009年 / 14卷 / 01期

关键词：

clustering; unsupervised learning; K-means; initiali-zation;

D O I：

暂无

中图分类号：

TP391.41 [];

学科分类号：

080203 ;

摘要：

Though K-means is very popular for general clustering, its performance, which generally converges to numerous local minima, depends highly on initial cluster centers. In this paper a novel initialization scheme to select initial cluster centers for K-means clustering is proposed. This algorithm is based on reverse nearest neighbor (RNN) search which retrieves all points in a given data set whose nearest neighbor is a given query point. The initial cluster centers computed using this methodology are found to be very close to the desired cluster centers for iterative cluster-ing algorithms. This procedure is applicable to clustering algo-rithms for continuous data. The application of the proposed algo-rithm to K-means clustering algorithm is demonstrated. An ex-periment is carried out on several popular datasets and the results show the advantages of the proposed method.

引用

页码：24 / 28

页数：5