Computational experience on four algorithms for the hard clustering problem

被引:52
作者
AlSultan, KS [1 ]
Khan, MM [1 ]
机构
[1] KING FAHD UNIV PETR & MINERALS, DATA PROC CTR, DHAHRAN 31261, SAUDI ARABIA
关键词
clustering problem; k-means algorithm; simulated annealing; tabu search; genetic algorithm; computational comparison;
D O I
10.1016/0167-8655(95)00122-0
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we consider the problem of clustering m objects in c clusters. The objects are represented by points in n-dimensional Euclidean space, and the objective is to classify these m points into c clusters such that the distance between points within a cluster and its center is minimized. The problem is a difficult optimization problem due to the fact that: it posseses many local minima. Several algorithms have been developed to solve this problem which include the k-means algorithm, the simulated annealing algorithm, the tabu search algorithm, and the genetic algorithm. Zn this paper, we study the four algorithms and compare their computational performance for the clustering problem. We test these algorithms on several clustering problems from the literature as well as several random problems and we report on our computational experience.
引用
收藏
页码:295 / 308
页数:14
相关论文
共 26 条
[1]   A TABU SEARCH APPROACH TO THE CLUSTERING PROBLEM [J].
ALSULTAN, KS .
PATTERN RECOGNITION, 1995, 28 (09) :1443-1451
[2]  
ALSULTAN KS, 1987, THESIS KING FAHD U P
[3]  
ALSULTAN KS, 1995, COMPUTATIONAL EXPERI
[4]  
BALL GH, 1964, P INT C MICR CIRC TH
[5]  
BHUYAN JN, 1991, P 4 INT C GEN ALG SA
[6]   A PRACTICAL APPLICATION OF SIMULATED ANNEALING TO CLUSTERING [J].
BROWN, DE ;
HUNTLEY, CL .
PATTERN RECOGNITION, 1992, 25 (04) :401-412
[7]   N-DIMENSIONAL LOCATION MODELS - APPLICATION TO CLUSTER ANALYSIS [J].
COOPER, L .
JOURNAL OF REGIONAL SCIENCE, 1973, 13 (01) :41-54
[8]  
DIEHR G, 1975, IEEE T COMPUT, V24
[9]  
FORGY EW, 1965, BIOMETRICS, V21, P768
[10]  
Golberg D.E., 1989, Genetic Algorithm in Search, Optimization and Machine Learning