Genetic algorithm-based clustering technique

被引:1016
作者
Maulik, U
Bandyopadhyay, S
机构
[1] Govt Engn Coll, Dept Comp Sci, Kalyani, Nadia, India
[2] Indian Stat Inst, Machine Intelligence Unit, Calcutta 700035, W Bengal, India
关键词
genetic algorithms; clustering metric; K-means algorithm; real encoding; Euclidean distance;
D O I
10.1016/S0031-3203(99)00137-5
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A genetic algorithm-based clustering technique, called GA-clustering, is proposed in this article. The searching capability of genetic algorithms is exploited in order to search for appropriate cluster centres in the feature space such that a similarity metric of the resulting clusters is optimized. The chromosomes, which are represented as strings of real numbers, encode the centres of a fixed number of clusters. The superiority of the GA-clustering algorithm over the commonly used K-means algorithm is extensively demonstrated for four artificial and three real-life data sets. (C) 2000 Pattern Recognition Society. Published by Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:1455 / 1465
页数:11
相关论文
共 32 条
  • [1] Anderberg M.R., 1973, Probability and Mathematical Statistics
  • [2] [Anonymous], 1989, GENETIC ALGORITHM SE
  • [3] [Anonymous], P 4 INT C GEN ALG
  • [4] [Anonymous], 1991, Handbook of genetic algorithms
  • [5] PATTERN-CLASSIFICATION WITH GENETIC ALGORITHMS
    BANDYOPADHYAY, S
    MURTHY, CA
    PAL, SK
    [J]. PATTERN RECOGNITION LETTERS, 1995, 16 (08) : 801 - 808
  • [6] Genetic algorithm with elitist model and its convergence
    Bhandari, D
    Murthy, CA
    Pal, SK
    [J]. INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 1996, 10 (06) : 731 - 747
  • [7] Devijver P., 1982, PATTERN RECOGN
  • [8] CLUSTERING TECHNIQUES - USERS DILEMMA
    DUBES, R
    JAIN, AK
    [J]. PATTERN RECOGNITION, 1976, 8 (04) : 247 - 260
  • [9] ESHELMAN LJ, 1995, P 6 INT C GEN ALG
  • [10] Filho J. L. R., 1994, COMPUTER, V27, P28