Some new indexes of cluster validity

被引:786
作者
Bezdek, JC [1 ]
Pal, NR
机构
[1] Univ W Florida, Dept Comp Sci, Pensacola, FL 32514 USA
[2] Indian Stat Inst, Machine Intelligence Unit, Calcutta 700035, W Bengal, India
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 1998年 / 28卷 / 03期
关键词
cluster validity; Davies-Bouldin index; generalized Dunn's index; hard c-means; modified Hubert statistic; single linkage;
D O I
10.1109/3477.678624
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We review two clustering algorithms (hard c-means and single linkage) and three indexes of crisp cluster validity (Hubert's statistics, the Davies-Bouldin index, and Dunn's index). We illustrate two deficiencies of Dunn's index which make it overly sensitive to noisy clusters and propose several generalizations of it that are not as brittle to outliers in the clusters. Our numerical examples show that the standard measure of interset distance (the minimum distance between points in a pair of sets) is the worst (least reliable) measure upon which to base cluster validation indexes when the clusters are expected to form volumetric clouds. Experimental results also suggest that intercluster separation plays a more important role in cluster validation than cluster diameter. Our simulations show that while Dunn's original index has operational flaws, the concept it embodies provides a rich paradigm for validation of partitions that have cloud-like clusters. Five of our generalized Dunn's indexes provide the best validation results for the simulations presented.
引用
收藏
页码:301 / 315
页数:15
相关论文
共 15 条
[1]  
Anderson E., 1935, Bulletin of the American IRIS Society, V59, P2
[2]  
[Anonymous], Pattern Recognition With Fuzzy Objective Function Algorithms
[3]   A geometric approach to cluster validity for normal mixtures [J].
J. C. Bezdek ;
W. Q. Li ;
Y. Attikiouzel ;
M. Windham .
Soft Computing, 1997, 1 (4) :166-179
[4]   CLUSTER SEPARATION MEASURE [J].
DAVIES, DL ;
BOULDIN, DW .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1979, 1 (02) :224-227
[5]  
Dunn J.C., 1973, J CYBERNETICS, V3, P32, DOI DOI 10.1080/01969727308546046
[6]  
Everitt B, 1978, GRAPHICAL TECHNIQUES
[7]  
Hart P.E., 1973, Pattern recognition and scene analysis
[8]   COMPARING PARTITIONS [J].
HUBERT, L ;
ARABIE, P .
JOURNAL OF CLASSIFICATION, 1985, 2 (2-3) :193-218
[9]  
Jian A., 1988, ALGORITHMS CLUSTERIN
[10]  
Krishnapuram R., 1993, IEEE Transactions on Fuzzy Systems, V1, P98, DOI 10.1109/91.227387