VALIDITY STUDIES IN CLUSTERING METHODOLOGIES

被引:181
作者
DUBES, R
JAIN, AK
机构
[1] Department of Computer Science, Michigan State University, East Lansing
基金
美国国家科学基金会;
关键词
Cluster validity; Clustering; Clustering tendency; Compactness; Global fit; Hierarchical structure; Intrinsic dimensionality; Isolation;
D O I
10.1016/0031-3203(79)90034-7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Clustering algorithms tend to generate clusters even when applied to random data. This paper provides a semi-tutorial review of the state-of-the-art in cluster validity, or the verification of results from clustering algorithms. The paper covers ways of measuring clustering tendency, the fit of hierarchical and partitional structures and indices of compactness and isolation for individual clusters. Included are structural criteria for validating clusters and the factors involved in choosing criteria, according to which the literature of cluster validity is classified. An application to speaker identification demonstrates several indices. The development of new clustering techniques and the wide availability of clustering programs necessitates vigorous research in cluster validity. © 1979.
引用
收藏
页码:235 / 254
页数:20
相关论文
共 89 条
[1]   APPROACH TO WORKLOAD CHARACTERIZATION PROBLEM [J].
AGRAWALA, AK ;
MOHR, JM ;
BRYANT, RM .
COMPUTER, 1976, 9 (06) :18-32
[2]  
ANDERBERG MR, 1973, CLUSTER ANAL APPLICA
[3]   CLUSTERING REPRESENTATIONS OF GROUP OVERLAP [J].
ARABIE, P .
JOURNAL OF MATHEMATICAL SOCIOLOGY, 1977, 5 (01) :113-128
[4]  
BACKER E, 1978, CLUSTER ANAL OPTIMAL
[5]   MEASURING POWER OF HIERARCHICAL CLUSTER-ANALYSIS [J].
BAKER, FB ;
HUBERT, LJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1975, 70 (349) :31-38
[6]   GRAPH-THEORETIC APPROACH TO GOODNESS-OF-FIT IN COMPLETE-LINK HIERARCHICAL CLUSTERING [J].
BAKER, FB ;
HUBERT, LJ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1976, 71 (356) :870-878
[7]   STABILITY OF 2 HIERARCHICAL GROUPING TECHNIQUES CASE 1 - SENSITIVITY TO DATA ERRORS [J].
BAKER, FB .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1974, 69 (346) :440-445
[8]  
BALL GH, 1965, FAL P JOINT COMP C, P533
[9]  
Bezdek J. C., 1973, Journal of Cybernetics, V3, P58, DOI 10.1080/01969727308546047
[10]   MIXTURE MODEL TESTS OF CLUSTER-ANALYSIS - ACCURACY OF 4 AGGLOMERATIVE HIERARCHICAL METHODS [J].
BLASHFIELD, RK .
PSYCHOLOGICAL BULLETIN, 1976, 83 (03) :377-388