SYMBOLIC CLUSTERING USING A NEW SIMILARITY MEASURE

被引:127
作者
GOWDA, KC [1 ]
DIDAY, E [1 ]
机构
[1] INST NATL RECH INFORMAT & AUTOMAT,F-78153 LE CHESNAY,FRANCE
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS | 1992年 / 22卷 / 02期
关键词
D O I
10.1109/21.148412
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
A hierarchical, agglomerative, symbolic clustering methodology based on a new similarity measure that takes into consideration the "position," "span," and "content," of symbolic objects is proposed. The similarity measure used is of a new type in the sense that it is not just another aspect of dissimilarity such as the reciprocal of a distance measure. The clustering methodology forms composite symbolic objects using a Cartesian join operator when two symbolic objects are merged. The maximum and minimum similarity values at various merging levels enable the determination of the number of clusters in the data set. The composite symbolic objects representing different clusters give a description of the resulting classes and lead to knowledge acquisition. The algorithm appears very versatile as it is capable of discerning clusters in data sets made up of numeric as well as symbolic objects consisting of different types and combinations of qualitative and quantitative feature values. In particular, the algorithm is applied on two data sets of fat-oil and microcomputers.
引用
收藏
页码:368 / 378
页数:11
相关论文
共 25 条
[1]  
[Anonymous], 1988, ALGORITHMS CLUSTERIN
[2]  
BOCK HH, 1987, CLASSIFICATION RELAT
[3]   CONCEPTUAL CLUSTERING IN KNOWLEDGE ORGANIZATION [J].
CHENG, Y ;
FU, KS .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1985, 7 (05) :592-598
[4]  
DIDAY E, 1989, 2ND P INT WORKSH DAT
[5]  
DIDAY E, 1987, RECENT DEV CLUSTERIN
[6]  
Diday E., 1980, COMMUNICATION CYBERN, V10, P47
[7]  
DIDAY E, 1980, 5TH P C PATT REC MIA
[8]  
Diday E., 1988, CLASSIFICATION RELAT
[9]  
DIDAY E, 1989, DATA ANAL ALGORITHMS
[10]  
Duda R. O., 1973, PATTERN CLASSIFICATI, V3