An on-line agglomerative clustering method for nonstationary data

被引:41
作者
Guedalia, ID [1 ]
London, M
Werman, M
机构
[1] Hebrew Univ Jerusalem, Ctr Neural Computat, IL-91904 Jerusalem, Israel
[2] Hebrew Univ Jerusalem, Inst Comp Sci, IL-91904 Jerusalem, Israel
[3] Hebrew Univ Jerusalem, Inst Life Sci, IL-91904 Jerusalem, Israel
关键词
D O I
10.1162/089976699300016755
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
An on-line agglomerative clustering algorithm for nonstationary data is described. Three issues are addressed. The first regards the temporal aspects of the data. The clustering of stationary data by the proposed algorithm is comparable to the other popular algorithms tested (batch and on-line). The second issue addressed is the number of clusters required to represent the data. The algorithm provides an efficient framework to determine the natural number of clusters given the scale of the problem. Finally, the proposed algorithm implicitly minimizes the local distortion, a measure that takes into account clusters with relatively small mass. In contrast, most existing on-line clustering methods assume stationarity of the data. When used to cluster nonstationary data, these methods fail to generate a good representation. Moreover, most current algorithms are computationally intensive when determining the correct number of clusters. These algorithms tend to neglect clusters of small mass due to their minimization of the global distortion (Energy).
引用
收藏
页码:521 / 540
页数:20
相关论文
共 16 条
[1]   A CLUSTERING TECHNIQUE FOR SUMMARIZING MULTIVARIATE DATA [J].
BALL, GH ;
HALL, DJ .
BEHAVIORAL SCIENCE, 1967, 12 (02) :153-&
[2]   COMPLEXITY OPTIMIZED DATA CLUSTERING BY COMPETITIVE NEURAL NETWORKS [J].
BUHMANN, J ;
KUHNEL, H .
NEURAL COMPUTATION, 1993, 5 (01) :75-88
[3]  
CARPENTER GA, 1990, PARALLEL PROCESSING IN NEURAL SYSTEMS AND COMPUTERS, P383
[4]   GROWING CELL STRUCTURES - A SELF-ORGANIZING NETWORK FOR UNSUPERVISED AND SUPERVISED LEARNING [J].
FRITZKE, B .
NEURAL NETWORKS, 1994, 7 (09) :1441-1460
[5]   UNSUPERVISED OPTIMAL FUZZY CLUSTERING [J].
GATH, I ;
GEVA, AB .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 1989, 11 (07) :773-781
[6]  
GUEDALIA ID, 1995, 953606 ASAE
[7]  
Hart P.E., 1973, Pattern recognition and scene analysis
[8]  
Jain K, 1988, Algorithms for clustering data
[9]  
LINDE Y, 1980, IEEE T COMMUN, V28, P1
[10]  
MacQueen J., 1967, P 5 BERKELEY S MATH, V1, P281