A DYNAMIC APPROACH FOR CLUSTERING DATA

Cited by: 21
Authors
GARCIA, JA
FDEZVALDIVIA, J
CORTIJO, FJ
MOLINA, R
Affiliation
[1] Departamento de Ciencias de la Computación e I.A., E.T.S. de Ingeniería Informática, Universidad de Granada
Keywords
DYNAMIC SCHEME OF CLUSTERING; NON-PIECEWISE LINEAR SEPARABLE CLUSTERS; DISSIMILARITY MEASURE; FIRST DERIVATIVE;
DOI
10.1016/0165-1684(95)00023-7
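Chinese Library Classification
TM [Electrical engineering]; TN [Electronic technology, communication technology];
Discipline classification code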
0808 ; 0809 ;
Abstract
This paper introduces a new method for clustering data using a dynamic scheme. An appropriate partitioning is obtained from both a dissimilarity measure between pairs of entities and a dynamic splitting procedure. The dissimilarity function is defined as the cost of the optimum path from a datum to each entity on a graph, where the cost of a path is the greatest distance between two successive vertices on the path. The clustering procedure is dynamic in the sense that the initial problem of determining a partition into an unknown number of natural groupings is reduced to a sequence of two-class splitting stages. Not having arisen from any particular application, the proposed approach could be effective in many domains, and it is especially successful at identifying clusters when prior knowledge about the data set is lacking. The usefulness of the dynamic algorithm for dealing with elongated or non-piecewise linear separable clusters, as well as sparse and dense groupings, is demonstrated with several data sets.
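The path-based dissimilarity described in the abstract can be illustrated with a minimal sketch. It assumes a complete graph over the data points with Euclidean edge lengths and uses a Dijkstra-style search in which the cost of a path is its largest single hop; the graph construction, the function name `minimax_dissimilarity`, and the sample points are illustrative assumptions, not details taken from the paper.

```python
import heapq
import math


def minimax_dissimilarity(points, source):
    """Cost of the best path from points[source] to every point.

    Assumption (not from the paper): the graph is complete and edge
    lengths are Euclidean. The cost of a path is its largest hop, and
    the best path is the one that minimises that largest hop.
    """
    n = len(points)
    cost = [math.inf] * n
    cost[source] = 0.0
    done = [False] * n
    heap = [(0.0, source)]
    while heap:
        c, u = heapq.heappop(heap)
        if done[u]:
            continue  # stale heap entry
        done[u] = True
        for v in range(n):
            if done[v]:
                continue
            hop = math.dist(points[u], points[v])
            new_cost = max(c, hop)  # path cost = greatest hop so far
            if new_cost < cost[v]:
                cost[v] = new_cost
                heapq.heappush(heap, (new_cost, v))
    return cost


# Hypothetical data: an elongated chain of four points plus one far point.
points = [(0, 0), (1, 0), (2, 0), (3, 0), (10, 0)]
print(minimax_dissimilarity(points, 0))  # [0.0, 1.0, 1.0, 1.0, 7.0]
```

In this toy example every point of the chain is at dissimilarity 1 from the first point, even though the Euclidean distance grows to 3, while the isolated point stays at 7. This suggests why such a measure can keep elongated groupings together, though the paper's actual splitting criterion is not reproduced here.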
Pages: 181-196
Number of pages: 16