A novel multiseed nonhierarchical data clustering technique

被引:44
作者
Chaudhuri, D
Chaudhuri, BB
机构
[1] Computer Vision and Pattern Recognition Unit, Indian Statistical Institute
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 1997年 / 27卷 / 05期
关键词
classification minimal spanning tree; multiseed clustering; pattern recognition;
D O I
10.1109/3477.623240
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Clustering techniques such as K-means and Forgy as well as their improved version ISODATA group data around one seed point for each cluster. It is well known that these methods do not work well if the shape of the cluster is elongated or nonconvex. We argue that for a elongated or nonconvex shaped cluster, more than one seed is needed, In this paper a multiseed clustering algorithm is proposed. A density based representative point selection algorithm is used to choose the initial seed points. To assign several seed points to one cluster, a minimal spanning tree guided novel technique is proposed. Also, a border point detection algorithm is proposed for the detection of shape of the cluster. This border in turn signifies whether the cluster is elongated or not. Experimental results show the efficiency of this clustering technique.
引用
收藏
页码:871 / 877
页数:7
相关论文
共 23 条
[1]  
ANDERBERG M, 1973, CLUSTER ANAL APPLICA
[2]  
[Anonymous], HDB PATTERN RECOGNIT
[3]  
[Anonymous], 1982, Pattern recognition: A statistical approach
[4]  
[Anonymous], ANN I STAT MATH
[5]  
ASTRAHAN MM, 1970, AIM124 STANF U STANF
[6]  
Ball G. H., 1965, ISODATA NOVEL METHOD
[7]  
BALL GH, 1967, RADCTR67310 STANF RE
[8]  
BALL GH, 1964, INT C MICR CIRC THEO
[9]  
CHAUDHURI BB, 1993, TRKBCS293 IND STAT I
[10]   A data driven procedure for density estimation with some applications [J].
Chaudhuri, D ;
Chaudhuri, BB ;
Murthy, CA .
PATTERN RECOGNITION, 1996, 29 (10) :1719-1736