A dynamic clustering algorithm for building overlapping clusters

Cited by: 5
Authors
Perez-Suarez, Airel [1, 2]
Martinez-Trinidad, Jose Fco. [1]
Carrasco-Ochoa, Jesus A. [1]
Medina-Pagola, Jose E. [2]
Affiliations
[1] Natl Inst Astrophys, Dept Comp Sci, Puebla 72840, Mexico
[2] Adv Technol Applicat Ctr, Havana, Cuba
Keywords
Data mining; overlapping clustering; graph-based algorithms; topic detection
DOI
10.3233/IDA-2012-0520
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Clustering is a data mining technique that has been widely used in many practical applications. In some of these applications, such as medical diagnosis, categorization of digital libraries, and topic detection, objects can belong to more than one cluster. However, most clustering algorithms generate disjoint clusters. Moreover, processing additions, deletions, and modifications of objects in an existing clustering, without having to rebuild it from scratch, is an issue that has been little studied. In this paper, we introduce DCS, a clustering algorithm that includes a new graph-cover strategy for building a set of clusters that may overlap, and a strategy for dynamically updating the clustering that handles multiple additions and/or deletions of objects. The experimental evaluation conducted over different collections demonstrates the good performance of the proposed algorithm.
Pages: 211-232
Page count: 22