Collaborative clustering with background knowledge

Cited by: 61
Authors
Forestier, G. [1]
Gancarski, P. [1]
Wemmert, C. [1]
Affiliations
[1] Univ Strasbourg, LSIIT, UMR7005, Pole API, F-67412 Illkirch Graffenstaden, France
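Keywords
Collaborative clustering; Unsupervised learning; Classification; Pattern recognition; Knowledge-guided clustering; Consensus
DOI
10.1016/j.datak.2009.10.004
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification code
140502 [Artificial intelligence]
Abstract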
The aim of collaborative clustering is to make different clustering methods collaborate in order to reach an agreement on the partitioning of a common dataset. As different clustering methods can produce different partitionings of the same dataset, finding a consensual clustering from these results is often a hard task. The collaboration aims to make the methods agree on the partitioning through a refinement of their results. This process tends to make the results more similar. In this paper, after introducing the collaboration process, we present different ways to integrate background knowledge into it. In recent years, the integration of background knowledge into clustering algorithms has attracted considerable interest, and it often improves the quality of the results. We discuss how such integration benefits the collaborative process, and we present experiments in which background knowledge is used to guide the collaboration. (C) 2009 Elsevier B.V. All rights reserved.
Pages: 211-228
Number of pages: 18