Local and global approaches of affinity propagation clustering for large scale data

Cited by: 17
Authors
Dingyin XIA, Fei WU, Xuqing ZHANG, Yueting ZHUANG
Institution
School of Computer Science and Technology, Zhejiang University, Hangzhou 310027, China
Keywords
Clustering; Affinity propagation; Large scale data; Partition affinity propagation; Landmark affinity propagation
DOI
Not available
Chinese Library Classification number
TP309 [Security and confidentiality]
Discipline classification codes
081201; 0839; 1402
Abstract
Affinity propagation (AP), a recently proposed clustering algorithm, efficiently clusters sparsely related data by passing messages between data points. In many cases, however, the large scale data to be clustered do not have sparse similarities. This paper presents two variants of AP for grouping large scale data with a dense similarity matrix: a local approach, partition affinity propagation (PAP), and a global approach, landmark affinity propagation (LAP). PAP first passes messages within subsets of the data and then merges the results as the initial step of the iterations, which effectively reduces the number of clustering iterations. LAP first passes messages between landmark data points and then clusters the non-landmark data points; it is a global approximation method that speeds up clustering. Experiments on many datasets, including random data points, manifold subspaces, images of faces, and Chinese calligraphy, demonstrate that the two approaches are feasible and practicable.
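The landmark idea described in the abstract can be illustrated with a rough sketch. This is not the authors' implementation: the abstract does not say how landmarks are chosen or how non-landmark points are clustered, so the code below assumes that the caller supplies the landmark indices, that similarity is negative squared Euclidean distance, that the preference is the median off-diagonal similarity, and that each non-landmark point simply joins its most similar exemplar. The function names `affinity_propagation` and `landmark_ap` are hypothetical.

```python
import numpy as np

def affinity_propagation(S, damping=0.7, n_iter=300):
    """Standard AP message passing on a dense similarity matrix S.
    The diagonal of S holds the preferences. Returns exemplar indices."""
    n = S.shape[0]
    R = np.zeros((n, n))  # responsibilities r(i, k)
    A = np.zeros((n, n))  # availabilities a(i, k)
    idx = np.arange(n)
    for _ in range(n_iter):
        # r(i,k) = s(i,k) - max_{k' != k} [a(i,k') + s(i,k')]
        AS = A + S
        best = AS.argmax(axis=1)
        first = AS[idx, best]
        AS[idx, best] = -np.inf
        second = AS.max(axis=1)
        R_new = S - first[:, None]
        R_new[idx, best] = S[idx, best] - second
        R = damping * R + (1 - damping) * R_new
        # a(i,k) = min(0, r(k,k) + sum_{i' not in {i,k}} max(0, r(i',k)))
        # a(k,k) = sum_{i' != k} max(0, r(i',k))
        Rp = np.maximum(R, 0)
        Rp[idx, idx] = R[idx, idx]
        A_new = Rp.sum(axis=0)[None, :] - Rp
        diag = A_new[idx, idx].copy()
        A_new = np.minimum(A_new, 0)
        A_new[idx, idx] = diag
        A = damping * A + (1 - damping) * A_new
    exemplars = np.flatnonzero(np.diag(A + R) > 0)
    if exemplars.size == 0:  # fallback if no point declared itself an exemplar
        exemplars = np.array([np.diag(A + R).argmax()])
    return exemplars

def neg_sq_dist(P, Q):
    """Pairwise negative squared Euclidean distances (assumed similarity)."""
    return -((P[:, None, :] - Q[None, :, :]) ** 2).sum(axis=2)

def landmark_ap(X, landmarks):
    """Run AP on the landmark points only, then assign every point in X
    to its most similar exemplar (an assumed assignment rule).
    Returns, for each point, the index in X of its exemplar."""
    landmarks = np.asarray(landmarks)
    L = X[landmarks]
    S = neg_sq_dist(L, L)
    off_diag = S[~np.eye(len(L), dtype=bool)]
    np.fill_diagonal(S, np.median(off_diag))  # assumed preference choice
    exemplars = landmarks[affinity_propagation(S)]
    return exemplars[neg_sq_dist(X, X[exemplars]).argmax(axis=1)]
```

Under these assumptions, message passing touches only the landmark-by-landmark similarity matrix, so its cost depends on the number of landmarks rather than on the full data size; the final assignment step needs only the similarities between all points and the exemplars.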
Pages: 1373-1381
Number of pages: 9