Local information-based fast approximate spectral clustering

被引:24
作者
Cao, Jiangzhong [1 ,2 ]
Chen, Pei [1 ]
Dai, Qingyun [2 ]
Ling, Wing-Kuen [2 ]
机构
[1] Sun Yat Sen Univ, Sch Informat Sci & Technol, Guangzhou 510006, Guangdong, Peoples R China
[2] Guangdong Univ Technol, Sch Informat Engn, Guangzhou 510006, Guangdong, Peoples R China
基金
中国国家自然科学基金;
关键词
Spectral clustering; Local information; Sparse affinity graph; Local interpolation;
D O I
10.1016/j.patrec.2013.11.005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Spectral clustering has become one of the most popular clustering approaches in recent years. However, its high computational complexity prevents its application to large-scale datasets. To address this complexity, approximate spectral clustering methods have been proposed. In these methods, computational costs are reduced by using approximation techniques, such as the Nystrom method, or by constructing a smaller representative dataset on which spectral clustering is performed. However, the computational efficiency of these approximation methods is achieved at the cost of performance degradation. In this paper, we propose an efficient approximate spectral clustering method in which clustering performance is improved by utilizing local information among the data, while the scalability to the large-scale datasets is retained. Specifically, we improve the approximate spectral clustering method in two aspects. First, a sparse affinity graph is adopted to improve the performance of spectral clustering on the small representative dataset. Second, local interpolation is utilized to improve the extension of the clustering result. Experiments are conducted on several real-world datasets, showing that the proposed method is efficient and outperforms the state-of-the-art approximate spectral clustering algorithms. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:63 / 69
页数:7
相关论文
共 25 条
[1]   Multiway Spectral Clustering with Out-of-Sample Extensions through Weighted Kernel PCA [J].
Alzate, Carlos ;
Suykens, Johan A. K. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2010, 32 (02) :335-347
[2]  
[Anonymous], 030501 U WASH DEP CO
[3]  
[Anonymous], 2009, MNIST DATABASE HANDW
[4]  
[Anonymous], 1998, COMBINATORIAL OPTIMI
[5]  
Bache K., 2013, UCI Machine Learning Repository
[6]   Parallel Spectral Clustering in Distributed Systems [J].
Chen, Wen-Yen ;
Song, Yangqiu ;
Bai, Hongjie ;
Lin, Chih-Jen ;
Chang, Edward Y. .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2011, 33 (03) :568-586
[7]  
Chen X., 2011, P AAAI C ART INT, P7
[8]  
CHUNG FRK, 1997, REGIONAL C SERIES MA, P92
[9]   Spectral grouping using the Nystrom method [J].
Fowlkes, C ;
Belongie, S ;
Chung, F ;
Malik, J .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2004, 26 (02) :214-225
[10]   On the quality of spectral separators [J].
Guattery, S ;
Miller, GL .
SIAM JOURNAL ON MATRIX ANALYSIS AND APPLICATIONS, 1998, 19 (03) :701-719