Fast computation of low-rank matrix approximations

被引:138
作者
Achlioptas, Dimitris [1 ]
McSherry, Frank
机构
[1] Univ Calif Santa Cruz, Santa Cruz, CA 95064 USA
[2] Microsoft Res, Mountain View, CA 94043 USA
关键词
algorithms; theory; singular value decomposition; low rank approximation; sampling;
D O I
10.1145/1219092.1219097
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Given a matrix A, it is often desirable to find a good approximation to A that has low rank. We introduce a simple technique for accelerating the computation of such approximations when A has strong spectral features, that is, when the singular values of interest are significantly greater than those of a random matrix with size and entries similar to A. Our technique amounts to independently sampling and/or quantizing the entries of A, thus speeding up computation by reducing the number of nonzero entries and/or the length of their representation. Our analysis is based on observing that the acts of sampling and quantization can be viewed as adding a random matrix N to A, whose entries are independent random variables with zero-mean and bounded variance. Since, with high probability, N has very weak spectral features, we can prove that the effect of sampling and quantization nearly vanishes when a low-rank approximation to A + N is computed. We give high probability bounds on the quality of our approximation both in the Frobenius and the 2-norm.
引用
收藏
页数:19
相关论文
共 17 条
[1]   On the concentration of eigenvalues of random symmetric matrices [J].
Alon, N ;
Krivelevich, M ;
Vu, VH .
ISRAEL JOURNAL OF MATHEMATICS, 2002, 131 (1) :259-267
[2]  
Azar Y., 2001, P 33 ANN ACM S THEOR, P619, DOI [10.1145/380752.380859, DOI 10.1145/380752.380859]
[3]  
Bell RWD, 1997, OPHTHAL PHYSL OPT, V17, P7
[4]  
BERRY M, 1995, SIAM REV, V41, P573
[5]  
BERRY M, 1995, SIAM REV, V41, P4
[6]   Matrices, vector spaces, and information retrieval [J].
Berry, MW ;
Drmac, Z ;
Jessup, ER .
SIAM REVIEW, 1999, 41 (02) :335-362
[7]  
DEERWESTER S, 1990, J AM SOC INFORM SCI, V41, P391, DOI 10.1002/(SICI)1097-4571(199009)41:6<391::AID-ASI1>3.0.CO
[8]  
2-9
[9]   Clustering large graphs via the Singular Value Decomposition [J].
Drineas, P ;
Frieze, A ;
Kannan, R ;
Vempala, S ;
Vinay, V .
MACHINE LEARNING, 2004, 56 (1-3) :9-33
[10]  
Drineas P, 2003, SIAM PROC S, P223