Efficient SVM training using low-rank kernel representations

被引:386
作者
Fine, S
Scheinberg, K
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Human Language Technol Dept, Yorktown Hts, NY 10598 USA
[2] IBM Corp, Thomas J Watson Res Ctr, Dept Math Sci, Yorktown Hts, NY 10598 USA
关键词
support vector machine; interior point method; Cholesky product form; Cholesky factorization; approximate solution;
D O I
10.1162/15324430260185619
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
SVM training is a convex optimization problem which scales with the training set size rather than the feature space dimension. While this is usually considered to be a desired quality, in large scale problems it may cause training to be impractical. The common techniques to handle this difficulty basically build a solution by solving a sequence of small scale subproblems. Our current effort is concentrated on the rank of the kernel matrix as a source for further enhancement of the training procedure. We first show that for a low rank kernel matrix it is possible to design a better interior point method (IPM) in terms of storage requirements as well as computational complexity. We then suggest an efficient use of a known factorization technique to approximate a given kernel matrix by a low rank matrix, which in turn will be used to feed the optimizer. Finally. we derive an upper bound on the change in the objective function value based on the approximation error and the number of active constraints (support vectors). This bound is general in the sense that it holds regardless of the approximation method.
引用
收藏
页码:243 / 264
页数:22
相关论文
共 26 条
[1]   A modified Schur-complement method for handling dense columns in interior-point methods for linear programming [J].
Andersen, KD .
ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE, 1996, 22 (03) :348-356
[2]  
[Anonymous], 1998, P 15 INT C MACHINE L
[3]  
Bennett JM., 1965, NUMER MATH, V7, P217
[4]  
Blake C.L., 1998, UCI repository of machine learning databases
[5]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[6]  
Choi IC, 1990, ORSA J. Comput., V2, P304
[7]  
FINE S, 2001, P INT C AC SPEECH SI
[8]  
FINE S, 2001, UNPUB MATH PROGRAMMI
[9]   MODIFICATION OF LDLT FACTORIZATIONS [J].
FLETCHER, R ;
POWELL, MJD .
MATHEMATICS OF COMPUTATION, 1974, 28 (128) :1067-1087
[10]   METHODS FOR COMPUTING AND MODIFYING LDV FACTORS OF A MATRIX [J].
GILL, PE ;
MURRAY, W ;
SAUNDERS, MA .
MATHEMATICS OF COMPUTATION, 1975, 29 (132) :1051-1077