An equivalence between sparse approximation and support vector machines

被引:292
作者
Girosi, F
机构
[1] MIT, Ctr Biol & Computat Learning, Cambridge, MA 02139 USA
[2] MIT, Artificial Intelligence Lab, Cambridge, MA 02139 USA
关键词
D O I
10.1162/089976698300017269
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This article shows a relationship between two different approximation techniques: the support vector machines (SVM), proposed by V. Vapnik (1995) and a sparse approximation scheme that resembles the basis pursuit denoising algorithm (Chen, 1995; Chen, Donoho, & Saunders, 1995). SVM is a technique that can be derived from the structural risk minimization principle (Vapnik, 1982) and can be used to estimate the parameters of several different approximation schemes, including radial basis functions, algebraic and trigonometric polynomials, B-splines, and some forms of multilayer perceptrons. Basis pursuit denoising is a sparse approximation technique in which a function is reconstructed by using a small number of basis functions chosen from a large set (the dictionary). We show that if the data are noiseless, the modified version of basis pursuit denoising proposed in this article is equivalent to SVM in the following sense: if applied to the same data set, the two techniques give the same solution, which is obtained by solving the same quadratic programming problem. In the appendix, we present a derivation of the SVM technique in the framework of regularization theory, rather than statistical learning theory, establishing a connection between SVM, sparse approximation, and regularization theory.
引用
收藏
页码:1455 / 1480
页数:26
相关论文
共 34 条
  • [1] [Anonymous], 1993, Ten Lectures of Wavelets
  • [2] [Anonymous], 1995, THESIS STANFORD U
  • [3] [Anonymous], 1982, ESTIMATION DEPENDENC
  • [4] THEORY OF REPRODUCING KERNELS
    ARONSZAJN, N
    [J]. TRANSACTIONS OF THE AMERICAN MATHEMATICAL SOCIETY, 1950, 68 (MAY) : 337 - 404
  • [5] BERTERO M, 1986, INVERSE PROBLEMS
  • [6] Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
  • [7] Breiman L., 1993, BETTER SUBSET SELECT
  • [8] CHEN S, 1995, 479 STANF U DEP STAT
  • [9] COCHRAN JA, 1972, ANAL LINEAR INTEGRAL
  • [10] ENTROPY-BASED ALGORITHMS FOR BEST BASIS SELECTION
    COIFMAN, RR
    WICKERHAUSER, MV
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 1992, 38 (02) : 713 - 718