Kernel Matching Pursuit

被引:30
作者
Pascal Vincent
Yoshua Bengio
机构
[1] Université de Montréal,Department of IRO
来源
Machine Learning | 2002年 / 48卷
关键词
kernel methods; matching pursuit; sparse approximation; support vector machines; radial basis functions; boosting;
D O I
暂无
中图分类号
学科分类号
摘要
Matching Pursuit algorithms learn a function that is a weighted sum of basis functions, by sequentially appending functions to an initially empty basis, to approximate a target function in the least-squares sense. We show how matching pursuit can be extended to use non-squared error loss functions, and how it can be used to build kernel-based solutions to machine learning problems, while keeping control of the sparsity of the solution. We present a version of the algorithm that makes an optimal choice of both the next basis and the weights of all the previously chosen bases. Finally, links to boosting algorithms and RBF training procedures, as well as an extensive experimental comparison with SVMs for classification are given, showing comparable results with typically much sparser models.
引用
收藏
页码:165 / 187
页数:22
相关论文
共 34 条
[1]  
Aizerman M.(1964)Theoretical foundations of the potential function method in pattern recognition learning Automation and Remote Control 25 821-837
[2]  
Braverman E.(1991)Orthogonal least squares learning algorithm for radial basis function networks IEEE Transactions on Neural Networks 2 302-309
[3]  
Rozonoer L.(1994)Adaptive time-frequency decompositions Optical Engineering 33 2183-2191
[4]  
Chen S.(1995)Sample compression, learnability, and the Vapnik-Chervonenkis dimension Machine Learning 21 269-304
[5]  
Cowan F.(2002)Structural modelling with sparse kernels Machine Learning special issue on New Methods for Model Combination and Model Selection 48 137-163
[6]  
Grant P.(1986)Relating data compression and learnability Machine Learning 21 1269-304
[7]  
Davis G.(1993)Matching pursuit with time-frequency dictionaries IEEE Trans. Signal Proc. 41 3397-3415
[8]  
Mallat S.(1998)A sparse representation for function approximation Neural Computation 10 1445-1454
[9]  
Zhang Z.(1998)Boosting the margin: A new explanation for the effectiveness of voting methods The Annals of Statistics 26 1651-1686
[10]  
Floyd S.(1997)Comparing support vector machines with Gaussian kernels to radial basis function classifiers IEEE Transactions on Signal Processing 45 2758-2765