Optimized fixed-size kernel models for large data sets

被引:97
作者
De Brabanter, K. [1 ]
De Brabanter, J. [1 ,2 ]
Suykens, J. A. K. [1 ]
De Moor, B. [1 ]
机构
[1] Katholieke Univ Leuven, Dept Elect Engn, ESAT SCD, B-3001 Louvain, Belgium
[2] Katholieke Univ Leuven, Hogesch KaHo Sint Lieven, Dept Ind Ingenieur, B-9000 Ghent, Belgium
关键词
Kernel methods; Least squares support vector machines; Classification; Regression; Plug-in estimate; Entropy; Cross-validation; SUPPORT VECTOR MACHINE; CROSS-VALIDATION; BANDWIDTH SELECTION; PARALLEL MIXTURE; BOOTSTRAP CHOICE; SIMPLEX-METHOD; WINDOW WIDTH; DENSITY; CLASSIFICATION; ALGORITHMS;
D O I
10.1016/j.csda.2010.01.024
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
A modified active subset selection method based on quadratic Renyi entropy and a fast cross-validation for fixed-size least squares support vector machines is proposed for classification and regression with optimized tuning process. The kernel bandwidth of the entropy based selection criterion is optimally determined according to the solve-the-equation plug-in method. Also a fast cross-validation method based on a simple updating scheme is developed. The combination of these two techniques is suitable for handling large scale data sets on standard personal computers. Finally, the performance on test data and computational time of this fixed-size method are compared to those for standard support vector machines and v-support vector machines resulting in sparser models with lower computational cost and comparable accuracy. (C) 2010 Elsevier B.V. All rights reserved.
引用
收藏
页码:1484 / 1504
页数:21
相关论文
共 94 条