Automatic model selection for the optimization of SVM kernels

被引:156
作者
Ayat, NE
Cheriet, M
Suen, CY
机构
[1] LIVIA, ETS, Montreal, PQ H3C 1K3, Canada
[2] Concordia Univ, CENPARMI, Montreal, PQ H3G 1M8, Canada
基金
加拿大自然科学与工程研究理事会;
关键词
model selection; SVM; kernel; empirical error; VC; GACV;
D O I
10.1016/j.patcog.2005.03.011
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This approach aims to optimize the kernel parameters and to efficiently reduce the number of support vectors, so that the generalization error can be reduced drastically. The proposed methodology suggests the use of a new model selection criterion based on the estimation of the probability of error of the SVM classifier. For comparison, we considered two more model selection criteria: GACV ('Generalized Approximate Cross-Validation') and VC ('Vapnik-Chernovenkis') dimension. These criteria are algebraic estimates of upper bounds of the expected error. For the former, we also propose a new minimization scheme. The experiments conducted on a bi-class problem show that we can adequately choose the SVM hyper-parameters using the empirical error criterion. Moreover, it turns out that the criterion produces a less complex model with fewer support vectors. For multi-class data, the optimization strategy is adapted to the one-against-one data partitioning. The approach is then evaluated on images of handwritten digits from the USPS database. (c) 2005 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
引用
收藏
页码:1733 / 1745
页数:13
相关论文
共 42 条
[1]  
AYAT ME, 2002, P IEEE INT C PATT RE, V3, P30331
[2]  
Ayat NE, 2002, LECT NOTES COMPUT SC, V2388, P354
[3]   Generalized discriminant analysis using a kernel approach [J].
Baudat, G ;
Anouar, FE .
NEURAL COMPUTATION, 2000, 12 (10) :2385-2404
[4]  
BLANZ V, 1996, LNCS, V1112, P251
[5]  
BROMLEY J, 1991, 1135991081916TM ATT
[6]  
Burges CJC, 1997, ADV NEUR IN, V9, P375
[7]  
Chapelle O, 2000, ADV NEUR IN, V12, P230
[8]   Choosing multiple parameters for support vector machines [J].
Chapelle, O ;
Vapnik, V ;
Bousquet, O ;
Mukherjee, S .
MACHINE LEARNING, 2002, 46 (1-3) :131-159
[9]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[10]  
Courant, 1953, METHODS MATH PHYS