A fast SVM training algorithm

被引:27
作者
Dong, JX [1 ]
Suen, CY
机构
[1] Concordia Univ, Ctr Pattern Recognit & Machine Intelligent, Montreal, PQ H3G 1M8, Canada
[2] Concordia Univ, Dept Comp Sci, Montreal, PQ H3G 1M8, Canada
关键词
support vector machine; kernel caching; working set selection; digest; shrinking; handwritten digit recognition;
D O I
10.1142/S0218001403002423
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A fast support vector machine (SVM) training algorithm is proposed under SVM's decomposition framework by effectively integrating kernel caching, digest and shrinking policies and stopping conditions. Kernel caching plays a key role in reducing the number of kernel evaluations by maximal reusage of cached kernel elements. Extensive experiments have been conducted on a large handwritten digit database MNIST to show that the proposed algorithm is much faster than Keerthi et al.'s improved SMO, about nine times. Combined with principal component analysis, the total training for ten one-against-the-rest classifiers on MNIST took less than an hour. Moreover, the proposed fast algorithm speeds up SVM training without sacrificing the generalization performance. The 0.6% error rate on MNIST test set has been achieved. The promising scalability of the proposed scheme paves a new way to solve more large-scale learning problems in other domains such as data mining.
引用
收藏
页码:367 / 384
页数:18
相关论文
共 24 条
[1]  
[Anonymous], 1999, CD9914 NAT U SING DE
[2]  
[Anonymous], 2005, EUR C MACH LEARN
[3]  
[Anonymous], 1990, SUPPORT VECTOR LEARN
[4]  
Bishop C. M., 1995, NEURAL NETWORKS PATT
[5]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[6]  
Burges CJC, 2000, ADV NEUR IN, V12, P223
[7]  
Cristianini N, 2000, Intelligent Data Analysis: An Introduction
[8]   Training invariant support vector machines [J].
Decoste, D ;
Schölkopf, B .
MACHINE LEARNING, 2002, 46 (1-3) :161-190
[9]  
DONG J, 2002, FAST SVM TRAINING AL
[10]  
DONG J, 2002, P INT C PATT REC QUE