An Improved Algorithm for the Solution of the Regularization Path of Support Vector Machine

被引:46
作者
Ong, Chong-Jin [1 ,2 ]
Shao, Shiyun [1 ]
Yang, Jianbo [1 ]
机构
[1] Natl Univ Singapore, Dept Mech Engn, Singapore 117576, Singapore
[2] Singapore MIT Alliance, Singapore 117576, Singapore
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2010年 / 21卷 / 03期
关键词
Numerical solutions of SVM; parametric solution of SVM; regularization path; support vector machine (SVM); SELECTION;
D O I
10.1109/TNN.2009.2039000
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper describes an improved algorithm for the numerical solution to the support vector machine (SVM) classification problem for all values of the regularization parameter C. The algorithm is motivated by the work of Hastie et al. and follows the main idea of tracking the optimality conditions of the SVM solution for ascending value of C. It differs from Hastie's approach in that the tracked path is not assumed to be 1-D. Instead, a multidimensional feasible space for the optimality condition is used to solve the tracking problem. Such a treatment allows the algorithm to properly handle data sets which Hastie's approach fails. These data sets are characterized by the presence of linearly dependent points (in the kernel space), duplicate points, or nearly duplicate points. Such data sets are quite common among many real-world data, especially those with nominal features. Other contributions of this paper include a unifying formulation of the tracking process in the form of a linear programming problem, update formula for the linear programs, considerations that guard against accumulation of errors resulting from the use of incremental updates, and routines to speed up the algorithm. The algorithm is implemented under the Matlab environment and is available for download. Experiments with several data sets including data set having up to several thousand data points are reported.
引用
收藏
页码:451 / 462
页数:12
相关论文
共 24 条
  • [1] [Anonymous], 2007, Uci machine learning repository
  • [2] Businger P., 1965, NUMER MATH, V7, P269, DOI DOI 10.1007/BF01436084
  • [3] LIBSVM: A Library for Support Vector Machines
    Chang, Chih-Chung
    Lin, Chih-Jen
    [J]. ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
  • [4] Collins M, 2008, J MACH LEARN RES, V9, P1775
  • [5] DeCoste D., 2000, Proceedings. KDD-2000. Sixth ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, P345, DOI 10.1145/347090.347165
  • [6] Fan RE, 2005, J MACH LEARN RES, V6, P1889
  • [7] Gill PE., 1991, Numerical Linear Algebra and Optimization
  • [8] GOLUB HG, 1989, MATRIX COMPUTATIONS
  • [9] Efficient computation and model selection for the support vector regression
    Gunter, Lacey
    Zhu, Ji
    [J]. NEURAL COMPUTATION, 2007, 19 (06) : 1633 - 1655
  • [10] Hastie T, 2004, J MACH LEARN RES, V5, P1391