On the algorithmic implementation of multiclass kernel-based vector machines

被引:799
作者
Crammer, K [1 ]
Singer, Y [1 ]
机构
[1] Hebrew Univ Jerusalem, Sch Engn & Comp Sci, IL-91904 Jerusalem, Israel
关键词
multiclass problems; SVM; kernel machines;
D O I
10.1162/15324430260185628
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we describe the algorithmic implementation of multiclass kernel-based vector machines. Our starting point is a generalized notion of the margin to multiclass problems. Using this notion we cast multiclass categorization problems as a constrained optimization problem with a quadratic objective function. Unlike most of previous approaches which typically decompose a multiclass problem into multiple independent binary classification tasks, our notion of margin yields a direct method for training multiclass predictors. By using the dual of the optimization problem we are able to incorporate kernels with a compact set of constraints and decompose the dual problem into multiple optimization problems of reduced size. We describe an efficient fixed-point algorithm for solving the reduced optimization problems and prove its convergence. We then discuss technical details that yield significant running time improvements for large datasets. Finally, we describe various experiments with our approach comparing it to previously studied kernel-based methods. Our experiments indicate that for multiclass problems we attain state-of-the-art accuracy.
引用
收藏
页码:265 / 292
页数:28
相关论文
共 30 条
  • [1] ALLWEIN EL, 2000, MACHING LEARNING
  • [2] [Anonymous], 1999, SUPPORT VECTOR MACHI
  • [3] [Anonymous], 1997, THESIS
  • [4] [Anonymous], 1990, SUPPORT VECTOR LEARN
  • [5] [Anonymous], MACHINE LEARNING
  • [6] Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
  • [7] Multicategory classification by support vector machines
    Bredensteiner, EJ
    Bennett, KP
    [J]. COMPUTATIONAL OPTIMIZATION AND APPLICATIONS, 1999, 12 (1-3) : 53 - 79
  • [8] Bregman LM, 1967, USSR Computational Mathematics and Mathematical Physics, V7, P200
  • [9] Breiman L., 1984, BIOMETRICS, DOI DOI 10.2307/2530946
  • [10] A tutorial on Support Vector Machines for pattern recognition
    Burges, CJC
    [J]. DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) : 121 - 167