On the Learnability and Design of Output Codes for Multiclass Problems

被引:23
作者
Koby Crammer
Yoram Singer
机构
[1] The Hebrew University,School of Computer Science & Engineering
来源
Machine Learning | 2002年 / 47卷
关键词
multiclass categorization; output coding; SVM;
D O I
暂无
中图分类号
学科分类号
摘要
Output coding is a general framework for solving multiclass categorization problems. Previous research on output codes has focused on building multiclass machines given predefined output codes. In this paper we discuss for the first time the problem of designing output codes for multiclass problems. For the design problem of discrete codes, which have been used extensively in previous works, we present mostly negative results. We then introduce the notion of continuous codes and cast the design problem of continuous codes as a constrained optimization problem. We describe three optimization problems corresponding to three different norms of the code matrix. Interestingly, for the l2 norm our formalism results in a quadratic program whose dual does not depend on the length of the code. A special case of our formalism provides a multiclass scheme for building support vector machines which can be solved efficiently. We give a time and space efficient algorithm for solving the quadratic program. We describe preliminary experiments with synthetic data show that our algorithm is often two orders of magnitude faster than standard quadratic programming packages. We conclude with the generalization properties of the algorithm.
引用
收藏
页码:201 / 233
页数:32
相关论文
共 15 条
  • [1] Aha D. W.(1997)Cloud classification using error-correcting output codes Artificial Intelligence Applications: Natural Science, Agriculture, and Environmental Science 11 13-28
  • [2] Bankert R. L.(1995)Support-vector networks Machine Learning 20 273-297
  • [3] Cortes C.(1995)Solving multiclass learning problems via error-correcting output codes Journal of Artificial Intelligence Research 2 263-286
  • [4] Vapnik V.(1998)Classification by pairwise coupling The Annals of Statistics 26 451-471
  • [5] Dietterich T. G.(1995)Robust trainability of single neurons Journal of Computer and System Sciences 50 114-125
  • [6] Bakiri G.(1998)The error coding method and PiCT Journal of Computational and Graphical Stastistics 7 377-387
  • [7] Hastie T.(1999)Improved boosting algorithms using confidence-rated predictions Machine Learning 37 1-40
  • [8] Tibshirani R.(undefined)undefined undefined undefined undefined-undefined
  • [9] Höffgen K. U.(undefined)undefined undefined undefined undefined-undefined
  • [10] Horn K. S. V.(undefined)undefined undefined undefined undefined-undefined