A comparison study on multiple binary-class SVM methods for unilabel text categorization

Cited by: 47
Authors
Kumar, M. Arun [1]
Gopal, M. [1]
Affiliations
[1] Indian Institute of Technology Delhi, Department of Electrical Engineering, Control Group, New Delhi 110016, India
Keywords
Multiclass classification; One-against-all; One-against-one; Text categorization; Support vector machines (SVMs); Classification
DOI
10.1016/j.patrec.2010.02.015
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Multiclass support vector machine (SVM) methods are well studied in the recent literature. Comparison studies on UCI/Statlog multiclass datasets suggest using the one-against-one method for multiclass SVM classification. However, for unilabel (multiclass) text categorization with SVMs, no studies compare one-against-one with other methods, e.g. one-against-all and several well-known improvements to these approaches. In this paper, we bridge this gap by empirically comparing standard one-against-all and one-against-one, together with three improvements to these standard approaches, for unilabel text categorization with SVM as the base binary learner. We performed all our experiments on three standard text corpora using two types of document representation. The outcome of our experiments partly supports Rifkin and Klautau's (2004) statement that, for small-scale unilabel text categorization tasks, if the parameters of the classifiers are well tuned, one-against-all will perform better than one-against-one and other methods. (C) 2010 Elsevier B.V. All rights reserved.
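The two standard decompositions named in the abstract can be illustrated with a minimal sketch. It only shows how a K-class problem is split into binary tasks (K tasks for one-against-all, K(K-1)/2 for one-against-one) and how binary outputs might be combined. The function names, the class labels, and the max-wins voting rule for one-against-one are assumptions made for illustration; the record does not describe the paper's exact decoding schemes or its three improved variants, and no SVM is trained here.

```python
from itertools import combinations


def one_against_all_tasks(classes):
    """One binary task per class: that class (positive) vs. all the others."""
    return [(c, tuple(o for o in classes if o != c)) for c in classes]


def one_against_one_tasks(classes):
    """One binary task per unordered pair of classes."""
    return list(combinations(classes, 2))


def predict_one_against_all(scores):
    """Pick the class whose binary classifier gives the largest decision value.

    `scores` maps class -> decision value (hypothetical classifier outputs).
    """
    return max(scores, key=scores.get)


def predict_one_against_one(pair_winners):
    """Max-wins voting (an assumed decoding rule): each pairwise classifier
    casts one vote for the class it prefers; ties go to the class that
    received its first vote earliest."""
    votes = {}
    for winner in pair_winners.values():
        votes[winner] = votes.get(winner, 0) + 1
    return max(votes, key=votes.get)


# Hypothetical label set, not taken from the paper's corpora.
classes = ["sports", "politics", "science", "business"]
ova = one_against_all_tasks(classes)
ovo = one_against_one_tasks(classes)
print(len(ova), len(ovo))  # prints "4 6"
```

With four classes, one-against-all needs four binary learners, each trained on all documents, while one-against-one needs six, each trained only on the documents of its two classes.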
Pages: 1437-1444
Number of pages: 8