Text categorization based on regularized linear classification methods

被引:244
作者
Zhang, T [1 ]
Oles, FJ [1 ]
机构
[1] IBM Corp, Thomas J Watson Res Ctr, Dept Math Sci, Yorktown Hts, NY 10598 USA
来源
INFORMATION RETRIEVAL | 2001年 / 4卷 / 01期
关键词
text categorization; linear classification; regularization;
D O I
10.1023/A:1011441423217
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A number of linear classification methods such as the linear least squares fit (LLSF), logistic regression, and support vector machines (SVM's) have been applied to text categorization problems. These methods share the similarity by finding hyperplanes that approximately separate a class of document vectors from its complement. However, support vector machines are so far considered special in that they have been demonstrated to achieve the state of the art performance. It is therefore worthwhile to understand whether such good performance is unique to the SVM design, or if it can also be achieved by other linear classification methods. In this paper, we compare a number of known linear classification methods as well as some variants in the framework of regularized linear systems. We will discuss the statistical and numerical properties of these algorithms, with a focus on text categorization. We will also provide some numerical experiments to illustrate these algorithms on a number of datasets.
引用
收藏
页码:5 / 31
页数:27
相关论文
共 27 条
[1]  
[Anonymous], 1990, PERCEPTRONS
[2]  
[Anonymous], ADV LARGE MARGIN CLA
[3]   AUTOMATED LEARNING OF DECISION RULES FOR TEXT CATEGORIZATION [J].
APTE, C ;
DAMERAU, F ;
WEISS, SM .
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 1994, 12 (03) :233-251
[4]  
COOPER WS, 1992, SIGIR 92 : PROCEEDINGS OF THE FIFTEENTH ANNUAL INTERNATIONAL ACM SIGIR CONFERENCE ON RESEARCH AND DEVELOPMENT IN INFORMATION RETRIEVAL, P198
[5]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[6]  
Dumais S., 1998, Proceedings of the 1998 ACM CIKM International Conference on Information and Knowledge Management, P148, DOI 10.1145/288627.288651
[7]  
Fuhr N., 1991, SIGIR Forum, P46
[8]  
Gey F. C., 1994, SIGIR '94. Proceedings of the Seventeenth Annual International ACM-SIGIR Conference on Research and Development in Information Retrieval, P222
[9]  
Goulub G.H., 1996, Matrix Computations, Vthird
[10]  
Hastie T., 1990, Generalized additive model