A family of additive online algorithms for category ranking

被引:108
作者
Crammer, K [1 ]
Singer, Y [1 ]
机构
[1] Hebrew Univ Jerusalem, Sch Engn & Comp Sci, IL-91904 Jerusalem, Israel
关键词
D O I
10.1162/153244303322533188
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We describe a new family of topic-ranking algorithms for multi-labeled documents. The motivation for the algorithms stem from recent advances in online learning algorithms. The algorithms are simple to implement and are also time and memory efficient. We provide a unified analysis of the family of algorithms in the mistake bound model. We then discuss experiments with the proposed family of topic-ranking algorithms on the Reuters-21578 corpus and the new corpus released by Reuters in 2000. On both corpora, the algorithms we present achieve state-of-the-art results and outperforms topic-ranking adaptations of Rocchio's algorithm and of the Perceptron algorithm.
引用
收藏
页码:1025 / 1058
页数:34
相关论文
共 19 条
[1]  
[Anonymous], MACHINE LEARNING
[2]  
COLLINS M, 2002, IN PRESS 30 ANN M AS
[3]  
CRAMMER K, 2001, ADV NEURAL INFORMATI, V14
[4]  
CRAMMER K, 2001, P 14 ANN C COMP LEAR
[5]  
Cristianini N, 2000, Intelligent Data Analysis: An Introduction
[6]  
ELISSEFF A, 2001, ADV NEURAL INFORMATI, V14
[7]  
FREUND Y, 1998, IN PRESS MACHINE LEA
[8]  
Freund Y., 1998, MACH LEARN P 15 INT
[9]  
Hart P.E., 1973, Pattern recognition and scene analysis
[10]   ON WEAK LEARNING [J].
HELMBOLD, DP ;
WARMUTH, MK .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1995, 50 (03) :551-573