Random k-Labelsets for Multilabel Classification

被引:846
作者
Tsoumakas, Grigorios [1 ]
Katakis, Ioannis [1 ]
Vlahavas, Ioannis [1 ]
机构
[1] Aristotle Univ Thessaloniki, Dept Informat, Thessaloniki 54124, Greece
关键词
Categorization; multilabel; ensembles; labelset; classification;
D O I
10.1109/TKDE.2010.164
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
A simple yet effective multilabel learning method, called label powerset (LP), considers each distinct combination of labels that exist in the training set as a different class value of a single-label classification task. The computational efficiency and predictive performance of LP is challenged by application domains with large number of labels and training examples. In these cases, the number of classes may become very large and at the same time many classes are associated with very few training examples. To deal with these problems, this paper proposes breaking the initial set of labels into a number of small random subsets, called labelsets and employing LP to train a corresponding classifier. The labelsets can be either disjoint or overlapping depending on which of two strategies is used to construct them. The proposed method is called RAkEL (RAndom k labELsets), where k is a parameter that specifies the size of the subsets. Empirical evidence indicates that RAkEL manages to improve substantially over LP, especially in domains with large number of labels and exhibits competitive performance against other high-performing multilabel learning methods.
引用
收藏
页码:1079 / 1089
页数:11
相关论文
共 43 条
[1]
[Anonymous], P 2008 IEEE INT JOIN
[2]
[Anonymous], 2008, ISMIR
[3]
Learning multi-label scene classification [J].
Boutell, MR ;
Luo, JB ;
Shen, XP ;
Brown, CM .
PATTERN RECOGNITION, 2004, 37 (09) :1757-1771
[4]
Brinker K, 2007, 20TH INTERNATIONAL JOINT CONFERENCE ON ARTIFICIAL INTELLIGENCE, P702
[5]
LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[6]
Chen W, 2007, PROC MONOGR ENG WATE, P451
[7]
Clare A., 2001, Lecture Notes in Computer Science, P42
[8]
A family of additive online algorithms for category ranking [J].
Crammer, K ;
Singer, Y .
JOURNAL OF MACHINE LEARNING RESEARCH, 2003, 3 (06) :1025-1058
[9]
De Comité F, 2003, LECT NOTES ARTIF INT, V2734, P35
[10]
Demsar J, 2006, J MACH LEARN RES, V7, P1