Multi-label classification by exploiting label correlations

被引:95
作者
Yu, Ying [1 ,2 ,3 ,5 ]
Pedrycz, Witold [2 ,4 ]
Miao, Duoqian [1 ,3 ]
机构
[1] Tongji Univ, Dept Comp Sci & Technol, Shanghai 201804, Peoples R China
[2] Univ Alberta, Dept Elect & Comp Engn, Edmonton, AB T6G 2G7, Canada
[3] Tongji Univ, Key Lab Embedded Syst & Serv Comp, Minist Educ, Shanghai 201804, Peoples R China
[4] Polish Acad Sci, Syst Res Inst, PL-01447 Warsaw, Poland
[5] Jiangxi Agr Univ, Sch Software, Nanchang 330013, Peoples R China
基金
中国国家自然科学基金;
关键词
Multi-label classification; Rough sets; Uncertainty; Correlation; DECISION TREES;
D O I
10.1016/j.eswa.2013.10.030
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays, multi-label classification methods are of increasing interest in the areas such as text categorization, image annotation and protein function classification. Due to the correlation among the labels, traditional single-label classification methods are not directly applicable to the multi-label classification problem. This paper presents two novel multi-label classification algorithms based on the variable precision neighborhood rough sets, called multi-label classification using rough sets (MLRS) and MLRS using local correlation (MLRS-LC). The proposed algorithms consider two important factors that affect the accuracy of prediction, namely the correlation among the labels and the uncertainty that exists within the mapping between the feature space and the label space. MLRS provides a global view at the label correlation while MLRS-LC deals with the label correlation at the local level. Given a new instance, MLRS determines its location and then computes the probabilities of labels according to its location. The MLRS-LC first finds out its topic and then the probabilities of new instance belonging to each class is calculated in related topic. A series of experiments reported for seven multi-label datasets show that MLRS and MLRS-LC achieve promising performance when compared with some well-known multi-label learning algorithms. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:2989 / 3004
页数:16
相关论文
共 31 条
[1]  
[Anonymous], 2008, ISMIR
[2]   Learning multi-label scene classification [J].
Boutell, MR ;
Luo, JB ;
Shen, XP ;
Brown, CM .
PATTERN RECOGNITION, 2004, 37 (09) :1757-1771
[3]  
De Comité F, 2003, LECT NOTES ARTIF INT, V2734, P35
[4]   Representing uncertainty on set-valued variables using belief functions [J].
Denoeux, Thierry ;
Youne, Zoulficar ;
Abdallah, Fahed .
ARTIFICIAL INTELLIGENCE, 2010, 174 (7-8) :479-499
[5]  
Duygulu P., 2006, COMP VIS ECCV 2002, P97
[6]  
Elisseeff A, 2002, ADV NEUR IN, V14, P681
[7]  
Fan R, 2007, STUDY THRESHOLD SELE
[8]  
Godbole S, 2004, LECT NOTES ARTIF INT, V3056, P22
[9]   Mixed feature selection based on granulation and approximation [J].
Hu, Qinghua ;
Liu, Jinfu ;
Yu, Daren .
KNOWLEDGE-BASED SYSTEMS, 2008, 21 (04) :294-304
[10]   Neighborhood classifiers [J].
Hu, Qinghua ;
Yu, Daren ;
Me, Zongxia .
EXPERT SYSTEMS WITH APPLICATIONS, 2008, 34 (02) :866-876