Combining multiple classifiers using Dempster's rule for text categorization

被引:21
作者
Bi, Yaxin [1 ]
Bell, David
Wang, Hui
Guo, Gongde
Guan, Jiwen
机构
[1] Univ Ulster, Sch Comp & Math, Newtownabbey, Antrim, North Ireland
[2] Queens Univ Belfast, Sch Comp Sci, Belfast, Antrim, North Ireland
[3] Fujian Normal Univ, Dept Comp Sci, Fuzhou, Peoples R China
关键词
D O I
10.1080/08839510601170887
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we investigate the combination of four machine learning methods for text categorization using Dempster's rule of combination. These methods include Support Vector Machine (SVM), kNN (Nearest Neighbor), kNN model-based approach (kNNM), and Rocchio. We first present a general representation of the outputs of different classifiers, in particular, modeling it as a piece of evidence by using a novel evidence structure called focal element triplet. Furthermore, we investigate an effective method for combining pieces of evidence derived from classifiers generated by a 10-fold cross-validation. Finally, we evaluate our methods on the 20-newsgroup and Reuters-21578 benchmark data sets and perform the comparative analysis with majority voting in combining multiple classifiers along with the previous result. Our experimental results show that the best combined classifier can improve the performance of the individual classifiers and Dempster's rule of combination outperforms majority voting in combining multiple classifiers.
引用
收藏
页码:211 / 239
页数:29
相关论文
共 31 条
[1]  
BARNETT JA, 1981, P 7 JOINT C ART INT
[2]   The combination of text classifiers using reliability indicators [J].
Bennett, PN ;
Dumais, ST ;
Horvitz, E .
INFORMATION RETRIEVAL, 2005, 8 (01) :67-100
[3]  
BI Y, 2004, THESIS U ULSTER
[4]  
Boser B. E., 1992, Proceedings of the Fifth Annual ACM Workshop on Computational Learning Theory, P144, DOI 10.1145/130385.130401
[5]   LIBSVM: A Library for Support Vector Machines [J].
Chang, Chih-Chung ;
Lin, Chih-Jen .
ACM TRANSACTIONS ON INTELLIGENT SYSTEMS AND TECHNOLOGY, 2011, 2 (03)
[6]  
CORTES C, 1995, MACH LEARN, V20, P273, DOI 10.1023/A:1022627411411
[7]   A neural network classifier based on Dempster-Shafer theory [J].
Denoeux, T .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 2000, 30 (02) :131-150
[8]  
Dumais S., 1998, Proceedings of the 1998 ACM CIKM International Conference on Information and Knowledge Management, P148, DOI 10.1145/288627.288651
[9]  
Guan J.W., 1991, EVIDENCE THEORY ITS, VI
[10]  
Guan J.W., 1991, EVIDENCE THEORY ITS, VII