Data weighting method on the basis of binary encoded output to solve multi-class pattern classification problems

被引:6
作者
Polat, Kemal [1 ]
机构
[1] Abant Izzet Baysal Univ, Fac Engn & Architecture, Dept Elect & Elect Engn, Bolu, Turkey
关键词
Data weighting; Binary encoded output based data weighting (BEOBDW); k-NN classifier; Multi-class data classification;
D O I
10.1016/j.eswa.2013.02.004
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
Data weighting is of paramount importance with respect to classification performance in pattern recognition applications. In this paper, the output labels of datasets have been encoded using binary codes (numbers) and by this way provided a novel data weighting method called binary encoded output based data weighting (BEOBDW). In the proposed data weighting method, first of all, the output labels of datasets have been encoded with binary codes and then obtained two encoded output labels. Depending to these encoded outputs, the data points in datasets have been weighted using the relationships between features of datasets and two encoded output labels. To generalize the proposed data weighting method, five datasets have been used. These datasets are chain link (2 classes), two spiral (2 classes), iris (3 classes), wine (3 classes), and dermatology (6 classes). After applied BEOBDW to five datasets, the kappa-NN (nearest neighbor) classifier has been used to classify the weighted datasets. A set of experiments on used real world datasets demonstrated that the proposed data weighting method is a very efficient and has robust discrimination ability in the classification of datasets. BEOBDW method could be confidently used before many classification algorithms. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:4637 / 4647
页数:11
相关论文
共 11 条
[1]
[Anonymous], 2013, UCI MACHINE LEARNING
[2]
[Anonymous], 2006, COMPSCI
[3]
Belur V., 1991, Nearest Neighbor (NN) Norms: NN Pattern Classification Techniques
[4]
Multi-label classification and extracting predicted class hierarchies [J].
Brucker, Florian ;
Benites, Fernando ;
Sapozhnikova, Elena .
PATTERN RECOGNITION, 2011, 44 (03) :724-738
[5]
A dynamic over-sampling procedure based on sensitivity for multi-class problems [J].
Fernandez-Navarro, Francisco ;
Hervas-Martinez, Cesar ;
Antonio Gutierrez, Pedro .
PATTERN RECOGNITION, 2011, 44 (08) :1821-1833
[6]
An overview of ensemble methods for binary classifiers in multi-class problems: Experimental study on one-vs-one and one-vs-all schemes [J].
Galar, Mikel ;
Fernandez, Alberto ;
Barrenechea, Edurne ;
Bustince, Humberto ;
Herrera, Francisco .
PATTERN RECOGNITION, 2011, 44 (08) :1761-1776
[7]
Shakhnarovich G., 2005, NEAREST NEIGHBOR MET
[8]
Statistical approaches to combining binary classifiers for multi-class classification [J].
Shiraishi, Yuichi ;
Fukumizu, Kenji .
NEUROCOMPUTING, 2011, 74 (05) :680-688
[9]
Sogukpinar I., 2008, EMO YAYINLARI, P120
[10]
A transitivity analysis of bipartite rankings in pairwise multi-class classification [J].
Waegeman, Willem ;
De Baets, Bernard .
INFORMATION SCIENCES, 2010, 180 (21) :4099-4117