Feature selection for ensembles applied to handwriting recognition

被引:31
作者
Oliveira, Luiz S.
Morita, Marisa
Sabourin, Robert
机构
[1] Pontificia Univ Catolica Parana, PPGIA, BR-80215901 Curitiba, Parana, Brazil
[2] Ecole Technol Super, Lab Imagerie Vis Intelligence Artificielle, Montreal, PQ H3C 1K3, Canada
关键词
ensemble of classifiers; feature selection; handwriting recognition; multi-objective optimization; genetic algorithms;
D O I
10.1007/s10032-005-0013-6
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feature selection for ensembles has shown to be an effective strategy for ensemble creation clue to its ability of producing good subsets of features, which make the classifiers of the ensemble disagree on difficult cases. In this paper we present an ensemble feature selection approach based on a hierarchical multi-objective genetic algorithm. The underpinning paradigm is the "overproduce and choose". The algorithm operates in two levels. Firstly, it performs feature selection in order to generate a set of classifiers and then it chooses the best team of classifiers. In order to show its robustness, the method is evaluated in two different contexts: supervised and unsupervised feature selection. In the former, we have considered the problem of handwritten digit recognition and used three different feature sets and multi-layer perceptron neural networks as classifiers. In the latter, we took into account the problem of handwritten month word recognition and used three different feature sets and hidden Markov models as classifiers. Experiments and comparisons with classical methods, such as Bagging and Boosting, demonstrated that the proposed methodology brings compelling improvements when classifiers have to work with very low error rates. Comparisons have been done by considering the recognition rates only.
引用
收藏
页码:262 / 279
页数:18
相关论文
共 51 条
[1]  
[Anonymous], [No title captured], DOI DOI 10.1145/347090.347169
[2]  
[Anonymous], 1991, Handbook of genetic algorithms
[3]  
Bishop C. M., 1996, Neural networks for pattern recognition
[4]  
Breiman L, 1996, MACH LEARN, V24, P49
[5]  
Cantu-Paz E., 2000, EFFICIENT ACCURATE P
[6]   Dual classifier system for handprinted alphanumeric character recognition [J].
Chim, YC ;
Kassim, AA ;
Ibrahim, Y .
PATTERN ANALYSIS AND APPLICATIONS, 1998, 1 (03) :155-162
[7]  
De Stefano C, 2003, PROC INT CONF DOC, P671
[8]  
DEB K, 1989, PROCEEDINGS OF THE THIRD INTERNATIONAL CONFERENCE ON GENETIC ALGORITHMS, P42
[9]  
Deb K, 2011, Multi-objective optimization using evolutionary algorithm
[10]  
DY JG, 2000, P 17 INT C MACH LEAR