Filter-based optimization techniques for selection of feature subsets in ensemble systems

被引:48
作者
Santana, Laura Emmanuella A. dos S. [1 ]
de Paula Canuto, Anne M. [1 ]
机构
[1] Fed Univ RN, Informat & Appl Math Dept, BR-59072970 Natal, RN, Brazil
关键词
Ensemble systems; Feature selection; Particle swarm optimization; Ant colony optimization; Genetic algorithms; CLASSIFICATION; DIVERSITY; ACCURACY;
D O I
10.1016/j.eswa.2013.08.059
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
Feature selection methods select a subset of attributes (features) of a dataset and it is done based on a defined measure, eliminating the redundant and irrelevant ones. When a feature selection method is applied in a dataset, we aim to improve the quality of the dataset representation. For ensemble systems, feature selection techniques can supply different feature subsets for the individual components, reducing the redundancy that can exist among the features of an input pattern and to increase the diversity level of these systems. This paper proposes the application of three well-known optimization techniques (particle swarm optimization, ant-colony optimization and genetic algorithms), in both mono and bi-objective versions, to choose subsets of features for the individual components of ensembles. The feature selection process was based on two filter-based evaluation criteria that tried to capture the idea of diversity of individual classifiers and group diversity of an ensemble system. In this case, these optimization techniques try to maximize these diversities measures, either individually (mono-objective) or together (bi-objective). An empirical analysis was performed, where all ensemble systems were applied to 11 datasets and we compared both mono and bi-objective versions among each other and with a random subset procedure. Based on the empirical analysis, we will observe that PSO with a bi-objective function will be the most promising direction, when selecting attributes for individual components of ensemble systems. (C) 2013 Elsevier Ltd. All rights reserved.
引用
收藏
页码:1622 / 1631
页数:10
相关论文
共 36 条
[1]
[Anonymous], 2004, COMBINING PATTERN CL, DOI DOI 10.1002/0471660264
[2]
[Anonymous], TECHNICAL REPORT
[3]
[Anonymous], CORREALTION PARAMETR
[4]
[Anonymous], P 9 BIENN C AUSTR PA
[5]
[Anonymous], IEEE INT JOINT C NEU
[6]
[Anonymous], THESIS FEDERAL U RIO
[7]
[Anonymous], 1975, Ann Arbor
[8]
[Anonymous], 1992, THESIS DIPARTIMENTO
[9]
[Anonymous], 1995, 1995 IEEE INT C
[10]
[Anonymous], THESIS U ANTWERP ANT