Feature selection based on rough sets and particle swarm optimization

被引:580
作者
Wang, Xiangyang [1 ]
Yang, Jie
Teng, Xiaolong
Xia, Weijun
Jensen, Richard
机构
[1] Shanghai Jiao Tong Univ, Inst Image Proc & Pattern Recognit, Shanghai 200030, Peoples R China
[2] Shanghai Jiao Tong Univ, Inst Automat, Shanghai 20030, Peoples R China
[3] Univ Wales, Dept Comp Sci, Aberystwyth SY23 3DB, Dyfed, Wales
关键词
feature selection; rough sets; reduct; genetic algorithms; particle swarm optimization; hill-climbing method; Stochastic method;
D O I
10.1016/j.patrec.2006.09.003
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a new feature selection strategy based on rough sets and particle swarm optimization (PSO). Rough sets have been used as a feature selection method with much success, but current hill-climbing rough set approaches to feature selection are inadequate at finding optimal reductions as no perfect heuristic can guarantee optimality. On the other hand, complete searches are not feasible for even medium-sized datasets. So, stochastic approaches provide a promising feature selection mechanism. Like Genetic Algorithms, PSO is a new evolutionary computation technique, in which each potential solution is seen as a particle with a certain velocity flying through the problem space. The Particle Swarms find optimal regions of the complex search space through the interaction of individuals in the population. PSO is attractive for feature selection in that particle swarms will discover best feature combinations as they fly within the subset space. Compared with GAs, PSO does not need complex operators such as crossover and mutation, it requires only primitive and simple mathematical operators, and is computationally inexpensive in terms of both memory and runtime. Experimentation is carried out, using UCI data, which compares the proposed algorithm with a GA-based approach and other deterministic rough set reduction algorithms. The results show that PSO is efficient for rough set-based feature selection. (c) 2006 Elsevier B.V. All rights reserved.
引用
收藏
页码:459 / 471
页数:13
相关论文
共 47 条
[1]  
[Anonymous], ROUGH SETS KNOWLEDGE
[2]  
[Anonymous], 1996, P C INF PROC MAN UNC
[3]  
[Anonymous], 1995, P INT WORKSHOP ROUGH
[4]  
Bazan JG, 2000, STUD FUZZ SOFT COMP, V56, P49
[5]  
Bazan Y., 1998, ROUGH SETS KNOWLEDGE, P321
[6]  
Bell DA, 1998, J AM SOC INFORM SCI, V49, P403, DOI 10.1002/(SICI)1097-4571(19980415)49:5<403::AID-ASI3>3.0.CO
[7]  
2-8
[8]  
BJORVAND AT, 1997, P 15 IMACS WORLD C S, V4, P601
[9]  
BJORVAND AT, 1997, 6 SCAND C ART INT SC
[10]  
Blake C.L., 1998, UCI repository of machine learning databases