A filter model for feature subset selection based on genetic algorithm

Cited by: 88
Authors
Elalami, M. E. [1]
Affiliation
[1] Mansoura Univ, Dept Comp Sci, Mansoura 35111, Egypt
Keywords
Feature subset selection; Relevant feature; Genetic algorithm; Artificial neural networks; Non-linear optimization; Fitness function
DOI
10.1016/j.knosys.2009.02.006
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
This paper describes a novel feature subset selection algorithm, which uses a genetic algorithm (GA) to optimize the output nodes of a trained artificial neural network (ANN). The new algorithm does not depend on the ANN training algorithm and does not modify the training results. The two groups of weights, between the input and hidden layers and between the hidden and output layers, are extracted after training the ANN on a given database. A general formula for each output node (class) of the ANN is then generated. This formula depends only on the input features, because the two groups of weights are constant. The dependency is represented by a non-linear exponential function. The GA is then used to find the optimal relevant features, which maximize the output function for each class. The features that are dominant in all classes form the feature subset selected from the input feature group. © 2009 Elsevier B.V. All rights reserved.
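The procedure summarized in the abstract can be illustrated with a minimal Python sketch. It is not the author's implementation, and everything specific in it is an assumption made for illustration: a single hidden layer with sigmoid activations and no biases, binary chromosomes that switch input features on or off, tournament selection, one-point crossover, bit-flip mutation, and a "dominant" feature defined as one switched on in the best chromosome of every class. The names `class_output`, `run_ga`, and `select_features` are hypothetical, as are the weight values in the usage note.

```python
import math
import random


def sigmoid(z):
    """Logistic function (assumed here as the 'non-linear exponential function')."""
    return 1.0 / (1.0 + math.exp(-z))


def class_output(x, w_ih, w_ho, k):
    """Output of class node k for input x, with both weight groups held constant.

    w_ih[h][j]: weight from input j to hidden node h (biases omitted, an assumption).
    w_ho[k][h]: weight from hidden node h to output node k.
    """
    hidden = [sigmoid(sum(w * xi for w, xi in zip(row, x))) for row in w_ih]
    return sigmoid(sum(w * h for w, h in zip(w_ho[k], hidden)))


def run_ga(fitness, n_bits, rng, pop_size=30, generations=40, p_mut=0.05):
    """Small generational GA over bit strings; returns the best final chromosome."""
    pop = [[rng.randint(0, 1) for _ in range(n_bits)] for _ in range(pop_size)]
    best = max(pop, key=fitness)
    for _ in range(generations):
        nxt = [best[:]]  # elitism: carry the current best forward unchanged
        while len(nxt) < pop_size:
            # tournament selection of two parents
            a = max(rng.sample(pop, 3), key=fitness)
            b = max(rng.sample(pop, 3), key=fitness)
            cut = rng.randrange(1, n_bits)  # one-point crossover (needs n_bits >= 2)
            child = a[:cut] + b[cut:]
            # bit-flip mutation
            child = [bit ^ 1 if rng.random() < p_mut else bit for bit in child]
            nxt.append(child)
        pop = nxt
        best = max(pop, key=fitness)
    return best


def select_features(w_ih, w_ho, seed=0):
    """Run one GA per class, then keep the features shared by all classes."""
    rng = random.Random(seed)
    n_features = len(w_ih[0])
    best_per_class = [
        run_ga(lambda x, k=k: class_output(x, w_ih, w_ho, k), n_features, rng)
        for k in range(len(w_ho))
    ]
    # Assumption: "dominant in all classes" means switched on in every
    # class's best chromosome.
    return [j for j in range(n_features) if all(b[j] for b in best_per_class)]
```

With hypothetical weights for four input features, two hidden nodes, and two classes, for example `select_features([[2.0, 1.5, -3.0, 0.1], [1.0, 2.5, -2.0, -0.2]], [[1.5, 1.0], [0.5, 2.0]])`, the function returns the indices of the features retained under these assumptions. A fixed `seed` makes repeated runs reproducible.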
Pages: 356-362
Page count: 7