Feature subset selection by Bayesian networks:: a comparison with genetic and sequential algorithms

被引:37
作者
Inza, I [1 ]
Larrañaga, P [1 ]
Sierra, B [1 ]
机构
[1] Univ Basque Country, Dept Comp Sci & Artificial Intelligence, E-20080 Donostia San Sebastian, Basque Country, Spain
关键词
feature subset selection; estimation of distribution algorithm; soft computing; estimation of Bayesian network algorithm; Bayesian network; predictive accuracy;
D O I
10.1016/S0888-613X(01)00038-X
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we perform a comparison among FSS-EBNA, a randomized, population-based and evolutionary algorithm, and two genetic and other two sequential search approaches in the well-known feature subset selection (FSS) problem. In FSS-EBNA, the FSS problem, stated as a search problem, uses the estimation of Bayesian network algorithm (EBNA) search engine, an algorithm within the estimation of distribution algorithm (EDA) approach. The EDA paradigm is born from the roots of the genetic algorithm (GA) community in order to explicitly discover the relationships among the features of the problem and not disrupt them by genetic recombination operators. The EDA paradigm avoids the use of recombination operators and it guarantees the evolution of the population of solutions and the discovery of these relationships by the factorization of the probability distribution of best individuals in each generation of the search. In EBNA, this factorization is carried out by a Bayesian network induced by a cheap local search mechanism. FSS-EBNA can be seen as a hybrid Soft Computing system, a synergistic combination of probabilistic and evolutionary computing to solve the FSS task. Promising results on a set of real Data Mining domains are achieved by FSS-EBNA in the comparison respect to well-known genetic and sequential search algorithms. (C) 2001 Elsevier Science Inc. All rights reserved.
引用
收藏
页码:143 / 164
页数:22
相关论文
共 60 条
[1]  
Acid S, 1995, LECT NOTES COMPUT SC, V945, P149, DOI 10.1007/BFb0035946
[2]  
AHA DW, 1994, P AAAI 94 WORKSH CAS, P106
[3]   Combined 5 x 2 cv F test for comparing supervised classification learning algorithms [J].
Alpaydin, E .
NEURAL COMPUTATION, 1999, 11 (08) :1885-1892
[4]  
[Anonymous], 1989, GENETIC ALGORITHM SE
[5]  
[Anonymous], P UAI
[6]  
[Anonymous], [No title captured], DOI DOI 10.1016/B978-1-55860-332-5.50055-9
[7]  
[Anonymous], 1990, SUBSET SELECTION REG, DOI DOI 10.1007/978-1-4899-2939-6
[8]  
Back T, 1996, EVOLUTIONARY ALGORIT
[9]  
Baluja S., 1994, POPULATION BASED INC
[10]  
Baluja S., 1997, Proceedings of the 14'th International Conference on Machine Learning, P30