Text feature selection using ant colony optimization

被引:276
作者
Aghdam, Mehdi Hosseinzadeh [1 ]
Ghasem-Aghaee, Nasser [1 ]
Basiri, Mohammad Ehsan [1 ]
机构
[1] Univ Isfahan, Fac Engn, Dept Comp Engn, Esfahan, Iran
关键词
Feature selection; Ant colony optimization; Generic algorithm; Text categorization; REDUCTION;
D O I
10.1016/j.eswa.2008.08.022
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
Feature selection and feature extraction are the most important steps in classification systems. Feature selection is commonly used to reduce dimensionality of datasets with tens or hundreds of thousands of features which Would be impossible to process further. One of the problems in which feature selection is essential is text categorization. A major problem of text categorization is the high dimensionality of the feature space; therefore, feature selection is the most important step in text categorization. At present there are many methods to deal with text feature selection. To improve the performance of text categorization, we present a novel feature selection algorithm that is based on ant colony optimization. Ant colony optimization algorithm is inspired by observation on real ants in their search for the shortest paths to food sources. Proposed algorithm is easily implemented and because of use of a simple classifier in that, its computational complexity is very low. The performance of proposed algorithm is compared to the performance of genetic algorithm, information gain and CHI on the task of feature selection in Reuters-21578 dataset. Simulation results on Reuters-21578 dataset show the Superiority of the proposed algorithm. (c) 2008 Elsevier Ltd. All rights reserved.
引用
收藏
页码:6843 / 6853
页数:11
相关论文
共 42 条
[1]
Al-Ani A, 2005, PROC WRLD ACAD SCI E, V4, P35
[2]
[Anonymous], 1999, Reuters-21578 Text Categorization Test Collection
[3]
[Anonymous], P 14 INT C MACH LEAR
[4]
[Anonymous], 2004, IEEE Comput. Intell. Bull.
[5]
Basiri ME, 2008, LECT NOTES COMPUT SC, V4973, P12, DOI 10.1007/978-3-540-78757-0_2
[6]
Bins J, 2000, THESIS COLORADO STAT
[7]
The hyper-cube framework for ant colony optimization [J].
Blum, C ;
Dorigo, M .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2004, 34 (02) :1161-1172
[8]
Bonabeau E., 1999, Swarm intelligence: from natural to artificial systems, DOI [10.1093/oso/9780195131581.001.0001, DOI 10.1093/OSO/9780195131581.001.0001]
[9]
CAROPRESO MF, 2006, BAG WORDS TEXT REPRE, P324
[10]
Ant colony optimization theory: A survey [J].
Dorigo, M ;
Blum, C .
THEORETICAL COMPUTER SCIENCE, 2005, 344 (2-3) :243-278