Supervised hybrid feature selection based on PSO and rough sets for medical diagnosis

被引:265
作者
Inbarani, H. Hannah [1 ]
Azar, Ahmad Taher [2 ]
Jothi, G. [3 ]
机构
[1] Periyar Univ, Dept Comp Sci, Salem 636011, Tamil Nadu, India
[2] Benha Univ, Fac Comp & Informat, Banha, Egypt
[3] Sona Coll Technol, Dept IT, Salem 636005, Tamil Nadu, India
关键词
Particle Swarm Optimization (PSO); Rough sets; Feature Selection (FS); Relative Reduct; Quick Reduct; FEATURE SUBSET-SELECTION; CLASSIFICATION; ALGORITHM;
D O I
10.1016/j.cmpb.2013.10.007
中图分类号
TP39 [计算机的应用];
学科分类号
080201 [机械制造及其自动化];
摘要
Medical datasets are often classified by a large number of disease measurements and a relatively small number of patient records. All these measurements (features) are not important or irrelevant/noisy. These features may be especially harmful in the case of relatively small training sets, where this irrelevancy and redundancy is harder to evaluate. On the other hand, this extreme number of features carries the problem of memory usage in order to represent the dataset. Feature Selection (FS) is a solution that involves finding a subset of prominent features to improve predictive accuracy and to remove the redundant features. Thus, the learning model receives a concise structure without forfeiting the predictive accuracy built by using only the selected prominent features. Therefore, nowadays, FS is an essential part of knowledge discovery. In this study, new supervised feature selection methods based on hybridization of Particle Swarm Optimization (PSO), PSO based Relative Reduct (PSO-RR) and PSO based Quick Reduct (PSO-QR) are presented for the diseases diagnosis. The experimental result on several standard medical datasets proves the efficiency of the proposed technique as well as enhancements over the existing feature selection techniques. (C) 2013 Elsevier Ireland Ltd. All rights reserved.
引用
收藏
页码:175 / 185
页数:11
相关论文
共 41 条
[1]
GMDH-based feature ranking and selection for improved classification of medical data [J].
Abdel-Aal, RE .
JOURNAL OF BIOMEDICAL INFORMATICS, 2005, 38 (06) :456-468
[2]
[Anonymous], 2012, INT J ENG RES TECHNO
[3]
[Anonymous], UCI Repository of machine learning databases
[4]
[Anonymous], ROUGH COMPUTING THEO
[5]
[Anonymous], P INT C DAT MIN JUN
[6]
A GRASP algorithm for fast hybrid (filter-wrapper) feature subset selection in high-dimensional datasets [J].
Bermejo, Pablo ;
Gamez, Jose A. ;
Puerta, Jose M. .
PATTERN RECOGNITION LETTERS, 2011, 32 (05) :701-711
[7]
An attribute weight assignment and particle swarm optimization algorithm for medical database classifications [J].
Chang, Pei-Chann ;
Lin, Jyun-Jie ;
Liu, Chen-Hao .
COMPUTER METHODS AND PROGRAMS IN BIOMEDICINE, 2012, 107 (03) :382-392
[8]
A support vector machine classifier with rough set-based feature selection for breast cancer diagnosis [J].
Chen, Hui-Ling ;
Yang, Bo ;
Liu, Jie ;
Liu, Da-You .
EXPERT SYSTEMS WITH APPLICATIONS, 2011, 38 (07) :9014-9022
[9]
Particle swarm optimization for feature selection with application in obstructive sleep apnea diagnosis [J].
Chen, Li-Fei ;
Su, Chao-Ton ;
Chen, Kun-Huang ;
Wang, Pa-Chun .
NEURAL COMPUTING & APPLICATIONS, 2012, 21 (08) :2087-2096
[10]
Eberhart R., 1995, MHS 95, P39, DOI [DOI 10.1109/MHS.1995.494215, 10.1109/MHS.1995.494215]