Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation

Cited by: 385
Authors
Hu, Qinghua [1]
Xie, Zongxia [1]
Yu, Daren [1]
Affiliations
[1] Harbin Inst Technol, Harbin 150001, Peoples R China
Keywords
numerical feature; categorical feature; feature selection; attribute reduction; fuzzy set; rough set; inclusion degree
DOI
10.1016/j.patcog.2007.03.017
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Feature subset selection has become an important challenge in pattern recognition, machine learning, and data mining. Because numerical and categorical features carry different semantics, there are two strategies for selecting hybrid attributes: discretizing numerical variables or numericalizing categorical features. In this paper, we introduce a simple and efficient hybrid attribute reduction algorithm based on a generalized fuzzy-rough model. A theoretical framework for the fuzzy-rough model based on fuzzy relations is presented, which lays the foundation for algorithm construction. We derive several attribute significance measures from the proposed fuzzy-rough model and construct a forward greedy algorithm for hybrid attribute reduction. The experiments show that using variable precision fuzzy inclusion to compute the decision positive region yields the best classification performance: the number of selected features is the smallest and the accuracy is the highest. (c) 2007 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
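The forward greedy search mentioned in the abstract can be illustrated with a minimal sketch. It is not the paper's model: it assumes a common textbook fuzzy-rough dependency (min-combined similarity relations, lower approximation computed as 1 minus the largest similarity to any sample of a different class) and does not reproduce the variable precision fuzzy inclusion. The names `similarity`, `attribute_spans`, `dependency`, and `greedy_reduct`, the similarity formula, and the toy data are all assumptions made for illustration.

```python
def similarity(a, b, span):
    """Assumed fuzzy similarity of two attribute values.

    span is None for a categorical attribute (crisp equality),
    otherwise the value range of a numerical attribute.
    """
    if span is None:
        return 1.0 if a == b else 0.0
    if span == 0:
        return 1.0
    return max(0.0, 1.0 - abs(a - b) / span)


def attribute_spans(rows, categorical):
    """Return None for categorical columns, max - min for numerical ones."""
    spans = []
    for k in range(len(rows[0])):
        if k in categorical:
            spans.append(None)
        else:
            column = [row[k] for row in rows]
            spans.append(max(column) - min(column))
    return spans


def dependency(rows, labels, subset, spans):
    """Mean membership of the samples in the fuzzy positive region."""
    total = 0.0
    for i, x in enumerate(rows):
        lower = 1.0
        for j, y in enumerate(rows):
            if labels[i] == labels[j]:
                continue
            # Combine per-attribute similarities with min; an empty
            # subset makes every pair fully similar (relation = 1).
            rel = min((similarity(x[k], y[k], spans[k]) for k in subset),
                      default=1.0)
            lower = min(lower, 1.0 - rel)
        total += lower
    return total / len(rows)


def greedy_reduct(rows, labels, spans, eps=1e-9):
    """Add the attribute with the largest dependency until no gain remains."""
    selected = []
    best = dependency(rows, labels, selected, spans)
    remaining = list(range(len(spans)))
    while remaining:
        scored = [(dependency(rows, labels, selected + [k], spans), k)
                  for k in remaining]
        score, k = max(scored, key=lambda pair: pair[0])
        if score - best <= eps:
            break
        selected.append(k)
        remaining.remove(k)
        best = score
    return selected, best


# Toy hybrid data (illustrative only): columns 0 and 2 are numerical,
# column 1 is categorical.
rows = [(0.1, "a", 5.0), (0.2, "a", 1.0), (0.9, "b", 5.2), (1.0, "b", 0.8)]
labels = [0, 0, 1, 1]
spans = attribute_spans(rows, categorical={1})
print(greedy_reduct(rows, labels, spans))
```

On this toy data the categorical column separates the two classes completely, so the search stops after selecting it. The stopping threshold `eps` is a design choice of the sketch, not a parameter taken from the paper.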
Pages: 3509-3521
Number of pages: 13
Related papers
59 in total
[11]   ROUGH FUZZY-SETS AND FUZZY ROUGH SETS [J].
DUBOIS, D ;
PRADE, H .
INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 1990, 17 (2-3) :191-209
[12]   Generating an interpretable family of fuzzy partitions from data [J].
Guillaume, S ;
Charnomordic, B .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2004, 12 (03) :324-335
[13]  
Guyon I, 2003, J MACH LEARN RES, P1157, DOI 10.1016/j.aca.2011.07.027
[14]   Fuzzy probabilistic approximation spaces and their information measures [J].
Hu, QH ;
Yu, DR ;
Xie, ZX ;
Liu, JF .
IEEE TRANSACTIONS ON FUZZY SYSTEMS, 2006, 14 (02) :191-201
[15]   Information-preserving hybrid data reduction based on fuzzy-rough techniques [J].
Hu, QH ;
Yu, DR ;
Xie, ZX .
PATTERN RECOGNITION LETTERS, 2006, 27 (05) :414-423
[16]  
Hu QH, 2005, LECT NOTES ARTIF INT, V3613, P494
[17]   Entropies of fuzzy indiscernibility relation and its operations [J].
Hu, QH ;
Yu, DR .
INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2004, 12 (05) :575-589
[18]   Semantics-preserving dimensionality reduction: Rough and fuzzy-rough-based approaches [J].
Jensen, R ;
Shen, Q .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2004, 16 (12) :1457-1471
[19]   Fuzzy-rough attribute reduction with application to web categorization [J].
Jensen, R ;
Shen, Q .
FUZZY SETS AND SYSTEMS, 2004, 141 (03) :469-485
[20]  
JENSON R, P IEEE INT C FUZZ SY, P29