Hybrid attribute reduction based on a novel fuzzy-rough model and information granulation

Cited by: 385
Authors
Hu, Qinghua [1]
Xie, Zongxia [1]
Yu, Daren [1]
Affiliations
[1] Harbin Inst Technol, Harbin 150001, Peoples R China
Keywords
numerical feature; categorical feature; feature selection; attribute reduction; fuzzy set; rough set; inclusion degree;
DOI
10.1016/j.patcog.2007.03.017
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
Feature subset selection has become an important challenge in pattern recognition, machine learning, and data mining. Because numerical and categorical features carry different semantics, there are two strategies for selecting hybrid attributes: discretizing numerical variables or numericalizing categorical features. In this paper, we introduce a simple and efficient hybrid attribute reduction algorithm based on a generalized fuzzy-rough model. A theoretical framework for the fuzzy-rough model based on fuzzy relations is presented, which lays the foundation for algorithm construction. We derive several attribute significance measures from the proposed fuzzy-rough model and construct a forward greedy algorithm for hybrid attribute reduction. The experiments show that using variable precision fuzzy inclusion to compute the decision positive region yields the best classification performance: the fewest features are selected while the accuracy is the highest. © 2007 Pattern Recognition Society. Published by Elsevier Ltd. All rights reserved.
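To make the forward greedy idea concrete, the following is a minimal sketch and not the authors' implementation. It assumes a classical fuzzy lower approximation (one minus the largest similarity to any sample of another class) as the significance measure, which is a simpler stand-in for the variable precision fuzzy inclusion the abstract describes. The function names, the linear similarity for numerical attributes, and the stopping threshold `eps` are all illustrative assumptions.

```python
def similarity(u, v, categorical, spread):
    """Hypothetical per-attribute fuzzy similarity in [0, 1]."""
    if categorical:
        return 1.0 if u == v else 0.0      # crisp equivalence relation
    if spread == 0:
        return 1.0                         # constant attribute: all samples alike
    return 1.0 - abs(u - v) / spread       # linear similarity on the value range


def dependency(rows, labels, subset, categorical, spreads):
    """Mean membership of the samples to the fuzzy positive region."""
    total = 0.0
    for i, x in enumerate(rows):
        lower = 1.0
        for j, y in enumerate(rows):
            if labels[i] == labels[j]:
                continue
            # Relation of a subset = minimum over its attributes (intersection).
            rel = min((similarity(x[a], y[a], categorical[a], spreads[a])
                       for a in subset), default=1.0)
            lower = min(lower, 1.0 - rel)
        total += lower
    return total / len(rows)


def forward_greedy_reduct(rows, labels, categorical, eps=1e-6):
    """Add the attribute with the largest dependency gain until the gain <= eps."""
    n_attrs = len(categorical)
    spreads = [0.0 if categorical[a] else
               max(r[a] for r in rows) - min(r[a] for r in rows)
               for a in range(n_attrs)]
    selected, best = [], 0.0
    remaining = list(range(n_attrs))
    while remaining:
        scores = {a: dependency(rows, labels, selected + [a], categorical, spreads)
                  for a in remaining}
        a_star = max(remaining, key=lambda a: scores[a])
        if scores[a_star] - best <= eps:
            break                          # no attribute improves the dependency
        selected.append(a_star)
        best = scores[a_star]
        remaining.remove(a_star)
    return selected, best


# Toy hybrid data: attribute 0 is numerical, attribute 1 is categorical.
rows = [(0.0, "a"), (0.0, "b"), (1.0, "a"), (1.0, "b")]
labels = [0, 0, 1, 1]
reduct, gamma = forward_greedy_reduct(rows, labels, categorical=[False, True])
print(reduct, gamma)
```

On this toy data the numerical attribute alone separates the two classes, so the sketch prints `[0] 1.0` and never adds the categorical attribute. Handling both attribute types through one similarity function is what lets the search run without discretizing or numericalizing anything first.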
Pages: 3509-3521
Number of pages: 13