Semantics-preserving dimensionality reduction: Rough and fuzzy-rough-based approaches

被引:465
作者
Jensen, R
Shen, Q
机构
[1] Univ Edinburgh, Sch Informat, Ctr Intelligent Syst & Their Applicat, Edinburgh EG8 9LE, Midlothian, Scotland
[2] Univ Wales, Dept Comp Sci, Aberystwyth SY23 3DB, Ceredigion, Wales
关键词
dimensionality reduction; feature selection; feature transformation; rough selection; fuzzy-rough selection;
D O I
10.1109/TKDE.2004.96
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Semantics-preserving dimensionality reduction refers to the problem of selecting those input features that are most predictive of a given outcome; a problem encountered in many areas such as machine learning, pattern recognition, and signal processing. This has found successful application in tasks that involve data sets containing huge numbers of features ( in the order of tens of thousands), which would be impossible to process further. Recent examples include text processing and Web content classification. One of the many successful applications of rough set theory has been to this feature selection area. This paper reviews those techniques that preserve the underlying semantics of the data, using crisp and fuzzy rough set-based methodologies. Several approaches to feature selection based on rough set theory are experimentally compared. Additionally, a new area in feature selection, feature grouping, is highlighted and a rough set-based feature grouping technique is detailed.
引用
收藏
页码:1457 / 1471
页数:15
相关论文
共 72 条
[61]  
STEFANOWSKI J, 2000, ROUGH SETS CURRENT T, P212
[62]   Rough set methods in feature selection and recognition [J].
Swiniarski, RW ;
Skowron, A .
PATTERN RECOGNITION LETTERS, 2003, 24 (06) :833-849
[63]  
THIELE H, 1998, CI3098 U DORTM
[64]  
VANRIJSBERGEN CJ, 1979, INFORMATION RETRIEVA
[65]   Reduction algorithms based on discernibility matrix: The ordered attributes method [J].
Wang, J ;
Wang, J .
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2001, 16 (06) :489-504
[66]   ROUGH SETS AND FUZZY-SETS - SOME REMARKS ON INTERRELATIONS [J].
WYGRALAK, M .
FUZZY SETS AND SYSTEMS, 1989, 29 (02) :241-243
[67]   Constructive and algebraic methods of the theory of rough sets [J].
Yao, YY .
INFORMATION SCIENCES, 1998, 109 (1-4) :21-47
[68]   FUZZY SETS [J].
ZADEH, LA .
INFORMATION AND CONTROL, 1965, 8 (03) :338-&
[69]   Using rough sets with heuristics for feature selection [J].
Zhong, N ;
Dong, J ;
Ohsuga, S .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 2001, 16 (03) :199-214
[70]   VARIABLE PRECISION ROUGH SET MODEL [J].
ZIARKO, W .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1993, 46 (01) :39-59