Fuzzy-rough attribute reduction with application to web categorization

被引:375
作者
Jensen, R [1 ]
Shen, Q [1 ]
机构
[1] Univ Edinburgh, Div Informat, Ctr Intelligent Syst Applicat, Edinburgh, Midlothian, Scotland
基金
英国工程与自然科学研究理事会;
关键词
attribute reduction; web categorization; data redundancy;
D O I
10.1016/S0165-0114(03)00021-6
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Due to the explosive growth of electronically stored information, automatic methods must be developed to aid users in maintaining and using this abundance of information effectively. In particular, the sheer volume of redundancy present must be dealt with, leaving only the information-rich data to be processed. This paper presents a novel approach, based on an integrated use of fuzzy and rough set theories, to greatly reduce this data redundancy. Formal concepts of fuzzy-rough attribute reduction are introduced and illustrated with a simple example. The work is applied to the problem of web categorization, considerably reducing dimensionality with minimal loss of information. Experimental results show that fuzzy-rough reduction is more powerful than the conventional rough set-based approach. Classifiers that use a lower dimensional set of attributes which are retained by fuzzy-rough reduction outperform those that employ more attributes returned by the existing crisp rough reduction method. (C) 2003 Elsevier B.V. All rights reserved.
引用
收藏
页码:469 / 485
页数:17
相关论文
共 25 条
[1]  
[Anonymous], ROUGH FUZZY HYBRIDIZ
[2]  
[Anonymous], 2001, AS PAC C WEB INT
[3]  
CHOUCHOULAS A, 2002, P 2002 UK WORKSH COM, P18
[4]  
Dash M., 1997, Intelligent Data Analysis, V1
[5]  
Devijver P., 1982, PATTERN RECOGN
[6]  
Dubois D., 1992, Putting Rough Sets and Fuzzy Sets Together, P203, DOI [10.1007/978-94-015-7975-9_14, DOI 10.1007/978-94-015-7975-9_14]
[7]  
DUNTSCH I, 1999, ROUGH SET DATA ANAL
[8]   QUOTIENTS WITH RESPECT TO SIMILARITY RELATIONS [J].
HOHLE, U .
FUZZY SETS AND SYSTEMS, 1988, 27 (01) :31-44
[9]  
Jensen R, 2002, PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOL 1 & 2, P29, DOI 10.1109/FUZZ.2002.1004954
[10]  
KIRA K, 1992, AAAI-92 PROCEEDINGS : TENTH NATIONAL CONFERENCE ON ARTIFICIAL INTELLIGENCE, P129