A rough sets based characteristic relation approach for dynamic attribute generalization in data mining

Cited by: 274
Authors
Li, Tianrui [1]
Ruan, Da
Wets, Geert
Song, Jing
Xu, Yang
Affiliations
[1] SW Jiaotong Univ, Dept Math, Chengdu 610031, Peoples R China
[2] CEN SCK, Belgian Nucl Res Ctr, B-2400 Mol, Belgium
[3] Univ Ghent, Dept Appl Math & Comp Sci, B-9000 Ghent, Belgium
[4] Univ Hasselt, Dept Appl Econ Sci, B-3590 Diepenbeek, Belgium
Funding
Specialized Research Fund for the Doctoral Program of Higher Education; National Natural Science Foundation of China
Keywords
rough sets; knowledge discovery; data mining; incomplete information systems;
DOI
10.1016/j.knosys.2007.01.002
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Subject classification codes
081104; 0812; 0835; 1405
Abstract
The attribute set of an information system may evolve over time as new information arrives. Rough set approximations of a concept then need to be updated for data mining and related tasks. Methods for incrementally updating the approximations of a concept using the tolerance relation and the similarity relation have previously been studied in the literature. The characteristic relation-based rough sets approach provides more informative results than approaches based on the tolerance and similarity relations. In this paper, attribute generalization and its relation to feature selection and feature extraction are first discussed. Then, a new approach for incrementally updating the approximations of a concept is presented under characteristic relation-based rough sets. Finally, direct computation of rough set approximations is compared in performance with the proposed dynamic maintenance of rough set approximations. An extensive experimental evaluation on a large soybean database from MLC shows that the proposed approach effectively handles dynamic attribute generalization in data mining. (C) 2007 Elsevier B.V. All rights reserved.
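The idea summarized in the abstract can be illustrated with a minimal sketch. It is not the paper's algorithm. It assumes the common convention for incomplete information systems in which "?" marks a lost value and "*" marks a "do not care" value, and it assumes singleton-style lower and upper approximations. The table, the attribute names, and the helper functions (`attribute_set`, `characteristic_sets`, `approximations`, `add_attribute`) are all hypothetical, chosen only for illustration.

```python
LOST, DONT_CARE = "?", "*"  # assumed markers for missing values

def attribute_set(table, x, a):
    """Objects that cannot be told apart from x on attribute a alone."""
    v = table[x][a]
    if v in (LOST, DONT_CARE):
        return set(table)  # a missing value on x gives no constraint
    # A specified value matches equal values and "do not care" values,
    # but never a lost value.
    return {y for y, row in table.items() if row[a] in (v, DONT_CARE)}

def characteristic_sets(table, attrs):
    """Direct computation: intersect the per-attribute sets for every object."""
    result = {}
    for x in table:
        k = set(table)
        for a in attrs:
            k &= attribute_set(table, x, a)
        result[x] = k
    return result

def approximations(k_sets, concept):
    """Singleton-style lower and upper approximations of a concept."""
    lower = {x for x, k in k_sets.items() if k <= concept}
    upper = {x for x, k in k_sets.items() if k & concept}
    return lower, upper

def add_attribute(table, k_sets, a):
    """Incremental step: refine existing sets when attribute a is added."""
    return {x: k & attribute_set(table, x, a) for x, k in k_sets.items()}

# Hypothetical four-object table with missing values.
table = {
    1: {"temp": "high", "headache": "yes"},
    2: {"temp": "high", "headache": "?"},
    3: {"temp": "*", "headache": "no"},
    4: {"temp": "normal", "headache": "no"},
}
concept = {1, 2}

k_temp = characteristic_sets(table, ["temp"])
k_both = add_attribute(table, k_temp, "headache")
lower, upper = approximations(k_both, concept)
```

In this sketch, adding an attribute can only shrink each characteristic set, so the existing sets are refined by one intersection per object rather than recomputed over all attributes. That is the intuition behind maintaining approximations dynamically instead of computing them directly each time the attribute set grows.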
Pages: 485-494
Page count: 10