A rough sets based characteristic relation approach for dynamic attribute generalization in data mining

被引:274
作者
Li, Tianrui [1 ]
Ruan, Da
Geert, Wets
Song, Jing
Xu, Yang
机构
[1] SW Jiaotong Univ, Dept Math, Chengdu 610031, Peoples R China
[2] CEN SCK, Belgian Nucl Res Ctr, B-2400 Mol, Belgium
[3] Univ Ghent, Dept Appl Math & Comp Sci, B-9000 Ghent, Belgium
[4] Univ Hasselt, Dept Appl Econ Sci, B-3590 Diepenbeek, Belgium
基金
高等学校博士学科点专项科研基金; 中国国家自然科学基金;
关键词
rough sets; knowledge discovery; data mining; incomplete information systems;
D O I
10.1016/j.knosys.2007.01.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Any attribute set in an information system may be evolving in time when new information arrives. Approximations of a concept by rough set theory need updating for data mining or other related tasks. For incremental updating approximations of a concept, methods using the tolerance relation and similarity relation have been previously studied in literature. The characteristic relation-based rough sets approach provides more informative results than the tolerance-and-similarity relation based approach. In this paper, an attribute generalization and its relation to feature selection and feature extraction are firstly discussed. Then, a new approach for incrementally updating approximations of a concept is presented under the characteristic relation-based rough sets. Finally, the approach of direct computation of rough set approximations and the proposed approach of dynamic maintenance of rough set approximations are employed for performance comparison. An extensive experimental evaluation on a large soybean database from MLC shows that the proposed approach effectively handles a dynamic attribute generalization in data mining. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:485 / 494
页数:10
相关论文
共 32 条
  • [11] Grzymala-Busse JW, 2005, LECT NOTES COMPUT SC, V3700, P58
  • [12] Hall M., 1999, THESIS WAIKATO U NZ
  • [13] Learning rules from incomplete training examples by rough sets
    Hong, TP
    Tseng, LH
    Wang, SL
    [J]. EXPERT SYSTEMS WITH APPLICATIONS, 2002, 22 (04) : 285 - 293
  • [14] KITTLER J, 1986, HDB PATTERN RECOGNIT, P203
  • [15] Rough set approach to incomplete information systems
    Kryszkiewicz, M
    [J]. INFORMATION SCIENCES, 1998, 112 (1-4) : 39 - 49
  • [16] Rough set theory: A data mining tool for semiconductor manufacturing
    Kusiak, A
    [J]. IEEE TRANSACTIONS ON ELECTRONICS PACKAGING MANUFACTURING, 2001, 24 (01): : 44 - 50
  • [17] Decomposition in data mining: An industrial case study
    Kusiak, A
    [J]. IEEE TRANSACTIONS ON ELECTRONICS PACKAGING MANUFACTURING, 2000, 23 (04): : 345 - 354
  • [18] LI T, 2003, INT C MACH LEARN CYB, P1678
  • [19] Li Tianrui, 2000, Journal of Southwest Jiaotong University (English Edition), V8, P69
  • [20] Lingras PJ, 1998, J AM SOC INFORM SCI, V49, P415, DOI 10.1002/(SICI)1097-4571(19980415)49:5<415::AID-ASI4>3.0.CO