A rough sets based characteristic relation approach for dynamic attribute generalization in data mining

被引:274
作者
Li, Tianrui [1 ]
Ruan, Da
Geert, Wets
Song, Jing
Xu, Yang
机构
[1] SW Jiaotong Univ, Dept Math, Chengdu 610031, Peoples R China
[2] CEN SCK, Belgian Nucl Res Ctr, B-2400 Mol, Belgium
[3] Univ Ghent, Dept Appl Math & Comp Sci, B-9000 Ghent, Belgium
[4] Univ Hasselt, Dept Appl Econ Sci, B-3590 Diepenbeek, Belgium
基金
高等学校博士学科点专项科研基金; 中国国家自然科学基金;
关键词
rough sets; knowledge discovery; data mining; incomplete information systems;
D O I
10.1016/j.knosys.2007.01.002
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Any attribute set in an information system may be evolving in time when new information arrives. Approximations of a concept by rough set theory need updating for data mining or other related tasks. For incremental updating approximations of a concept, methods using the tolerance relation and similarity relation have been previously studied in literature. The characteristic relation-based rough sets approach provides more informative results than the tolerance-and-similarity relation based approach. In this paper, an attribute generalization and its relation to feature selection and feature extraction are firstly discussed. Then, a new approach for incrementally updating approximations of a concept is presented under the characteristic relation-based rough sets. Finally, the approach of direct computation of rough set approximations and the proposed approach of dynamic maintenance of rough set approximations are employed for performance comparison. An extensive experimental evaluation on a large soybean database from MLC shows that the proposed approach effectively handles a dynamic attribute generalization in data mining. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:485 / 494
页数:10
相关论文
共 32 条
  • [21] 2-Z
  • [22] LIU Q, 1999, J COMPUTER RES DEV, V36, P800
  • [23] ROUGH SETS
    PAWLAK, Z
    [J]. INTERNATIONAL JOURNAL OF COMPUTER & INFORMATION SCIENCES, 1982, 11 (05): : 341 - 356
  • [24] Pawlak Z., 1991, Rough sets: Theoretical aspects of reasoning about data, DOI DOI 10.1007/978-94-011-3534-4
  • [25] Pearson R. K., 2006, ACM SIGKDD EXPLORATI, V8, P83
  • [26] Classification of meteorological volumetric radar data using rough set methods
    Peters, JF
    Suraj, Z
    Shan, S
    Ramanna, S
    Pedrycz, W
    Pizzi, N
    [J]. PATTERN RECOGNITION LETTERS, 2003, 24 (06) : 911 - 920
  • [27] A comparative study of fuzzy rough sets
    Radzikowska, Anna Maria
    Kerre, Etienne E.
    [J]. FUZZY SETS AND SYSTEMS, 2002, 126 (02) : 137 - 155
  • [28] Incomplete information tables and rough classification
    Stefanowski, J
    Tsoukiàs, A
    [J]. COMPUTATIONAL INTELLIGENCE, 2001, 17 (03) : 545 - 566
  • [29] Stefanowski J, 1999, LECT NOTES ARTIF INT, V1711, P73
  • [30] Mining diagnostic rules from clinical databases using rough sets and medical diagnostic model
    Tsumoto, S
    [J]. INFORMATION SCIENCES, 2004, 162 (02) : 65 - 80