An Efficient Algorithm for Mining Closed Frequent Itemsets in Data Streams

被引:10
作者
Ao, Fujiang [1 ]
Du, Jing [2 ]
Yan, Yuejin [2 ]
Liu, Baohong [1 ]
Huang, Kedi [1 ]
机构
[1] Natl Univ Def Technol, Sch Mech Engn & Automat, Changsha 410073, Hunan, Peoples R China
[2] Natl Univ Def Technol, Sch Comp Sci, Changsha 410073, Peoples R China
来源
8TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY WORKSHOPS: CIT WORKSHOPS 2008, PROCEEDINGS | 2008年
关键词
D O I
10.1109/CIT.2008.Workshops.52
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining closed frequent itemsets in the sliding window is one of important topics of data streams mining. In this paper, we propose a novel algorithm, FPCFI-DS, which mines closed frequent itemsets in the sliding window of data streams efficiently, and maintains the precise closed frequent itemsets in the current window at any time. The algorithm uses a single-pass lexicographical-order FP-Tree-based algorithm with mixed item ordering policy to mine the closed frequent itemsets in the first window, and introduces a novel updating approach to process the sliding of window. The experimental results show that FPCFI-DS performs better than the state-of-the-art algorithm Moment in terms of both the time and space efficiencies, especially for dense dataset or low minimum support.
引用
收藏
页码:37 / +
页数:2
相关论文
共 15 条
  • [1] Agrawal R., 1994, Proceedings of the 20th International Conference on Very Large Data Bases. VLDB'94, P487
  • [2] AO F, 2007, P MLDM 07 LEIPZ GERM, P479
  • [3] Babcock B., 2002, Proceedings of the Twenty-First ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems (PODS), P1, DOI DOI 10.1145/543613.543615
  • [4] MAFIA: A maximal frequent itemset algorithm for transactional databases
    Burdick, D
    Calimlim, M
    Gehrke, J
    [J]. 17TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2001, : 443 - 452
  • [5] Chang JH, 2003, P 9 ACM SIGKDD INT C, P487, DOI DOI 10.1145/956750.956807
  • [6] Moment: Maintaining closed frequent itemsets over a stream sliding window
    Chi, Y
    Wang, HX
    Yu, PS
    Muntz, RR
    [J]. FOURTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2004, : 59 - 66
  • [7] GRAHNE G, 2003, P FIMI 03 MELB FLOR
  • [8] Jiang N., 2006, KDD, P592
  • [9] LI H, 2006, P IWMESD 06 HONG KON, P672
  • [10] Online mining (recently) maximal frequent itemsets over data streams
    Li, HF
    Lee, SY
    Shan, MK
    [J]. 15th International Workshop on Research Issues in Data Engineering: Stream Data Mining and Applications, Proceedings, 2005, : 11 - 18