Incremental updates of closed frequent itemsets over continuous data streams

Cited by: 36
Authors
Li, Hua-Fu [1]
Ho, Chin-Chuan [2]
Lee, Suh-Yin [2]
Affiliations
[1] Kainan Univ, Dept Comp Sci, Tao Yuan, Taiwan
[2] Natl Chiao Tung Univ, Dept Comp Sci, Hsinchu, Taiwan
Keywords
Data mining; Data streams; Closed frequent itemsets; Single-pass mining; Incremental update;
DOI
10.1016/j.eswa.2007.12.054
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification code
140502 [Artificial intelligence];
Abstract
Online mining of closed frequent itemsets over streaming data is one of the most important issues in mining data streams. In this paper, we propose an efficient one-pass algorithm, NewMoment, to maintain the set of closed frequent itemsets in data streams with a transaction-sensitive sliding window. An effective bit-sequence representation of items is used in the proposed algorithm to reduce the time and memory needed to slide the windows. Experiments show that the proposed algorithm not only attains highly accurate mining results, but also runs significantly faster and consumes less memory than the existing algorithm Moment for mining closed frequent itemsets over recent data streams. (C) 2007 Elsevier Ltd. All rights reserved.
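The bit-sequence idea mentioned in the abstract can be illustrated with a minimal Python sketch. It is not the NewMoment algorithm itself: the abstract gives no implementation details, so the class name `BitSequenceWindow`, the bit ordering (newest transaction in bit 0), and the helper `is_closed` are all assumptions made for illustration. Each item keeps one integer whose bits mark the window positions where the item occurs; sliding the window is a shift, and the support of an itemset is the population count of the AND of its items' sequences.

```python
class BitSequenceWindow:
    """Illustrative sliding window over transactions (hypothetical helper)."""

    def __init__(self, window_size):
        self.w = window_size
        self.mask = (1 << window_size) - 1  # keeps only the w newest bits
        self.bits = {}                      # item -> bit-sequence as an int
        self.count = 0                      # transactions currently in window

    def append(self, transaction):
        # Slide: shift every sequence left by one; the mask drops the
        # oldest transaction once the window is full.
        for item in list(self.bits):
            shifted = (self.bits[item] << 1) & self.mask
            if shifted:
                self.bits[item] = shifted
            else:
                del self.bits[item]  # item no longer occurs in the window
        # Record the new transaction in bit 0.
        for item in set(transaction):
            self.bits[item] = self.bits.get(item, 0) | 1
        self.count = min(self.count + 1, self.w)

    def support(self, itemset):
        # Support = number of window positions where all items co-occur.
        items = list(itemset)
        if not items:
            return self.count
        acc = self.mask
        for item in items:
            acc &= self.bits.get(item, 0)
        return bin(acc).count("1")

    def is_closed(self, itemset):
        # An itemset is closed if no single-item extension has equal support.
        base = self.support(itemset)
        current = set(itemset)
        return all(
            self.support(current | {other}) < base
            for other in self.bits
            if other not in current
        )


window = BitSequenceWindow(window_size=3)
for t in [{"a", "b"}, {"a", "c"}, {"a", "b", "c"}, {"b"}]:
    window.append(t)
```

After the four appends, the window of size 3 holds only the last three transactions, so `window.support({"a"})` counts occurrences in those alone. Storing one integer per item keeps the slide step to a shift and a mask per item, which is the kind of saving the abstract attributes to the bit-sequence representation.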
Pages: 2451-2458
Number of pages: 8