estWin:: Online data stream mining of recent frequent itemsets by sliding window method

被引:44
作者
Chang, JH [1 ]
Lee, WS [1 ]
机构
[1] Yonsei Univ, Dept Comp Sci, Seoul 120749, South Korea
关键词
recent change of data streams; sliding window; data streams; delayed-insertion; itemset pruning;
D O I
10.1177/0165551505050785
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Knowledge embedded in a data stream is likely to be changed as time goes by. Identifying the recent change of the knowledge quickly can provide valuable information for the analysis of the data stream. However, most mining algorithms over a data stream are not able to extract the recent change of knowledge in a data stream adaptively. This is because the obsolete information of old data elements which may be no longer useful or possibly invalid at present is regarded as being as important as that of recent data elements. This paper proposes a sliding window method that finds recently frequent itemsets over a transactional online data stream adaptively. The size of a sliding window defines the desired life-time of information in a newly generated transaction. Consequently, only recently generated transactions in the range of the window are considered to find the recently frequent itemsets of a data stream.
引用
收藏
页码:76 / 90
页数:15
相关论文
共 23 条
[1]  
AGARWAL RC, 1997, P 6 ACM SIGKDD INT C, P108
[2]  
Agrawal R, 1994, P 20 INT C VER LARG, V1215, P487
[3]  
[Anonymous], P INT C VER LARG DAT
[4]   Borders: An efficient algorithm for association generation in dynamic databases [J].
Aumann, Y ;
Feldman, R ;
Lipshtat, O ;
Manilla, H .
JOURNAL OF INTELLIGENT INFORMATION SYSTEMS, 1999, 12 (01) :61-73
[5]  
Brin S., 1997, SIGMOD Record, V26, P255, DOI [10.1145/253262.253327, 10.1145/253262.253325]
[6]  
Charikar M., 2002, P 29 INT C AUT LANG, P693, DOI 10.1007/3-540-45465-9_59
[7]   Maintenance of discovered association rules in large databases: Art incremental updating technique [J].
Cheung, DW ;
Han, JW ;
Ng, VT ;
Wong, CY .
PROCEEDINGS OF THE TWELFTH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, 1996, :106-114
[8]  
CHEUNG DW, 1997, P 5 INT C DAT SYST A, P185
[9]  
Cormode G., 2003, ACM Transactions on Database Systems (TODS), P296, DOI DOI 10.1145/1061318.1061325
[10]  
Datar M, 2002, SIAM PROC S, P635