Efficient Mining of High Utility Patterns over Data Streams with a Sliding Window Method

被引:7
作者
Ahmed, Chowdhury Farhan [1 ]
Tanbeer, Syed Khairuzzaman [1 ]
Jeong, Byeong-Soo [1 ]
机构
[1] Kyung Hee Univ, Database Lab, Dept Comp Engn, Youngin Is 446701, Kyunggi Do, South Korea
来源
SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL-DISTRIBUTED COMPUTING 2010 | 2010年 / 295卷
关键词
ITEMSET UTILITIES; FREQUENT; TREE; ALGORITHM;
D O I
10.1007/978-3-642-13265-0_8
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
High utility pattern (HUP) mining over data streams has become a challenging research issue in data mining. The existing sliding window-based HUP mining algorithms over stream data suffer from the level-wise candidate generation-and-test problem. Therefore, they need a large amount of execution time and memory. Moreover, their data structures are not suitable for interactive mining. To solve these problems of the existing algorithms, in this paper, we propose a new tree structure, called HUS-tree (High Utility Stream tree) and a novel algorithm, called HUPMS (HOP Mining over Stream data), for sliding window-based HUP mining over data streams. By capturing the important information of the stream data into an HUS-tree, our HUPMS algorithm can mine all the HUPs in the current window with a pattern growth approach. Moreover, HUS-tree is very efficient for interactive mining. Extensive performance analyses show that our algorithm significantly outperforms the existing sliding window-based HUP mining algorithms.
引用
收藏
页码:99 / 113
页数:15
相关论文
共 20 条
[1]
Agrawal R., 1994, VLDB 1994, P487
[2]
[Anonymous], FREQUENT ITEMSET MIN
[3]
estWin:: Online data stream mining of recent frequent itemsets by sliding window method [J].
Chang, JH ;
Lee, WS .
JOURNAL OF INFORMATION SCIENCE, 2005, 31 (02) :76-90
[4]
An efficient algorithm for mining temporal high utility itemsets from data streams [J].
Chu, Chun-Jung ;
Tseng, Vincent S. ;
Liang, Tyne .
JOURNAL OF SYSTEMS AND SOFTWARE, 2008, 81 (07) :1105-1117
[5]
CTU-Mine: An efficient high utility itemset mining algorithm using the pattern growth approach [J].
Erwin, Alva ;
Gopalan, Raj P. ;
Achuthan, N. R. .
2007 CIT: 7TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, PROCEEDINGS, 2007, :71-+
[6]
Fast algorithms for frequent itemset mining using FP-trees [J].
Grahne, G ;
Zhu, JF .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (10) :1347-1362
[7]
Mining frequent patterns without candidate generation: A frequent-pattern tree approach [J].
Han, JW ;
Pei, J ;
Yin, YW ;
Mao, RY .
DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 8 (01) :53-87
[8]
CanTree: a canonical-order tree for incremental frequent-pattern mining [J].
Leung, Carson Kai-Sang ;
Khan, Quamrul I. ;
Li, Zhan ;
Hoque, Tariqul .
KNOWLEDGE AND INFORMATION SYSTEMS, 2007, 11 (03) :287-311
[9]
Leung CKS, 2006, IEEE DATA MINING, P928
[10]
Fast and Memory Efficient Mining of High Utility Itemsets in Data Streams [J].
Li, Hua-Fu ;
Huang, Hsin-Yun ;
Chen, Yi-Cheng ;
Liu, Yu-Jiun ;
Lee, Suh-Yin .
ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, :881-+