Efficient Mining of High Utility Patterns over Data Streams with a Sliding Window Method

被引：7

作者：

Ahmed, Chowdhury Farhan ^{[1
]}

Tanbeer, Syed Khairuzzaman ^{[1
]}

Jeong, Byeong-Soo ^{[1
]}

机构：

[1] Kyung Hee Univ, Database Lab, Dept Comp Engn, Youngin Is 446701, Kyunggi Do, South Korea

来源：

SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCE, NETWORKING AND PARALLEL-DISTRIBUTED COMPUTING 2010 | 2010年 / 295卷

关键词：

ITEMSET UTILITIES; FREQUENT; TREE; ALGORITHM;

D O I：

10.1007/978-3-642-13265-0_8

中图分类号：

TP18 [人工智能理论];

学科分类号：

140502 [人工智能];

摘要：

High utility pattern (HUP) mining over data streams has become a challenging research issue in data mining. The existing sliding window-based HUP mining algorithms over stream data suffer from the level-wise candidate generation-and-test problem. Therefore, they need a large amount of execution time and memory. Moreover, their data structures are not suitable for interactive mining. To solve these problems of the existing algorithms, in this paper, we propose a new tree structure, called HUS-tree (High Utility Stream tree) and a novel algorithm, called HUPMS (HOP Mining over Stream data), for sliding window-based HUP mining over data streams. By capturing the important information of the stream data into an HUS-tree, our HUPMS algorithm can mine all the HUPs in the current window with a pattern growth approach. Moreover, HUS-tree is very efficient for interactive mining. Extensive performance analyses show that our algorithm significantly outperforms the existing sliding window-based HUP mining algorithms.

引用

页码：99 / 113

页数：15

共 20 条

[1]

Agrawal R., 1994, VLDB 1994, P487

[2]

[Anonymous], FREQUENT ITEMSET MIN

[3]

estWin:: Online data stream mining of recent frequent itemsets by sliding window method [J].

Chang, JH ;

Lee, WS .

JOURNAL OF INFORMATION SCIENCE, 2005, 31 (02) :76-90

[4]

An efficient algorithm for mining temporal high utility itemsets from data streams [J].

Chu, Chun-Jung ;

Tseng, Vincent S. ;

Liang, Tyne .

JOURNAL OF SYSTEMS AND SOFTWARE, 2008, 81 (07) :1105-1117

[5]

CTU-Mine: An efficient high utility itemset mining algorithm using the pattern growth approach [J].

Erwin, Alva ;

Gopalan, Raj P. ;

Achuthan, N. R. .

2007 CIT: 7TH IEEE INTERNATIONAL CONFERENCE ON COMPUTER AND INFORMATION TECHNOLOGY, PROCEEDINGS, 2007, :71-+

[6]

Fast algorithms for frequent itemset mining using FP-trees [J].

Grahne, G ;

Zhu, JF .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2005, 17 (10) :1347-1362

[7]

Mining frequent patterns without candidate generation: A frequent-pattern tree approach [J].

Han, JW ;

Pei, J ;

Yin, YW ;

Mao, RY .

DATA MINING AND KNOWLEDGE DISCOVERY, 2004, 8 (01) :53-87

[8]

CanTree: a canonical-order tree for incremental frequent-pattern mining [J].

Leung, Carson Kai-Sang ;

Khan, Quamrul I. ;

Li, Zhan ;

Hoque, Tariqul .

KNOWLEDGE AND INFORMATION SYSTEMS, 2007, 11 (03) :287-311

[9]

Leung CKS, 2006, IEEE DATA MINING, P928

[10]

Fast and Memory Efficient Mining of High Utility Itemsets in Data Streams [J].

Li, Hua-Fu ;

Huang, Hsin-Yun ;

Chen, Yi-Cheng ;

Liu, Yu-Jiun ;

Lee, Suh-Yin .

ICDM 2008: EIGHTH IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS, 2008, :881-+

← 1 2 →