Maintenance of fast updated frequent pattern trees for record deletion

被引:35
作者
Hong, Tzung-Pei [1 ,2 ]
Lin, Chun-Wei [3 ]
Wu, Yu-Lung [4 ]
机构
[1] Natl Univ Kaohsiung, Dept Comp Sci & Informat Engn, Kaohsiung, Taiwan
[2] Natl Sun Yat Sen Univ, Dept Comp Sci & Engn, Kaohsiung, Taiwan
[3] Natl Cheng Kung Univ, Dept Comp Sci & Informat Engn, Tainan 70101, Taiwan
[4] I Shou Hosp, Dept Informat Management, Kaohsiung, Taiwan
关键词
17;
D O I
10.1016/j.csda.2009.01.015
中图分类号
TP39 [计算机的应用];
学科分类号
080201 [机械制造及其自动化];
摘要
The Frequent-Pattern-tree (FP tree) is an efficient data structure for association-rule mining without generation of candidate itemsets. It was used to represent a database into a tree structure which stored only frequent items. It, however, needed to process all transactions in a batch way. In the past, Hong et al. thus proposed an efficient incremental mining algorithm for handling newly inserted transactions. In addition to record insertion, record deletion from databases is also commonly seen in real-applications. In this paper, we thus attempt to modify the FP-tree construction algorithm for efficiently handling deletion of records. A fast updated FP-tree (FUFP-tree) structure is used, which makes the tree update process become easier. An FUFP-tree maintenance algorithm for the deletion of records is also proposed for reducing the execution time in reconstructing the tree when records are deleted. Experimental results also show that the proposed FUFP-tree maintenance algorithm for deletion of records runs faster than the batch FP-tree construction algorithm for handling deleted records and generates nearly the same tree structure as the FP-tree algorithm. The proposed approach can thus achieve a good trade-off between execution time and tree complexity. (C) 2009 Elsevier B.V. All rights reserved.
引用
收藏
页码:2485 / 2499
页数:15
相关论文
共 17 条
[1]
[Anonymous], ACM SIGACT SIGMOD SI
[2]
[Anonymous], 21 INT C VER LARG DA
[3]
[Anonymous], 11 IEEE INT C DAT EN
[4]
[Anonymous], 3 INT C KNOWL DISC D
[5]
[Anonymous], 21 INT C VER LARG DA
[6]
[Anonymous], 1994, COMPLAINTS 20 INT CO
[7]
[Anonymous], 1993, P ACM SIGMOD INT C M, DOI DOI 10.1145/170035.170072
[8]
CHEUNG DW, 1996, 12 IEEE INT C DAT EN, P106
[9]
Ezeife C.I., 2002, Advances in Artificial Intelligence, V2338, P147
[10]
Han J., 2000, 2000 ACM SIGMOD INT