Efficient incremental maintenance of frequent patterns with FP-tree

被引:13
作者
Ma, XL [1 ]
Tong, YH
Tang, SW
Yang, DQ
机构
[1] Peking Univ, Sch Elect Engn & Comp Sci, Beijing 100871, Peoples R China
[2] Peking Univ, Natl Lab Machine Percept, Beijing 100871, Peoples R China
关键词
data mining; association rule mining; frequent pattern mining; incremental mining;
D O I
10.1007/BF02973451
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 ;
摘要
Mining frequent patterns has been studied popularly in data mining area. However, little work has been done on mining patterns when the database has an influx of fresh data constantly. In these dynamic scenarios, efficient maintenance of the discovered patterns is crucial. Most existing methods need to scan the entire database repeatedly, which is an obvious disadvantage. In this paper, an efficient incremental mining algorithm, Incremental-Mining (IM), is proposed for maintenance of the frequent patterns when new incremental data come. Based on the frequent pattern tree (FP-tree) structure, IM gives a way to make the most of the things from the previous mining process, and requires scanning the original data once at most. Furthermore, IM can identify directly the differential set of frequent patterns, which may be more informative to users. Moreover, IM can deal with changing thresholds as well as changing data, thus provide a full maintenance scheme. IM has been implemented and the performance study shows it outperforms three other incremental algorithms: FUP, DB-tree and re-running frequent pattern growth (FP-growth).
引用
收藏
页码:876 / 884
页数:9
相关论文
共 13 条
[1]  
Agrawal R, 1994, P 20 INT C VER LARG, V1215, P487
[2]  
[Anonymous], LNCS LNAI
[3]   Maintenance of discovered association rules in large databases: Art incremental updating technique [J].
Cheung, DW ;
Han, JW ;
Ng, VT ;
Wong, CY .
PROCEEDINGS OF THE TWELFTH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, 1996, :106-114
[4]  
CHEUNG DW, 1997, P 5 INT C DAT SYST A, P185
[5]  
Davey B. A., 1990, INTRO LATTICES ORDER
[6]   Maintaining discovered frequent itemsets: Cases for changeable database and support [J].
Du, XP ;
Tang, SW ;
Makinouchi, A .
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2003, 18 (05) :648-658
[7]  
Han H S, 2000, Korean J Ophthalmol, V14, P1
[8]  
LEE S, 1997, P 1997 SIGMOD WORKSH
[9]  
LIU J, 2001, LECT NOTES ARTIF INT, V2035, P406
[10]  
MA X, 2002, LECT NOTES COMPUTER, V2419, P80