Maintaining discovered frequent itemsets: Cases for changeable database and support

Times cited: 2
Authors
Du, XP [1]
Tang, SW
Makinouchi, A
Affiliations
[1] Peking Univ, Sch Elect Engn & Comp Sci, Beijing 100871, Peoples R China
[2] Beijing Univ Aeronaut & Astronaut, Coll Software, Beijing 100083, Peoples R China
[3] Kyushu Univ, Grad Sch Informat Sci, Fukuoka 8128581, Japan
Keywords
frequent itemset; association rule; algorithm; data mining; database
DOI
10.1007/BF02947125
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Mining frequent itemsets from large databases plays an essential role in many data mining tasks. It is equally important to maintain the discovered frequent itemsets when the database is updated. All maintenance algorithms proposed so far work only with a fixed minimum support, the same value used to obtain the discovered frequent itemsets; users cannot change the minimum support even if the new results are unsatisfactory. This paper proposes two complementary algorithms, FMP (First Maintaining Process) and RMP (Repeated Maintaining Process), to maintain discovered frequent itemsets when new transaction data are added to a transaction database. Both algorithms allow users to change the minimum support for the maintenance process. FMP is used for the first maintaining process; when its result is unsatisfactory, RMP is performed repeatedly until satisfactory results are obtained. The proposed algorithms reuse previous results to cut the cost of maintenance. Extensive experiments were conducted to assess their performance. The experimental results show that the proposed algorithms are very effective compared with previous mining and maintenance algorithms for maintaining discovered frequent itemsets.
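The maintenance problem described in the abstract can be illustrated with a minimal sketch. This is not the paper's FMP or RMP, whose details are not given in this record; it is a naive baseline that assumes support counts for all itemsets up to a small size were kept from the original database, so that only the newly added transactions need to be scanned and the minimum support can then be changed freely. The names `count_itemsets`, `frequent`, `old_db`, and `new_db` are hypothetical and chosen for illustration only.

```python
from collections import Counter
from itertools import combinations


def count_itemsets(transactions, max_size=2):
    """Count every itemset of size 1..max_size occurring in the transactions."""
    counts = Counter()
    for transaction in transactions:
        items = sorted(set(transaction))  # canonical order, duplicates removed
        for k in range(1, max_size + 1):
            for itemset in combinations(items, k):
                counts[itemset] += 1
    return counts


def frequent(counts, n_transactions, min_support):
    """Keep itemsets whose count reaches min_support * n_transactions."""
    threshold = min_support * n_transactions
    return {s: c for s, c in counts.items() if c >= threshold}


# Hypothetical toy data: the original database and the added transactions.
old_db = [["a", "b", "c"], ["a", "b"], ["a", "c"], ["b", "c"]]
new_db = [["a", "b"], ["a", "b", "d"]]

# Previous result: counts obtained from the original database.
old_counts = count_itemsets(old_db)

# Maintenance step: scan only the increment and merge with the stored counts.
updated = old_counts + count_itemsets(new_db)
n = len(old_db) + len(new_db)

# The minimum support can be changed without rescanning either database.
result_50 = frequent(updated, n, 0.5)
result_60 = frequent(updated, n, 0.6)
print(sorted(result_50))
print(sorted(result_60))
```

Storing counts for every candidate itemset is only feasible for tiny examples like this one; practical maintenance algorithms reuse a much smaller set of previous results, which is the cost the abstract says FMP and RMP aim to reduce.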
Pages: 648-658
Page count: 11