Mining association rules with multiple minimum supports: a new mining algorithm and a support tuning mechanism

被引:98
作者
Hu, Ya-Han [1 ]
Chen, Yen-Liang [1 ]
机构
[1] Natl Cent Univ, Dept Informat Management, Chungli 320, Taiwan
关键词
data mining; association rules; minimum supports; FP-tree;
D O I
10.1016/j.dss.2004.09.007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Mining association rules with multiple minimum supports is an important generalization of the association-rule-mining problem, which was recently proposed by Liu et al. Instead of setting a single minimum support threshold for all items, they allow users to specify multiple minimum supports to reflect the natures of the items, and an Apriori-based algorithm, named MSapriori, is developed to mine all frequent itemsets. In this paper, we study the same problem but with two additional improvements. First, we propose a FP-tree-like structure, MIS-tree, to store the crucial information about frequent patterns. Accordingly, an efficient MIS-tree-based algorithm, called the CFP-growth algorithm, is developed for mining all frequent itemsets. Second, since each item can have its own minimum support, it is very difficult for users to set the appropriate thresholds for all items at a time. In practice, users need to tune items' supports and run the mining algorithm repeatedly until a satisfactory end is reached. To speed up this time-consuming tuning process, an efficient algorithm which can maintain the MIS-tree structure without rescanning database is proposed. Experiments on both synthetic and real-life datasets show that our algorithms are much more efficient and scalable than the previous algorithm. (c) 2004 Elsevier B.V All rights reserved.
引用
收藏
页码:1 / 24
页数:24
相关论文
共 15 条
[1]  
Agrawal R, 1994, P 20 INT C VER LARG, V1215, P487
[2]  
[Anonymous], 2000, P 2000 ACM SIGMOD IN
[3]   Data mining: An overview from a database perspective [J].
Chen, MS ;
Han, JW ;
Yu, PS .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1996, 8 (06) :866-883
[4]   Maintenance of discovered association rules in large databases: Art incremental updating technique [J].
Cheung, DW ;
Han, JW ;
Ng, VT ;
Wong, CY .
PROCEEDINGS OF THE TWELFTH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, 1996, :106-114
[5]   Incremental mining of frequent patterns without candidate generation or support constraint [J].
Cheung, W ;
Zaïane, OR .
SEVENTH INTERNATIONAL DATABASE ENGINEERING AND APPLICATIONS SYMPOSIUM, PROCEEDINGS, 2003, :111-116
[6]  
Feldman R., 1997, P ACM SIGMOD WORKSH, P59
[7]  
Han J., 2012, Data Mining, P393, DOI [DOI 10.1016/B978-0-12-381479-1.00009-5, 10.1016/B978-0-12-381479-1.00001-0]
[8]  
Han Y., 2021, P420
[9]  
Lee W. K., 1998, P 4 INT C KNOWL DISC
[10]  
Mannila H., 1998, P 4 INT C KNOWL DISC