Mining dynamic association rules with comments

被引:16
作者
Shen, Bin [1 ,2 ]
Yao, Min [1 ]
Wu, Zhaohui [1 ]
Gao, Yunjun [1 ,3 ]
机构
[1] Zhejiang Univ, Coll Comp Sci & Technol, Hangzhou 310027, Peoples R China
[2] Zhejiang Univ, Ningbo Inst Technol, Ningbo 315100, Zhejiang, Peoples R China
[3] Singapore Management Univ, Sch Informat Syst, Singapore 178902, Singapore
基金
中国国家自然科学基金;
关键词
Dynamic association rule; Comment; Support vector; Confidence vector; Mining algorithm; PATTERNS;
D O I
10.1007/s10115-009-0207-1
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we study a new problem of mining dynamic association rules with comments (DAR-C for short). A DAR-C contains not only rule itself, but also its comments that specify when to apply the rule. In order to formalize this problem, we first present the expression method of candidate effective time slots, and then propose several definitions concerning DAR-C. Subsequently, two algorithms, namely ITS2 and EFP-Growth2, are developed for handling the problem of mining DAR-C. In particular, ITS2 is an improved two-stage dynamic association rule mining algorithm, while EFP-Growth2 is based on the EFP-tree structure and is suitable for mining high-density mass data. Extensive experimental results demonstrate that the efficiency and scalability of our proposed two algorithms (i.e., ITS2 and EFP-Growth2) on DAR-C mining tasks, and their practicability on real retail dataset.
引用
收藏
页码:73 / 98
页数:26
相关论文
共 30 条
[1]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[2]  
AGRAWAL R, 1995, PROC INT CONF DATA, P3, DOI 10.1109/ICDE.1995.380415
[3]  
Agrawal R., 1994, P 20 INT C VER LARG, P487, DOI DOI 10.5555/645920.672836
[4]  
Au WH, 2002, PROCEEDINGS OF THE 2002 IEEE INTERNATIONAL CONFERENCE ON FUZZY SYSTEMS, VOL 1 & 2, P890, DOI 10.1109/FUZZ.2002.1006622
[5]   Mining changes in association rules: a fuzzy approach [J].
Au, WH ;
Chan, KCC .
FUZZY SETS AND SYSTEMS, 2005, 149 (01) :87-104
[6]   MAFIA: A maximal frequent itemset algorithm for transactional databases [J].
Burdick, D ;
Calimlim, M ;
Gehrke, J .
17TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS, 2001, :443-452
[7]   A survey on algorithms for mining frequent itemsets over data streams [J].
Cheng, James ;
Ke, Yiping ;
Ng, Wilfred .
KNOWLEDGE AND INFORMATION SYSTEMS, 2008, 16 (01) :1-27
[8]   Maintenance of discovered association rules in large databases: Art incremental updating technique [J].
Cheung, DW ;
Han, JW ;
Ng, VT ;
Wong, CY .
PROCEEDINGS OF THE TWELFTH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, 1996, :106-114
[9]   Mining border descriptions of emerging patterns from dataset pairs [J].
Dong, GZ ;
Li, JY .
KNOWLEDGE AND INFORMATION SYSTEMS, 2005, 8 (02) :178-202
[10]  
Ganti V., 1999, Proceedings of the Eighteenth ACM SIGMOD-SIGACT-SIGART Symposium on Principles of Database Systems, P126, DOI 10.1145/303976.303989