Mining non-redundant association rules

被引:243
作者
Zaki, MJ [1 ]
机构
[1] Rensselaer Polytech Inst, Dept Comp Sci, Troy, NY 12180 USA
基金
美国国家科学基金会;
关键词
association rule mining; frequent closed itemsets; formal concept analysis;
D O I
10.1023/B:DAMI.0000040429.96086.c7
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The traditional association rule mining framework produces many redundant rules. The extent of redundancy is a lot larger than previously suspected. We present a new framework for associations based on the concept of closed frequent itemsets. The number of non-redundant rules produced by the new approach is exponentially (in the length of the longest frequent itemset) smaller than the rule set from the traditional approach. Experiments using several "hard" as well as "easy" real and synthetic databases confirm the utility of our framework in terms of reduction in the number of rules presented to the user, and in terms of time.
引用
收藏
页码:223 / 248
页数:26
相关论文
共 25 条
  • [1] [Anonymous], 3 INT C INF KNOWL MA
  • [2] [Anonymous], 1996, Advances in Knowledge Discovery and Data Mining, DOI DOI 10.1007/978-3-319-31750-2.
  • [3] [Anonymous], 1997, P 3 KDD
  • [4] [Anonymous], 1998, P 3 ACM SIGMOD WORKS
  • [5] [Anonymous], ACM SIGMOD C MAN DAT
  • [6] [Anonymous], 1991, MATHEMATIQUES INFORM
  • [7] Bastide Y., 2000, SIGKDD EXPLOR SPEC I, V2, P71
  • [8] BASTIDE Y, 2000, 1 INT C COMP LOG
  • [9] Bayardo R. J., 1999, 5 ACM SIGKDD INT C K
  • [10] BRIN S, 1997, ACM SIGMOD C MAN DAT