机构:
IBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USAIBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USA
Aggarwal, CG
[1
]
Yu, PS
论文数: 0引用数: 0
h-index: 0
机构:
IBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USAIBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USA
Yu, PS
[1
]
机构:
[1] IBM Corp, Thomas J Watson Res Ctr, Yorktown Heights, NY 10598 USA
来源:
14TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, PROCEEDINGS
|
1998年
关键词:
D O I:
10.1109/ICDE.1998.655803
中图分类号:
TP [自动化技术、计算机技术];
学科分类号:
0812 ;
摘要:
We have a large database consisting of sales transactions. We investigate the problem of online mining of association rules in this large database. We show how to preprocess the data effectively in order to make it suitable for repeated online queries. The preprocessing algorithm takes into account the storage space available. We store the preprocessed data in such a way that online processing may be done by applying a graph theoretic search algorithm whose complexity is proportional to the size of the output. This results in an online algorithm which is practically instantaneous in terms of response time. The algorithm also supports techniques for quickly discovering association rules from large itemsets. The algorithm is capable of finding rules with specific items in the antecedent or consequent. These association rules are presented in a compact form, eliminating redundancy We believe that the elimination of redundancy in online generation of association rules from large itemsets is interesting in its own right.