EXPEDITE: EXPress closED ITemset Enumeration

被引:4
作者
Aliberti, Giulio [1 ]
Colantonio, Alessandro [2 ]
Di Pietro, Roberto [1 ]
Mariani, Riccardo [2 ]
机构
[1] Roma Tre Univ, Dept Math & Phys, Rome, Italy
[2] Bay31 AG, Zug, Switzerland
关键词
Data mining; Knowledge discovery; Closed itemsets; Frequent itemsets; Algorithms; EFFICIENT ALGORITHM; FREQUENT PATTERNS; DISCOVERY;
D O I
10.1016/j.eswa.2014.12.031
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
In this paper, we introduce EXPress closED ITemset Enumeration (Exmorre), a new frequent closed itemset (FCI) miner designed to speed up the process of FM extraction from a dataset of transactions. Compared to the state of the art, ExProrre provides a CPU time saving of up to two orders of magnitude without compromising other dimensions of performance (e.g. memory). The reason why it is so fast is that EXPEDITE wastes less time in mining intermediate item sets that are discarded in later phases of the algorithm. More specifically, it cuts down the number of both duplicate FCIs those generated multiple times by the algorithm and infrequent itemsets those with low support or no supporting transactions. This feature, enjoyable by both sparse and dense datasets, is analytically motivated first, and then experimentally supported by extensive tests on real datasets. As a further contribution, we propose two alternative implementations of EXPEDITE that perform even better than the basic version, although they rely on particular features of the input dataset. (C) 2014 Elsevier Ltd. All rights reserved.
引用
收藏
页码:3933 / 3944
页数:12
相关论文
共 54 条
[1]
Aggarwal C. C., 2001, SIGKDD EXPLORATIONS, V3, P20
[2]
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[3]
AGRAWAL R, 1995, PROC INT CONF DATA, P3, DOI 10.1109/ICDE.1995.380415
[4]
Agrawal Rakesh., 1994, P 20 INT C VER LARG, P487
[5]
[Anonymous], LNAI
[6]
[Anonymous], 2003, P 9 ACM SIGKDD INT C, DOI [10.1145/956750.956779, DOI 10.1145/956750.956779]
[7]
[Anonymous], 2000, SIGKDD EXPLORATIONS
[8]
[Anonymous], 2000, ACM SIGMOD WORKSH RE
[9]
[Anonymous], CLA
[10]
DBV-Miner: A Dynamic Bit-Vector approach for fast mining frequent closed itemsets [J].
Bay Vo ;
Hong, Tzung-Pei ;
Bac Le .
EXPERT SYSTEMS WITH APPLICATIONS, 2012, 39 (08) :7196-7206