基于布尔矩阵和MapReduce的FP-Growth算法

被引:49
作者
陈兴蜀
张帅
童浩
崔晓靖
机构
[1] 四川大学计算机学院
关键词
数据挖掘; 关联规则; 布尔矩阵; MapReduce; FP-Growth算法;
D O I
暂无
中图分类号
TP311.13 [];
学科分类号
摘要
关联规则挖掘是数据挖掘的一个重要组成部分.为提高关联规则的挖掘效率,提出了一种基于布尔矩阵和MapReduce的FP-Growth算法(BPFP),分析了算法的时间和空间复杂度.该算法使用Hadoop框架和布尔矩阵以减少对事务数据的扫描次数,利用两次MapReduce来实现频繁项集的挖掘.在多个数据集上的实验结果表明,与原FP-Growth算法相比,BPFP算法具有更高的执行效率、更好的加速比.
引用
收藏
页码:135 / 141
页数:7
相关论文
共 8 条
[1]
A decentralized approach for mining event correlations in distributed system monitoring.[J].Gang Wu;Huxing Zhang;Meikang Qiu;Zhong Ming;Jiayin Li;Xiao Qin.Journal of Parallel and Distributed Computing.2012,
[2]
An Improved Association Rules Algorithm Based on Frequent Item Sets.[J].Yaqiong Jiang;Jun Wang.Procedia Engineering.2011,
[3]
An Improved Apriori Algorithm Based On the Boolean Matrix and Hadoop.[J].Honglie Yu;Jun Wen;Hongmei Wang;Li Jun.Procedia Engineering.2011,
[4]
A load-balanced distributed parallel mining algorithm.[J].Kun-Ming Yu;Jiayi Zhou;Tzung-Pei Hong;Jia-Ling Zhou.Expert Systems With Applications.2009, 3
[5]
Performance study of distributed Apriori-like frequent itemsets mining [J].
Aouad, Lamine M. ;
Le-Khac, Nhien-An ;
Kechadi, Tahar M. .
KNOWLEDGE AND INFORMATION SYSTEMS, 2010, 23 (01) :55-72
[6]
MapReduce.[J].Jeffrey Dean;Sanjay Ghemawat.Communications of the ACM.2008, 1
[7]
Computing in the clouds.[J].Aaron Weiss.netWorker.2007, 4
[8]
Discovering all most specific sentences [J].
Gunopulos, D ;
Khardon, R ;
Mannila, H ;
Saluja, S ;
Toivonen, H ;
Sharma, RS .
ACM TRANSACTIONS ON DATABASE SYSTEMS, 2003, 28 (02) :140-174