基于排序FP-树的频繁模式高效挖掘算法

被引：13

作者：

秦亮曦 ^{[1
]}

李谦 ^{[2
]}

史忠植 ^{[1
]}

机构：

[1] 中国科学院计算技术研究所智能信息处理重点实验室

[2] 中国科学院研究生院

来源：

计算机科学 | 2005年 / 04期

关键词：

数据挖掘; 关联规则; 频繁模式; 排序FP-树;

D O I：

暂无

中图分类号：

TP311.13 [];

学科分类号：

1201 ;

摘要：

FP-growth算法是目前较高效的频繁模式挖掘算法之一。在FP-growth算法中,FP-树及条件FP-树的构造和遍历占了算法绝大部分的时间,如果能减少这方面的时间,则有望进一步改善算法的效率。本文给出了一个频繁模式挖掘算法SFP-growth。算法通过将FP-树有序化及采用高效排序算法等措施来提高FP-树构造的效率,从而使算法达到较高的效率。实验结果表明,SFP-growth是一个高效的频繁模式挖掘算法,其性能优于Apriori、Eclat和FP-growtn算法。

引用

页码：31 / 33

页数：3

共 4 条

[1]

Mining frequent patterns without candidate generatioh. Han J,Pei J,Yin Y. Proc. of 2000 ACM-SIGMOD Int‘l Conf on Management of Data (SIGMOD‘00) . 2000

[2]

Mining association rules between sets of items in large database. Agrawal R,Imielinski T,Swami A. Proc. of 1993 ACM SIGMOD Conf. on Management of Data . 1993

[3]

Fast algorithms for mining association rules. Agrawal R,Srikant R. Proc. of the 20th Int‘l Conf. on Very Large DataBases （VLDB‘94） . 1994

[4]

New algorithms for fast discovery of association rules. Zaki M,Parthasarathy S,Ogihara M,Li W. Proc of the Third Intl. Conf. on Knowledge Discovery and Data Mining (KDD‘97) . 1997

← 1 →