Fast parallel association rule mining without candidacy generation

被引:69
作者
Zaïane, OR [1 ]
El-Hajj, M [1 ]
Lu, P [1 ]
机构
[1] Univ Alberta, Edmonton, AB T6G 2M7, Canada
来源
2001 IEEE INTERNATIONAL CONFERENCE ON DATA MINING, PROCEEDINGS | 2001年
关键词
D O I
10.1109/ICDM.2001.989600
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we introduce a new parallel algorithm MLFPT (Multiple Local Frequent Pattern Tree) [11] for parallel mining of frequent patterns, based on FP-growth mining, that uses only, two full I/O scans of the database, eliminating the need for generating the candidate items, and distributing the work fairly among processors. We have devised partitioning strategies at different stages of the mining process to achieve near optimal balancing between processors. We have successfully tested our algorithm on datasets larger than 50 million transactions.
引用
收藏
页码:665 / 668
页数:4
相关论文
共 13 条
[11]  
Zaiane O. R., 2000, Proceedings of 16th International Conference on Data Engineering (Cat. No.00CB37073), P461, DOI 10.1109/ICDE.2000.839445
[12]  
ZAIANE OR, 2001, TR0112 U ALB DEP COM
[13]   Parallel and distributed association mining: A survey [J].
Zaki, MJ .
IEEE CONCURRENCY, 1999, 7 (04) :14-25