Privacy preserving association rule mining over distributed databases using genetic algorithm

被引:4
作者
Bettahally N. Keshavamurthy
Asad M. Khan
Durga Toshniwal
机构
[1] Indian Institute of Technology,Department of Electronics and Computer Engineering
来源
Neural Computing and Applications | 2013年 / 22卷
关键词
Privacy preservation; Distributed databases; Genetic algorithm; Association rules mining; Trusted third-party computation;
D O I
暂无
中图分类号
学科分类号
摘要
Privacy preservation in distributed database is an active area of research. With the advancement of technology, massive amounts of data are continuously being collected and stored in distributed database applications. Indeed, temporal associations and correlations among items in large transactional datasets of distributed database can help in many business decision-making processes. One among them is mining frequent itemset and computing their association rules, which is a nontrivial issue. In a typical situation, multiple parties may wish to collaborate for extracting interesting global information such as frequent association, without revealing their respective data to each other. This may be particularly useful in applications such as retail market basket analysis, medical research, academic, etc. In the proposed work, we aim to find frequent items and to develop a global association rules model based on the genetic algorithm (GA). The GA is used due to its inherent features like robustness with respect to local maxima/minima and domain-independent nature for large space search technique to find exact or approximate solutions for optimization and search problems. For privacy preservation of the data, the concept of trusted third party with two offsets has been used. The data are first anonymized at local party end, and then, the aggregation and global association is done by the trusted third party. The proposed algorithms address various types of partitions such as horizontal, vertical, and arbitrary.
引用
收藏
页码:351 / 364
页数:13
相关论文
共 22 条
[1]  
Kantarcioglu M(2004)Privacy preserving distributed mining of association rules on horizontally partitioned data IEEE Trans Knowl Data Eng 16 6-396
[2]  
Clifton C(2006)Association rules mining in vertically partitioned databases IEEE Trans Knowl Data Eng 59 376-503
[3]  
Rozenberg B(2007)Privacy preserving algorithms for distributed mining of frequent itemsets Inf Sci 177 490-298
[4]  
Gudes E(2011)Multi objective association rule mining with genetic algorithm without specifying minimum support and minimum confidence Expert Syst Appl 38 288-13385
[5]  
Zhong S(2011)A novel evolutionary method to search interesting association rules by key words J Expert Syst Appl 38 13378-689
[6]  
Qodmanan HR(2005)ARMGA: Identifying interesting association rules with genetic algorithms Appl Artif Intell 19 677-103
[7]  
Nasiri M(2010)Review of classification using genetic programming Int J Eng Sci Technol 2 94-60
[8]  
Bidgoli BM(2009)An autonomous GP-based system for regression and classification problems Appl Soft Comput 9 49-1129
[9]  
Yang G(2012)Hiding co-occurring prioritized sensitive patterns over distributed progressive sequential data streams J Netw Comput Appl 35 1116-undefined
[10]  
Mabu S(undefined)undefined undefined undefined undefined-undefined