Privacy preserving data mining: A noise addition framework using a novel clustering technique

被引:85
作者
Islam, Md Zahidul [1 ]
Brankovic, Ljiljana [2 ]
机构
[1] Charles Sturt Univ, Sch Comp & Math, Wagga Wagga, NSW 2678, Australia
[2] Univ Newcastle, Sch Elect Engn & Comp Sci, Callaghan, NSW 2308, Australia
基金
澳大利亚研究理事会;
关键词
Privacy preserving data mining; Clustering categorical values; Data perturbation; Noise addition; Data mining; Privacy analysis; ATTRIBUTES;
D O I
10.1016/j.knosys.2011.05.011
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
During the whole process of data mining (from data collection to knowledge discovery) various sensitive data get exposed to several parties including data collectors, cleaners, preprocessors, miners and decision makers. The exposure of sensitive data can potentially lead to breach of individual privacy. Therefore, many privacy preserving techniques have been proposed recently. In this paper we present a framework that uses a few novel noise addition techniques for protecting individual privacy while maintaining a high data quality. We add noise to all attributes, both numerical and categorical. We present a novel technique for clustering categorical values and use it for noise addition purpose. A security analysis is also presented for measuring the security level of a data set. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:1214 / 1223
页数:10
相关论文
共 29 条
[1]
Agrawal R., 2000, Privacy-preserving data mining, P439, DOI DOI 10.1145/342009.335438
[2]
[Anonymous], P 9 ACM SIGKDD INT C
[3]
BRANKOVIC L, P AUSTR I COMP ETH C
[4]
Estivill-Castro V., 1999, P DAT WAR KNOWL DISC
[5]
Evfimievski A. V., P 8 ACM SIGKDD INT C, P217
[6]
FULE P, P 27 AUSTR COMP SCI, V26, P163
[7]
Ganti V., 1999, P ACM SIGKDD INT C K
[8]
Islam M. Z., 2008, THESIS U NEWCASTLE A
[9]
ISLAM MZ, LNCS, V6121
[10]
ISLAM MZ, P 3 INT IEEE C IND I