Information Security in Big Data: Privacy and Data Mining

被引:353
作者
Xu, Lei [1 ]
Jiang, Chunxiao [1 ]
Wang, Jian [1 ]
Yuan, Jian [1 ]
Ren, Yong [1 ]
机构
[1] Tsinghua Univ, Dept Elect Engn, Beijing 100084, Peoples R China
来源
IEEE ACCESS | 2014年 / 2卷
基金
高等学校博士学科点专项科研基金; 中国国家自然科学基金;
关键词
Data mining; sensitive information; privacy-preserving data mining; anonymization; provenance; game theory; privacy auction; anti-tracking; TRAJECTORY DATA; ANONYMIZATION; ANONYMITY;
D O I
10.1109/ACCESS.2014.2362522
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The growing popularity and development of data mining technologies bring serious threat to the security of individual's sensitive information. An emerging research topic in data mining, known as privacy-preserving data mining (PPDM), has been extensively studied in recent years. The basic idea of PPDM is to modify the data in such a way so as to perform data mining algorithms effectively without compromising the security of sensitive information contained in the data. Current studies of PPDM mainly focus on how to reduce the privacy risk brought by data mining operations, while in fact, unwanted disclosure of sensitive information may also happen in the process of data collecting, data publishing, and information (i.e., the data mining results) delivering. In this paper, we view the privacy issues related to data mining from a wider perspective and investigate various approaches that can help to protect sensitive information. In particular, we identify four different types of users involved in data mining applications, namely, data provider, data collector, data miner, and decision maker. For each type of user, we discuss his privacy concerns and the methods that can be adopted to protect sensitive information. We briefly introduce the basics of related research topics, review state-of-the-art approaches, and present some preliminary thoughts on future research directions. Besides exploring the privacy-preserving approaches for each type of user, we also review the game theoretical approaches, which are proposed for analyzing the interactions among different users in a data mining scenario, each of whom has his own valuation on the sensitive information. By differentiating the responsibilities of different users with respect to security of sensitive information, we would like to provide some useful insights into the study of PPDM.
引用
收藏
页码:1149 / 1176
页数:28
相关论文
共 121 条
[1]  
Abul O, 2008, PROC INT CONF DATA, P376, DOI 10.1109/ICDE.2008.4497446
[2]  
Adl R. Karimi, 2012, NEGOTIATION GAME EST
[3]  
Adl RK, 2012, LECT NOTES COMPUT SC, V7371, P74, DOI 10.1007/978-3-642-31540-4_6
[4]  
Aggarwal Charu C, 2008, A general survey of privacy-preserving data mining models and algorithms
[5]  
Agrawal R., 1993, SIGMOD Record, V22, P207, DOI 10.1145/170036.170072
[6]  
Agrawal R, 2000, SIGMOD REC, V29, P439, DOI 10.1145/335191.335438
[7]   Privacy-Preserving Two-Party k-Means Clustering in Malicious Model [J].
Akhter, Rahena ;
Chowdhury, Rownak Jahan ;
Emura, Keita ;
Islam, Tamzida ;
Rahman, Mohammad Shahriar ;
Rubaiyat, Nusrat .
2013 IEEE 37TH ANNUAL COMPUTER SOFTWARE AND APPLICATIONS CONFERENCE WORKSHOPS (COMPSACW), 2013, :121-126
[8]  
[Anonymous], J INFORM COMPUTATION
[9]  
[Anonymous], 2010, Computing, DOI DOI 10.1145/1749603.1749605
[10]  
[Anonymous], 2002, ACM Sigkdd Explorations Newsletter, DOI [10.1145/772862.772867, DOI 10.1145/772862.772867]