openPDS: Protecting the Privacy of Metadata through SafeAnswers

被引:118
作者
de Montjoye, Yves-Alexandre [1 ]
Shmueli, Erez [1 ]
Wang, Samuel S. [2 ]
Pentland, Alex Sandy [1 ]
机构
[1] MIT, Media Lab, Cambridge, MA 02139 USA
[2] MIT, DIG CSAIL, Cambridge, MA 02139 USA
来源
PLOS ONE | 2014年 / 9卷 / 07期
关键词
LOCATION PRIVACY; HUMAN MOBILITY; ANONYMIZATION; ANONYMITY; SYSTEMS; MODEL;
D O I
10.1371/journal.pone.0098790
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The rise of smartphones and web services made possible the large-scale collection of personal metadata. Information about individuals' location, phone call logs, or web-searches, is collected and used intensively by organizations and big data researchers. Metadata has however yet to realize its full potential. Privacy and legal concerns, as well as the lack of technical solutions for personal metadata management is preventing metadata from being shared and reconciled under the control of the individual. This lack of access and control is furthermore fueling growing concerns, as it prevents individuals from understanding and managing the risks associated with the collection and use of their data. Our contribution is two-fold: (1) we describe openPDS, a personal metadata management framework that allows individuals to collect, store, and give fine-grained access to their metadata to third parties. It has been implemented in two field studies; (2) we introduce and analyze SafeAnswers, a new and practical way of protecting the privacy of metadata at an individual level. SafeAnswers turns a hard anonymization problem into a more tractable security one. It allows services to ask questions whose answers are calculated against the metadata instead of trying to anonymize individuals' metadata. The dimensionality of the data shared with the services is reduced from high-dimensional metadata to low-dimensional answers that are less likely to be re-identifiable and to contain sensitive information. These answers can then be directly shared individually or in aggregate. openPDS and SafeAnswers provide a new way of dynamically protecting personal metadata, thereby supporting the creation of smart data-driven services and data science research.
引用
收藏
页数:9
相关论文
共 62 条
[31]  
Kearns M., 2007, ADV NEURAL INFORM PR, V20
[32]   SOCIAL SCIENCE Computational Social Science [J].
Lazer, David ;
Pentland, Alex ;
Adamic, Lada ;
Aral, Sinan ;
Barabasi, Albert-Laszlo ;
Brewer, Devon ;
Christakis, Nicholas ;
Contractor, Noshir ;
Fowler, James ;
Gutmann, Myron ;
Jebara, Tony ;
King, Gary ;
Macy, Michael ;
Roy, Deb ;
Van Alstyne, Marshall .
SCIENCE, 2009, 323 (5915) :721-723
[33]   Closeness: A New Privacy Measure for Data Publishing [J].
Li, Ninghui ;
Li, Tiancheng ;
Venkatasubramanian, Suresh .
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2010, 22 (07) :943-956
[34]  
Machanavajjhala A., 2006, ACM Trans. Knowl. Discov ery Data, P24, DOI [DOI 10.1109/ICDE.2006.1, DOI 10.1145/1217299.1217302]
[35]   Privacy in geo-social networks: proximity notification with untrusted service providers and curious buddies [J].
Mascetti, Sergio ;
Freni, Dario ;
Bettini, Claudio ;
Wang, X. Sean ;
Jajodia, Sushil .
VLDB JOURNAL, 2011, 20 (04) :541-566
[36]  
Mir Darakhshan J., 2013, 2013 IEEE International Conference on Big Data, P580, DOI 10.1109/BigData.2013.6691626
[37]  
Monreale A, 2010, TRANS DATA PRIV, V3, P91
[38]  
Mui L., 2002, Proceedings of the 35th Annual Hawaii International Conference on System Sciences, P2431, DOI 10.1109/HICSS.2002.994181
[39]  
Mun Min Y., 2010, CONEXT, P17
[40]   Robust de-anonymization of large sparse datasets [J].
Narayanan, Arvind ;
Shmatikov, Vitaly .
PROCEEDINGS OF THE 2008 IEEE SYMPOSIUM ON SECURITY AND PRIVACY, 2008, :111-125