Anonymizing Bipartite Graph Data using Safe Groupings

被引:63
作者
Cormode, Graham [1 ]
Srivastava, Divesh [1 ]
Yu, Ting [2 ]
Zhang, Qing [2 ]
机构
[1] AT&T Labs Res, Florham Pk, NJ 07932 USA
[2] North Carolina State Univ, Raleigh, NC 27695 USA
来源
PROCEEDINGS OF THE VLDB ENDOWMENT | 2008年 / 1卷 / 01期
关键词
D O I
10.14778/1453856.1453947
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Private data often comes in the form of associations between entities, such as customers and products bought from a pharmacy, which are naturally represented in the form of a large, sparse bipartite graph. As with tabular data, it is desirable to be able to publish anonymized versions of such data, to allow others to perform ad hoc analysis of aggregate graph properties. However, existing tabular anonymization techniques do not give useful or meaningful results when applied to graphs: small changes or masking of the edge structure can radically change aggregate graph properties. We introduce a new family of anonymizations, for bipartite graph data, called (k, l)-groupings. These groupings preserve the underlying graph structure perfectly, and instead anonymize the mapping from entities to nodes of the graph. We identify a class of "safe" (k, l)-groupings that have provable guarantees to resist a variety of attacks, and show how to find such safe groupings. We perform experiments on real bipartite graph data to study the utility of the anonymized version, and the impact of publishing alternate groupings of the same graph data. Our experiments demonstrate that (k, l)-groupings offer strong tradeoffs between privacy and utility.
引用
收藏
页码:833 / 844
页数:12
相关论文
共 18 条
[1]  
Backstrom L., 2007, WWW
[2]  
Bennett J., 2007, KDDCUP
[3]  
Garey M.R., 1979, COMPUTERS INTRACTABI
[4]  
Ghinita G, 2008, ICDE
[5]  
Hay M., 2007, 0719 U MASS AM
[6]  
Korolova A., 2008, ICDE
[7]  
Li N., 2007, ICDE
[8]  
Machanavajjhala A., 2006, IEEE ICDE
[9]  
Martin D., 2007, ICDE
[10]  
Nergiz M. E., 2007, ICDE