Microdata protection method through microaggregation: A systematic approach

被引:2
作者
Kabir, Md Enamul [1 ]
Wang, Hua [2 ]
机构
[1] School of Engineering and Information Technology, University of New South Wales at the Australian Defence Force Academy, Northcott Drive
[2] Department of Mathematics and Computing, University of Southern Queensland, Toowoomba
关键词
Disclosure control; K-anonymity; Microaggregation; Microdata protection; Privacy;
D O I
10.4304/jsw.7.11.2415-2423
中图分类号
学科分类号
摘要
Microdata protection in statistical databases has recently become a major societal concern and has been intensively studied in recent years. Statistical Disclosure Control (SDC) is often applied to statistical databases before they are released for public use. Microaggregation for SDC is a family of methods to protect microdata from individual identification. SDC seeks to protect microdata in such a way that can be published and mined without providing any private information that can be linked to specific individuals. Microaggregation works by partitioning the microdata into groups of at least k records and then replacing the records in each group with the centroid of the group. This paper presents a clustering-based microaggregation method to minimize the information loss. The proposed technique adopts to group similar records together in a systematic way and then anonymized with the centroid of each group individually. The structure of systematic clustering problem is defined and investigated and an algorithm of the proposed problem is developed. Experimental results show that our method attains a reasonable dominance with respect to both information loss and execution time than the most popular heuristic algorithm called Maximum Distance to Average Vector (MDAV). © 2012 ACADEMY PUBLISHER.
引用
收藏
页码:2415 / 2423
页数:8
相关论文
共 30 条
[1]  
Bezdek J.C., Pattern Recognition With Fuzzy Objective Function Algorithms, (1981)
[2]  
Cormen T.H., Leiserson C.E., Rivest R.L., Stein C., Introduction to Algorithms, (2001)
[3]  
Domingo-Ferrer J., Torra V., Privacy in data mining, Data Mining and Knowledge Discovery, 11, 2, pp. 117-119, (2005)
[4]  
Domingo-Ferrer J., Mateo-Sanz J., Practical dataoriented microaggregation for statistical disclosure control, IEEE Transactions On Knowledge and Data Engineering, 14, 1, pp. 189-201, (2002)
[5]  
Domingo-Ferrer J., Torra V., Extending microaggregation procedures using defuzzification methods for categorical variables, Proc. 1st International IEEE Symposium On Intelligent Systems, pp. 44-49, (2002)
[6]  
Domingo-Ferrer J., Torra V., Towards fuzzy c-means based microaggregation, Soft Methods in Probability, Statistics and Data Analysis, 16, pp. 289-294, (2002)
[7]  
Domingo-Ferrer J., Torra V., Fuzzy microaggregation for microdata protection, Journal of Advanced Computational Intelligence and Intelligent Informatics, 7, 2, pp. 153-159, (2003)
[8]  
Domingo-Ferrer J., Torra V., Ordinal, continuous and heterogeneous kanonymity through microaggregation, Data Mining and Knowledge Discovery, 11, 2, pp. 195-212, (2005)
[9]  
Domingo-Ferrer J., Martinez-Balleste A., Mateo-Sanz J.M., Sebe F., Efficient multivariate data-oriented microaggregation, The VLDB Journal, 15, 4, pp. 355-369, (2006)
[10]  
Domingo-Ferrer J., Sebe F., Solanas A., A polynomialtime approximation to optimal multivariate microaggregation, Computer and Mathematics with Applications, 55, 4, pp. 714-732, (2008)