An evaluation of the current state of genomic data privacy protection technology and a roadmap for the future

被引:69
作者
Malin, BA [1 ]
机构
[1] Carnegie Mellon Univ, Inst Software Res Int, Sch Comp Sci, Data Privacy Lab, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会; 美国安德鲁·梅隆基金会;
关键词
D O I
10.1197/jamia.M1603
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The incorporation of genomic data into personal medical records poses many challenges to patient privacy. In response, various systems for preserving patient privacy in shared genomic data have been developed and deployed. Although these systems de-identify the data by removing explicit identifiers (e.g., name, address, or Social Security number) and incorporate sound security design principles, they suffer from a lack of formal modeling of inferences learnable from shared data. This report evaluates the extent to which current protection systems are capable of withstanding a range of re-identification methods, including genotype-phenotype inferences, location-visit patterns, family structures, and dictionary attacks. For a comparative re-identification analysis, the systems are mapped to a common formalism. Although there is variation in susceptibility, each system is deficient in its protection capacity. The author discovers patterns of protection failure and discusses several of the reasons why these systems are susceptible. The analyses and discussion within provide guideposts for the development of next-generation protection methods amenable to formal proofs.
引用
收藏
页码:28 / 34
页数:7
相关论文
共 27 条
[1]  
Agrawal D., 2001, Proceedings of the 20th ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems, P247, DOI DOI 10.1145/375551.375602
[2]   Challenges for biomedical informatics and pharmacogenomics [J].
Altman, RB ;
Klein, TE .
ANNUAL REVIEW OF PHARMACOLOGY AND TOXICOLOGY, 2002, 42 :113-133
[3]  
Altman RB, 1998, J AM MED INFORM ASSN, P53
[4]  
[Anonymous], 2000, Privacy-preserving data mining, DOI DOI 10.1145/342009.335438
[5]  
Burnett Leslie, 2003, J Law Med, V10, P506
[6]   A proposed architecture and method of operation for improving the protection of privacy and confidentiality in disease registers [J].
Tim Churches .
BMC Medical Research Methodology, 3 (1) :1-13
[7]  
De Moor GJE, 2003, METHOD INFORM MED, V42, P148
[8]  
Domingo-Ferrer J., 2002, CONFIDENTIALITY DISC, P93
[9]  
DUNCAN GT, 1997, IRS METHODOLOGY REPO, V5, P223
[10]   Procedure to protect confidentiality of familial data in community genetics and genomic research [J].
Gaudet, D ;
Arsenault, S ;
Bélanger, C ;
Hudson, T ;
Perron, P ;
Bernard, M ;
Hamet, P .
CLINICAL GENETICS, 1999, 55 (04) :259-264