Privacy-Preserving Generative Deep Neural Networks Support Clinical Data Sharing

被引:236
作者
Beaulieu-Jones, Brett K. [1 ]
Wu, Zhiwei Steven [3 ]
Williams, Chris [2 ]
Lee, Ran [4 ]
Bhavnani, Sanjeev P. [5 ]
Byrd, James Brian [4 ]
Greene, Casey S. [2 ]
机构
[1] Univ Penn, Perelman Sch Med, Genom & Computat Biol Grad Grp, Philadelphia, PA 19104 USA
[2] Univ Penn, Perelman Sch Med, Dept Syst Pharmacol & Translat Therapeut, Philadelphia, PA 19104 USA
[3] Univ Minnesota, Comp Sci & Elect Engn Dept, Minneapolis, MN USA
[4] Univ Michigan, Sch Med, Dept Med, Div Cardiovasc Med, Ann Arbor, MI 48104 USA
[5] Scripps Res Inst, San Diego, CA USA
来源
CIRCULATION-CARDIOVASCULAR QUALITY AND OUTCOMES | 2019年 / 12卷 / 07期
基金
美国国家卫生研究院;
关键词
blood pressure; deep learning; machine learning; privacy; propensity score;
D O I
10.1161/CIRCOUTCOMES.118.005122
中图分类号
R5 [内科学];
学科分类号
100201 [内科学];
摘要
Background: Data sharing accelerates scientific progress but sharing individual-level data while preserving patient privacy presents a barrier. Methods and Results: Using pairs of deep neural networks, we generated simulated, synthetic participants that closely resemble participants of the SPRINT trial (Systolic Blood Pressure Trial). We showed that such paired networks can be trained with differential privacy, a formal privacy framework that limits the likelihood that queries of the synthetic participants' data could identify a real a participant in the trial. Machine learning predictors built on the synthetic population generalize to the original data set. This finding suggests that the synthetic data can be shared with others, enabling them to perform hypothesis-generating analyses as though they had the original trial data. Conclusions: Deep neural networks that generate synthetic participants facilitate secondary analyses and reproducible investigation of clinical data sets by enhancing data sharing while preserving participant privacy.
引用
收藏
页数:10
相关论文
共 22 条
[1]
Deep Learning with Differential Privacy [J].
Abadi, Martin ;
Chu, Andy ;
Goodfellow, Ian ;
McMahan, H. Brendan ;
Mironov, Ilya ;
Talwar, Kunal ;
Zhang, Li .
CCS'16: PROCEEDINGS OF THE 2016 ACM SIGSAC CONFERENCE ON COMPUTER AND COMMUNICATIONS SECURITY, 2016, :308-318
[2]
The US Census Bureau Adopts Differential Privacy [J].
Abowd, John M. .
KDD'18: PROCEEDINGS OF THE 24TH ACM SIGKDD INTERNATIONAL CONFERENCE ON KNOWLEDGE DISCOVERY & DATA MINING, 2018, :2867-2867
[3]
Aggarwal R., 2017, Assessing the Impact of Intensive Blood Pressure Management in Chronic Kidney Disease Patients
[4]
[Anonymous], TREAT INTENSIVELY NO
[5]
[Anonymous], PRIV LAW SCHOL C
[6]
[Anonymous], CENS SCI ADV M
[7]
[Anonymous], GANS SEQUENCES DISCR
[8]
[Anonymous], MODERNIZING DISCLOSU
[9]
[Anonymous], SOURCE CODE PRIVACY
[10]
[Anonymous], DEV VALIDATION CLIN