Data dissemination and disclosure limitation in a world without microdata: A risk-utility framework for remote access analysis servers

被引:54
作者
Gomatam, S [1 ]
Karr, AF
Reiter, JP
Sanil, AP
机构
[1] US FDA, Rockville, MD 20850 USA
[2] Natl Inst Stat Sci, Res Triangle Pk, NC 27709 USA
[3] Duke Univ, Inst Stat & Decis Sci, Durham, NC 27708 USA
[4] Natl Inst Stat Sci, Res Triangle Pk, NC 27709 USA
关键词
data confidentiality; data utility; disclosure risk; microdata; regression server; remote access server; statistical disclosure limitation;
D O I
10.1214/08834230500000043
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Given the public's ever-increasing concerns about data confidentiality, in the near future statistical agencies may be unable or unwilling, or even may not be legally allowed, to release any genuine microdata-data on individual units, such as individuals or establishments. In such a world, an alternative dissemination strategy is remote access analysis servers, to which users submit requests for output from statistical models fit using the data, but are not allowed access to the data themselves. Analysis servers, however, are not free from the risk of disclosure, especially in the face of multiple, interacting queries. We describe these risks and propose quantifiable measures of risk and data utility that can be used to specify which queries can be answered and with what output. The risk-utility framework is illustrated for regression models.
引用
收藏
页码:163 / 177
页数:15
相关论文
共 40 条
[1]   DATA-SWAPPING - A TECHNIQUE FOR DISCLOSURE CONTROL [J].
DALENIUS, T ;
REISS, SP .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 1982, 6 (01) :73-85
[2]   Preserving confidentiality of high-dimensional tabulated data: Statistical and computational issues [J].
Dobra, A ;
Karr, AF ;
Sanil, AP .
STATISTICS AND COMPUTING, 2003, 13 (04) :363-370
[3]   Software systems for tabular data releases [J].
Dobra, A ;
Karr, AF ;
Sanil, AP ;
Fienberg, SE .
INTERNATIONAL JOURNAL OF UNCERTAINTY FUZZINESS AND KNOWLEDGE-BASED SYSTEMS, 2002, 10 (05) :529-544
[4]  
DOMINGOFERRER J, 2001, UNECE WORKSH STAT DA
[5]   Optimal disclosure limitation strategy in statistical databases: Deterring tracker attacks through additive noise [J].
Duncan, GT ;
Mukherjee, S .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2000, 95 (451) :720-729
[6]  
DUNCAN GT, 1986, J AM STAT ASSOC, V81, P10, DOI 10.2307/2287959
[7]  
DUNCAN GT, 2002, UNPUB DISCLOSURE RIS
[8]  
DUNCAN GT, 1993, J OFF STAT, V9, P271
[9]  
*FED COMM STAT MET, 1994, REP STAT DISCL LIM M
[10]  
Fienberg SE., 1998, J OFF STAT, V14, P485