Software systems for tabular data releases

Cited by: 17
Authors
Dobra, A
Karr, AF
Sanil, AP
Fienberg, SE
Affiliations
[1] National Institute of Statistical Sciences, Research Triangle Park, NC 27709, USA
[2] Carnegie Mellon University, Department of Statistics, Pittsburgh, PA 15213, USA
Funding
National Science Foundation (USA)
DOI
10.1142/S0218488502001624
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
We describe two classes of software systems that release tabular summaries of an underlying database. Table servers respond to user queries for (marginal) sub-tables of the "full" table summarizing the entire database, and are characterized by dynamic assessment of disclosure risk in light of previously answered queries. Optimal tabular releases are static releases of sets of sub-tables, chosen to maximize the amount of information released, as given by a measure of data utility, subject to a constraint on disclosure risk. We discuss the underlying abstractions (primarily associated with the query space, as well as released and unreleasable sub-tables and frontiers), computational algorithms and issues, especially scalability, and prototype software implementations.
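The table-server idea in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the class name `ToyTableServer`, the `min_width` parameter, and the risk rule itself are assumptions made for illustration. The sketch assumes that releasing a set of marginals is "too risky" when some cell of the full table has an upper bound (the smallest released marginal count covering that cell) below `min_width`, with the trivial lower bound 0.

```python
class ToyTableServer:
    """Toy table server: answers marginal queries while tracking past releases.

    Hypothetical illustration only. The risk rule (upper bound on every
    full-table cell must be at least `min_width`) is an assumption.
    """

    def __init__(self, full_table, min_width):
        # full_table maps each cell (a tuple of category indices) to its count;
        # it must list every cell, including cells with count 0.
        self.full = dict(full_table)
        self.min_width = min_width
        self.released = {}  # tuple of variable indices -> marginal sub-table

    def marginal(self, variables):
        """Sum the full table over all variables not in `variables`."""
        out = {}
        for cell, count in self.full.items():
            key = tuple(cell[v] for v in variables)
            out[key] = out.get(key, 0) + count
        return out

    def upper_bounds(self, marginals):
        """Upper bound per full-table cell: the smallest covering marginal count."""
        return {
            cell: min(m[tuple(cell[v] for v in vs)] for vs, m in marginals.items())
            for cell in self.full
        }

    def query(self, variables):
        """Return the requested marginal, or None if releasing it is too risky."""
        variables = tuple(sorted(variables))
        if variables in self.released:
            return self.released[variables]
        # Assess risk against everything released so far plus the new request.
        candidate = dict(self.released)
        candidate[variables] = self.marginal(variables)
        if min(self.upper_bounds(candidate).values()) < self.min_width:
            return None  # refuse; the set of released tables is unchanged
        self.released = candidate
        return candidate[variables]


# A 2 x 2 x 2 example table with one small cell, (1, 1, 0).
full = {
    (0, 0, 0): 10, (0, 0, 1): 8, (0, 1, 0): 7, (0, 1, 1): 9,
    (1, 0, 0): 6, (1, 0, 1): 12, (1, 1, 0): 1, (1, 1, 1): 11,
}
server = ToyTableServer(full, min_width=5)
one_way = server.query([0])         # released
two_way = server.query([0, 1])      # released
refused = server.query([0, 1, 2])   # None: the full table would expose the count of 1
```

Because each decision depends on `self.released`, the same query can be answered or refused depending on what was released earlier, which is the dynamic aspect the abstract describes. A realistic system would use tighter bounds and a principled risk measure.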
Pages: 529-544
Number of pages: 16