Provenance as dependency analysis

Cited by: 29
Authors
Cheney, James [1]
Ahmed, Amal [2]
Acar, Umut A. [3]
Affiliations
[1] Univ Edinburgh, Lab Fdn Comp Sci, Edinburgh EH8 9AB, Midlothian, Scotland
[2] Indiana Univ, Sch Informat & Comp, Bloomington, IN 47405 USA
[3] Max Planck Inst Software Syst, D-67663 Kaiserslautern, Germany
Funding
UK Engineering and Physical Sciences Research Council (EPSRC);
Keywords
INFORMATION-FLOW; LINEAGE;
DOI
10.1017/S0960129511000211
Chinese Library Classification number
TP301 [Theory, methods];
Discipline classification code
081202;
Abstract
Provenance is information recording the source, derivation or history of some information. Provenance tracking has been studied in a variety of settings, particularly database management systems. However, although many candidate definitions of provenance have been proposed, the mathematical or semantic foundations of data provenance have received comparatively little attention. In this paper, we argue that dependency analysis techniques familiar from program analysis and program slicing provide a formal foundation for forms of provenance that are intended to show how (part of) the output of a query depends on (parts of) its input. We introduce a semantic characterisation of such dependency provenance for a core database query language, show that minimal dependency provenance is not computable, and provide dynamic and static approximation techniques. We also discuss preliminary implementation experience with using dependency provenance to compute data slices, or summaries of the parts of the input relevant to a given part of the output.
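As a rough illustration of the idea in the abstract, the following minimal sketch tracks dependencies dynamically through one small query. It is not the paper's calculus or algorithm: the `annotate` and `sum_where` helpers, the `(table, row, column)` labels, and the `employees` data are all hypothetical names chosen for this example. Each input field carries a label, the query propagates label sets alongside values, and the label set attached to the result acts as a data slice.

```python
def annotate(rows, table):
    """Pair every field with a singleton set holding its (table, row, column) label."""
    return [
        {col: (val, frozenset({(table, i, col)})) for col, val in row.items()}
        for i, row in enumerate(rows)
    ]


def sum_where(rows, test_col, wanted, sum_col):
    """Sum `sum_col` over rows whose `test_col` equals `wanted`.

    Returns the total together with the set of input labels it may depend on.
    """
    total, deps = 0, frozenset()
    for row in rows:
        test_val, test_deps = row[test_col]
        # Every tested field matters, even in rejected rows: changing it
        # could change which rows contribute to the sum.
        deps |= test_deps
        if test_val == wanted:
            val, val_deps = row[sum_col]
            total += val
            deps |= val_deps
    return total, deps


# Hypothetical input table, used only for this illustration.
employees = [
    {"name": "ann", "dept": "db", "salary": 50},
    {"name": "bob", "dept": "pl", "salary": 40},
    {"name": "cat", "dept": "db", "salary": 60},
]

total, deps = sum_where(annotate(employees, "emp"), "dept", "db", "salary")

# Changing a field outside `deps` (here, bob's salary) leaves the result unchanged.
changed = [dict(row) for row in employees]
changed[1]["salary"] = 999
total_changed, _ = sum_where(annotate(changed, "emp"), "dept", "db", "salary")
```

Here `deps` contains all three `dept` fields plus the `salary` fields of the two matching rows; no `name` field and not the rejected row's `salary`. This is a conservative approximation in the spirit of the abstract: it may include more input locations than strictly necessary, but any field it omits cannot affect the result of this particular query.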
Pages: 1301-1337
Page count: 37