noWorkflow: Capturing and Analyzing Provenance of Scripts

Cited by: 65
Authors
Murta, Leonardo [1]
Braganholo, Vanessa [1]
Chirigati, Fernando [2]
Koop, David [2]
Freire, Juliana [2]
Affiliations
[1] Univ Fed Fluminense, Niteroi, RJ, Brazil
[2] NYU, New York, NY USA
Source
PROVENANCE AND ANNOTATION OF DATA AND PROCESSES (IPAW 2014) | 2015 / Vol. 8628
DOI
10.1007/978-3-319-16462-5_6
Chinese Library Classification
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
We propose noWorkflow, a tool that transparently captures provenance of scripts and enables reproducibility. Unlike existing approaches, noWorkflow is non-intrusive and does not require users to change the way they work: users need not wrap their experiments in scientific workflow systems, install version control systems, or instrument their scripts. The tool leverages Software Engineering techniques, such as abstract syntax tree analysis, reflection, and profiling, to collect different types of provenance, including detailed information about the underlying libraries. We describe how noWorkflow captures multiple kinds of provenance and the different classes of analyses it supports: graph-based visualization; differencing over provenance trails; and inference queries.
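The abstract names abstract syntax tree analysis and profiling as capture techniques. The following is a minimal sketch of how those two ideas can be combined using only the Python standard library; it is not noWorkflow's implementation or API. The sample script and the function names `definition_provenance` and `execution_provenance` are hypothetical, chosen for illustration.

```python
import ast
import sys

# Hypothetical script whose provenance we want to capture without editing it.
SCRIPT = '''
def load(path):
    return [1, 2, 3]

def total(values):
    return sum(values)

result = total(load("data.csv"))
'''


def definition_provenance(source):
    """Static pass: map each function defined in the script to the names it calls."""
    tree = ast.parse(source)
    functions = {}
    for node in ast.walk(tree):
        if isinstance(node, ast.FunctionDef):
            functions[node.name] = sorted({
                child.func.id
                for child in ast.walk(node)
                if isinstance(child, ast.Call) and isinstance(child.func, ast.Name)
            })
    return functions


def execution_provenance(source, filename="<script>"):
    """Dynamic pass: record function activations of the script via a profile hook."""
    activations = []

    def profiler(frame, event, arg):
        code = frame.f_code
        # Keep only Python-level calls that originate from the script itself,
        # skipping the module-level frame.
        if event == "call" and code.co_filename == filename and code.co_name != "<module>":
            activations.append(code.co_name)

    compiled = compile(source, filename, "exec")
    sys.setprofile(profiler)
    try:
        exec(compiled, {"__name__": "__main__"})
    finally:
        sys.setprofile(None)  # always remove the hook, even if the script fails
    return activations


if __name__ == "__main__":
    print(definition_provenance(SCRIPT))
    print(execution_provenance(SCRIPT))
```

The static pass needs no execution, while the dynamic pass observes the run from outside, so the script itself stays unmodified; this mirrors the non-intrusive goal stated in the abstract, under the assumptions noted above.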
Pages: 71-83
Page count: 13