Automatic capture and efficient storage of e-Science experiment provenance

被引:35
作者
Barga, Roger S. [1 ]
Digiampietri, Luciano A. [2 ]
机构
[1] Microsoft Res, Redmond, WA 98052 USA
[2] Univ Estadual Campinas, Inst Comp, Sao Paulo, Brazil
关键词
provenance model; automatic provenance capture; efficient provenance storage;
D O I
10.1002/cpe.1235
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
For the first provenance challenge, we introduce a layered model to represent workflow provenance that allows navigation from an abstract model of the experiment to instance data collected during a specific experiment run. We outline modest extensions to a commercial workflow engine so it will automatically capture provenance at workflow runtime. We also present an approach to store this provenance data in a relational database. Finally, we demonstrate how core provenance queries in the challenge can be expressed in SQL and discuss the merits of our layered representation. Copyright (C) 2007 John Wiley & Sons, Ltd.
引用
收藏
页码:419 / 429
页数:11
相关论文
共 11 条
[1]  
ANDREW P, 2006, PRESENTING WINDOWS W
[2]  
BOWERS S, 2007, CONCURRENCY COMPUTAT
[3]  
KIM J, 2007, CONCURENCY COMPUTATI
[4]  
KRENEK A, 2007, CONCURRENCY COMPUTAT
[6]  
MILES S, 2007, PRACTICE EXPERIENCE
[7]  
Pastorello GZ, 2005, LECT NOTES COMPUT SC, V3534, P100
[8]  
SCHEIDEGGER C, 2007, CONCURRENCY COMPUTAT
[9]  
ZHAO J, 2007, CONCURRENCY COMPUTAT
[10]  
[No title captured]