Small Is Beautiful Summarizing Scientific Workflows Using Semantic Annotations

被引:11
作者
Alper, Pinar [1 ]
Belhajjame, Khalid [1 ]
Goble, Carole [1 ]
Karagoz, Pinar [2 ]
机构
[1] Univ Manchester, Sch Comp Sci, Manchester, Lancs, England
[2] METU, Dept Comp Engn, Ankara, Turkey
来源
2013 IEEE INTERNATIONAL CONGRESS ON BIG DATA | 2013年
基金
英国工程与自然科学研究理事会;
关键词
Scientific Workflow; Annotation; Rule-Based Summarization; Motif; PROVENANCE; VIEWS;
D O I
10.1109/BigData.Congress.2013.49
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Scientific Workflows have become the workhorse of BigData analytics for scientists. As well as being repeatable and optimizable pipelines that bring together datasets and analysis tools, workflows make-up an important part of the provenance of data generated from their execution. By faithfully capturing all stages in the analysis, workflows play a critical part in building up the audit-trail (a.k.a. provenance) meta-data for derived datasets and contributes to the veracity of results. Provenance is essential for reporting results, reporting the method followed, and adapting to changes in the datasets or tools. These functions, however, are hampered by the complexity of workflows and consequently the complexity of data-trails generated from their instrumented execution. In this paper we propose the generation of workflow description summaries in order to tackle workflow complexity. We elaborate reduction primitives for summarizing workflows, and show how primitives, as building blocks, can be used in conjunction with semantic workflow annotations to encode different summarization strategies. We report on the effectiveness of the method through experimental evaluation using real-world workflows from the Taverna system.
引用
收藏
页码:318 / 325
页数:8
相关论文
共 10 条
  • [1] Provenance Browser: Displaying and Querying Scientific Workflow Provenance Graphs
    Anand, Manish Kumar
    Bowers, Shawn
    Ludaescher, Bertram
    [J]. 26TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING ICDE 2010, 2010, : 1201 - 1204
  • [2] Belhajjame K., 2012, SEPUBLICA
  • [3] Querying and managing provenance through user views in scientific workflows
    Biton, Olivier
    Cohen-Boulakia, Sarah
    Davidson, Susan B.
    Hara, Carmern S.
    [J]. 2008 IEEE 24TH INTERNATIONAL CONFERENCE ON DATA ENGINEERING, VOLS 1-3, 2008, : 1072 - +
  • [4] Cadenhead T., 2011, P 16 ACM S ACC CONTR, P93
  • [5] Cheung K, 2006, LECT NOTES COMPUT SC, V4273, P215
  • [6] Copeland M., 2012, ICBO
  • [7] Davidson S.B., 2008, SIGMOD
  • [8] Dey Saumen C., 2011, Scientific and Statistical Database Management. Proceedings 23rd International Conference, SSDBM 2011, P225, DOI 10.1007/978-3-642-22351-8_13
  • [9] Garijo D., 2012, ESCIENCE
  • [10] Hull D, 2004, AKT WORKSH SEM WEB S