Workflows and e-Science: An overview of workflow system features and capabilities

被引:472
作者
Deelman, Ewa [2 ]
Gannon, Dennis [3 ]
Shields, Matthew [1 ]
Taylor, Ian [1 ]
机构
[1] Cardiff Univ, Sch Comp Sci, Cardiff, S Glam, Wales
[2] Univ So Calif, Inst Informat Sci, Los Angeles, CA 90089 USA
[3] Indiana Univ, Dept Comp Sci, Bloomington, IN 47405 USA
来源
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE | 2009年 / 25卷 / 05期
基金
英国科学技术设施理事会; 美国国家科学基金会;
关键词
Scientific workflow; Grid computing; Computation; Web services; Distributed computing; Distributed systems; Cyberinfrastructure; Automation of scientific processes; e-Science;
D O I
10.1016/j.future.2008.06.012
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Scientific workflow systems have become a necessary tool for many applications, enabling the composition and execution of complex analysis on distributed resources. Today there are many workflow systems, often with overlapping functionality. A key issue for potential users of workflow systems is the need to be able to compare the capabilities of the various available tools. There can be confusion about system functionality and the tools are often selected without a proper functional analysis. In this paper we extract a taxonomy of features from the way scientists make use of existing workflow systems and we illustrate this feature set by providing some examples taken from existing workflow systems. The taxonomy provides end users with a mechanism by which they can assess the suitability of workflow in general and how they might use these features to make an informed choice about which workflow system Would be a good choice for their particular application. Crown Copyright (C) 2008 Published by Elsevier B.V. All rights reserved.
引用
收藏
页码:528 / 540
页数:13
相关论文
共 91 条
[1]   Web services composition for distributed data mining [J].
Ali, AS ;
Rana, OF ;
Taylor, IJ .
2005 INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING WORKSHOPS, PROCEEDINGS, 2005, :11-18
[2]  
ALLAN R, P UK E SCI ALL HANDS
[3]   Enabling applications on the grid: A Gridlab overview [J].
Allen, G ;
Davis, K ;
Dolkas, KN ;
Doulamis, ND ;
Goodale, T ;
Kielmann, T ;
Merzky, A ;
Nabrzyski, J ;
Pukacki, J ;
Radke, T ;
Russell, M ;
Seidel, E ;
Shalf, J ;
Taylor, I .
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2003, 17 (04) :449-466
[4]   The cactus worm: Experiments with dynamic resource discovery and allocation in a grid environment [J].
Allen, G ;
Angulo, D ;
Foster, I ;
Lanfermann, G ;
Liu, C ;
Radke, T ;
Seidel, E ;
Shalf, J .
INTERNATIONAL JOURNAL OF HIGH PERFORMANCE COMPUTING APPLICATIONS, 2001, 15 (04) :345-358
[5]  
Alt M, 2006, LECT NOTES COMPUT SC, V3911, P715
[6]  
Altintas I, 2004, 16TH INTERNATIONAL CONFERENCE ON SCIENTIFIC AND STATISTICAL DATABASE MANAGEMENT, PROCEEDINGS, P423
[7]  
ANDREWS T, 2001, BUSINESS PROCESS EXE
[8]  
[Anonymous], WEB SERV DESCR LANG
[9]  
[Anonymous], 2006, Proc. Special Interest Group on Management of Data Conf. (SIGMOD '06), DOI [10.1145/1142473.1142574, DOI 10.1145/1142473.1142574]
[10]  
[Anonymous], 2010, The myExperiment Project workflows