Search, Adapt, and Reuse: The Future of Scientific Workflows

Cited by: 31
Authors
Cohen-Boulakia, Sarah [1]
Leser, Ulf [2]
Affiliations
[1] Univ Paris 11, CNRS, UMR 8623, AMIB INRIA Saclay, Paris, France
[2] Univ Berlin, D-10099 Berlin, Germany
Keywords
Scientific Workflow Systems; Workflow Management; Scientific data; Data Analysis; GENE-EXPRESSION; FEATURES;
DOI
10.1145/2034863.2034865
Chinese Library Classification (CLC) number
TP [Automation technology, computer technology]
Discipline classification code
0812
Abstract
Over the last few years, a number of scientific workflow management systems (SciWFM) have been brought to a state of maturity that should permit their usage in a production-style environment. This is especially true for the Life Sciences, but SciWFM also attract considerable attention in fields like geophysics or climate research. These developments, accompanied by the growing availability of analytical tools wrapped as (web) services, were driven by a series of very interesting promises: end users will be empowered to develop their own pipelines; reuse of services will be enhanced by easier integration into custom workflows; the time necessary for developing analysis pipelines will decrease; etc. But despite all efforts, SciWFM have not yet found widespread acceptance in their intended audience. In this paper, we argue that a wider adoption of SciWFM will only be achieved if the focus of research and development is shifted from methods for developing and running workflows to searching, adapting, and reusing existing workflows. Only by this shift can SciWFM reach the mass of domain scientists who actually perform scientific analysis but have little interest in developing workflows themselves. To this end, SciWFM need to be combined with community-wide workflow repositories allowing users to find solutions for their scientific needs (coded as a workflow). In this vision paper, we show how and where such developments have already started and highlight new research questions arising.
Pages: 6-16
Number of pages: 11