Mapping abstract complex workflows onto grid environments

被引:32
作者
Ewa Deelman
James Blythe
Yolanda Gil
Carl Kesselman
Gaurang Mehta
Karan Vahi
Kent Blackburn
Albert Lazzarini
Adam Arbree
Richard Cavanaugh
Scott Koranda
机构
[1] Information Sciences Institute, University of Southern California, Marina Del Rey
[2] California Institute of Technology, Pasadena, CA 9112
[3] Department of Physics, University of Florida, Gainesville
[4] Department of Physics, University of Wisconsin, Milwaukee, WI 53211, Milwaukee 1900, East Kenwood Blvd
基金
美国国家科学基金会;
关键词
Complex applications; Planning; Reliability; Workflow management;
D O I
10.1023/A:1024000426962
中图分类号
学科分类号
摘要
In this paper we address the problem of automatically generating job workflows for the Grid. These workflows describe the execution of a complex application built from individual application components. In our work we have developed two workflow generators: the first (the Concrete Workflow Generator CWG) maps an abstract workflow defined in terms of application-level components to the set of available Grid resources. The second generator (Abstract and Concrete Workflow Generator, ACWG) takes a wider perspective and not only performs the abstract to concrete mapping but also enables the construction of the abstract workflow based on the available components. This system operates in the application domain and chooses application components based on the application metadata attributes. We describe our current ACWG based on AI planning technologies and outline how these technologies can play a crucial role in developing complex application workflows in Grid environments. Although our work is preliminary, CWG has already been used to map high energy physics applications onto the Grid. In one particular experiment, a set of production runs lasted 7 days and resulted in the generation of 167,500 events by 678 jobs. Additionally, ACWG was used to map gravitational physics workflows, with hundreds of nodes onto the available resources, resulting in 975 tasks, 1365 data transfers and 975 output files produced. © 2003 Kluwer Academic Publishers.
引用
收藏
页码:25 / 39
页数:14
相关论文
共 46 条
  • [21] Foster I., Kesselman C., Et al., The Anatomy of the Grid: Enabling Scalable Virtual Organizations, International Journal of High Performance Computing Applications, 15, pp. 200-222, (2001)
  • [22] Foster I., Voeckler J., Et al., Chimera: A Virtual Data System for Representing, Querying, and Automating Data Derivation, Presented At Scientific and Statistical Database Management, (2002)
  • [23] Foster I., Voeckler J., Et al., Chimera: A Virtual Data system for Representing, Querying, and Automating data Derivation, Presented At 14th International Conference On Scientific and Statistical Database Management (SSDBM 2002), (2002)
  • [24] Foster I., Kesselman C., Et al., Grid Services for Distributed System Integration, Computer, 35, (2002)
  • [25] Foster I., Kesselman C., Et al., The Physiology of the Grid: An Open Grid Services Architecture For Distributed Systems Integration, (2002)
  • [26] Frey J., Tannenbaum T., Et al., Condor-G: A Computation Management Agent for Multi-Institutional Grids, Cluster Computing, 5, pp. 237-246, (2002)
  • [27] Giacomini F., Prelz F., Definition of Architecture, Technical Plan and Evaluation Criteria For Scheduling, Resource Management, Security and Job Description, (2001)
  • [28] Gil Y., Blythe J., PLANET: A Shareable and Reusable Ontology for Representing Plans, Presented At AAAI Workshop On Representational Issues For Real-World Planning Systems, (2000)
  • [29] Hammond K.J., Case-Based Planning: An Integrated Theory of Planning, Learning and Memory, (1986)
  • [30] Holtman K., CMS Data Grid System Overview and Requirements, (2001)