Analysis of scheduling and replica Optimisation strategies for data grids using optorsim

被引:7
作者
Cameron D.G. [1 ]
Millar A.P. [1 ]
Nicholson C. [1 ]
Carvajal-Schiaffino R. [2 ]
Stockinger K. [3 ]
Zini F. [2 ]
机构
[1] University of Glasgow, Glasgow
[2] ITC-irst, 38050 Povo (Trento)
[3] CERN, European Organization for Nuclear Research
关键词
Data grid; Data replication; Optimisation; Scheduling; Simulator;
D O I
10.1007/s10723-004-6040-6
中图分类号
学科分类号
摘要
Many current international scientific projects are based on large scale applications that are both computationally complex and require the management of large amounts of distributed data. Grid computing is fast emerging as the solution to the problems posed by these applications. To evaluate the impact of resource optimisation algorithms, simulation of the Grid environment can be used to achieve important performance results before any algorithms are deployed on the Grid. In this paper, we study the effects of various job scheduling and data replication strategies and compare them in a variety of Grid scenarios using several performance metrics. We use the Grid simulator OptorSim, and base our simulations on a world-wide Grid testbed for data intensive high energy physics experiments. Our results show that scheduling algorithms which take into account both the file access cost of jobs and the workload of computing resources are the most effective at optimising computing and storage resources as well as improving the job throughput. The results also show that, in most cases, the economybased replication strategies which we have developed improve the Grid performance under changing network loads. © 2004 Kluwer Academic Publishers.
引用
收藏
页码:57 / 69
页数:12
相关论文
共 25 条
[1]  
Arlitt M.F., Williamson C.L., Web ServerWorkload Characterization: The Search for Invariants, ACM Sigmetrics International Conference On Measurements and Modeling of Computer Systems
[2]  
Barford P., Crovella M., Generating Representative Web Workloads for Network and Server Performance Evaluation, ACM Sigmetrics International Conference On Measurements and Modeling of Computer Systems
[3]  
Bell W.H., Cameron D.G., Capozza L., Millar P., Stockinger K., Zini F., Design of a Replica Optimisation Framework, (2002)
[4]  
Bell W.H., Cameron D.G., Capozza L., Millar P., Stockinger K., Zini F., OptorSim - a Grid Simulator for Studying Dynamic Data Replication Strategies, International Journal of High Performance Computing Applications, 17, 4, (2003)
[5]  
Bell W.H., Cameron D.G., Carvajal-Schiaffino R., Millar P., Stockinger K., Zini F., Evaluation of an Economy-Based File Replication Strategy for a Data Grid, International Workshop On Agent Based Cluster and Grid Computing At CCGrid2003
[6]  
Breslau L., Et al., Web Caching and Zipf-like Distributions: Evidence and Implications, IEEE INFOCOM'99
[7]  
Buyya R., Murshed M., GridSim: A Toolkit for the Modeling and Simulation of Distributed Resource Management and Scheduling for Grid Computing, The Journal of Concurrency and Computation: Practice and Experience, pp. 1-32, (2002)
[8]  
Capozza L., Stockinger K., Zini F., Preliminary Evaluation of Revenue Prediction Functions For Economically-Effective File Replication, (2002)
[9]  
Crosby P., EDGSim
[10]  
Huffman B.T., Et al., The CDF/DO UK GridPP Project