Stork: Making data placement a first class citizen in the Grid

被引:76
作者
Kosar, T [1 ]
Livny, M [1 ]
机构
[1] Univ Wisconsin, Dept Comp Sci, Madison, WI 53706 USA
来源
24TH INTERNATIONAL CONFERENCE ON DISTRIBUTED COMPUTING SYSTEMS, PROCEEDINGS | 2004年
关键词
D O I
10.1109/ICDCS.2004.1281599
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
Todays scientific applications have huge data requirements which continue to increase drastically every year These data are generally accessed by many users from all across the the globe. This implies a major necessity to move huge amounts of data around wide area networks to complete the computation cycle, which brings with it the problem of efficient and reliable data placement. The current approach to solve this problem of data placement is either doing it manually, or employing simple scripts which do not have any automation or fault tolerance capabilities. Our goal is to make data placement activities first class citizens in the Grid just like the computational jobs. They will be queued, scheduled, monitored, managed, and even check-pointed. More importantly, it will be made sure that they complete successfully and without any human interaction. We also believe that data placement jobs should be treated difterently from computational jobs, since they may have different semantics and different characteristics. For this purpose, we have developed Stork, a scheduler for data placement activities in the Grid.
引用
收藏
页码:342 / 349
页数:8
相关论文
共 20 条
[1]  
Allcock B., 2001, IEEE MASS STOR C SAN
[2]  
[Anonymous], P 8 INT C DISTR COMP
[3]  
[Anonymous], 1998, P 7 IEEE INT S HIGH
[4]  
BARU C, 1998, P CASCON TOR CAN
[5]  
BIRD I, 2001, P 18 IEEE S MASS STO
[6]  
BUTLER M, 1998, P 40 CRAY US GROUP C
[7]  
*COND, 2003, DIR ACYCL GRAPH MAN
[8]  
DJORGOVSKI SG, 1988, WIDE FIELD SURVEYS C
[9]  
FENG W, 2003, HIGH PERFORMANCE TRA
[10]  
Foster I, 1999, GRID: BLUEPRINT FOR A NEW COMPUTING INFRASTRUCTURE, P259