Intelligent monitoring for adaptation in grid applications

Cited by: 13
Authors
Reed, DA [1]
Mendes, CL
Affiliations
[1] Univ N Carolina, Renaissance Comp Inst, Chapel Hill, NC 27599 USA
[2] Univ Illinois, Dept Comp Sci, Urbana, IL 61801 USA
Keywords
adaptive control; distributed computing; monitoring; parallel processing;
DOI
10.1109/JPROC.2004.840300
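Chinese Library Classification (CLC) number
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification code
0808; 0809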
Abstract
Grid applications access distributed, and often shared, resources. One consequence of this resource sharing is that measured application performance can vary widely and in unexpected ways. Determining the causes of poor performance, due to either anomalous application behavior or contention for shared resource use, and adapting to changing circumstances are critical to creation of robust Grid applications. Performance contracts and real-time adaptive control are two mechanisms to realize soft performance guarantees in Grid environments. Performance contracts formalize the relationship between application performance needs and resource capabilities. During execution, contract monitors use performance data to verify that expectations are met. When the contracted specifications are not satisfied, the system can choose to either adapt the application to available resources or reschedule the application on a new set of resources that can satisfy the original contract specifications. In this paper, we describe an infrastructure for Grid application contract development and monitoring. This infrastructure, based on the Autopilot toolkit, provides flexible and scalable tools to assess both application and system behavior.
Pages: 426-435
Page count: 10