Dynamic resource management on distributed systems using reconfigurable applications

Cited by: 16
Authors
Moreira, JE
Naik, VK
Affiliations
[1] IBM Research Division, Thomas J. Watson Research Center, Yorktown Heights, NY 10598
[2] Scalable Parallel Systems Department, IBM Thomas J. Watson Research Center
[3] University of Illinois, Urbana-Champaign, IL
[4] Servers Department, IBM Thomas J. Watson Research Center
[5] ICASE, NASA Langley Research Center
[6] Indian Institute of Technology, Madras
DOI
10.1147/rd.413.0303
Chinese Library Classification
TP3 [Computing technology, computer technology]
Discipline classification code
0812
Abstract
Efficient management of distributed resources, under conditions of unpredictable and varying workload, requires enforcement of dynamic resource management policies. Execution of such policies requires a relatively fine-grain control over the resources allocated to jobs in the system. Although this is a difficult task using conventional job management and program execution models, reconfigurable applications can be used to make it viable. With reconfigurable applications, it is possible to dynamically change, during the course of program execution, the number of concurrently executing tasks of an application as well as the resources allocated. Thus, reconfigurable applications can adapt to internal changes in resource requirements and to external changes affecting available resources. In this paper, we discuss dynamic management of resources on distributed systems with the help of reconfigurable applications. We first characterize reconfigurable parallel applications. We then present a new programming model for reconfigurable applications and the Distributed Resource Management System (DRMS), an integrated environment for the design, development, execution, and resource scheduling of reconfigurable applications. Experiments were conducted to verify the functionality and performance of application reconfiguration under DRMS. A detailed breakdown of the costs in reconfiguration is presented with respect to several different applications. Our results indicate that application reconfiguration is effective under DRMS and can be beneficial in improving individual application performance as well as overall system performance. We observe a significant reduction in average job response time and an improvement in overall system utilization.
Pages: 303-330
Page count: 28