Matching and scheduling algorithms for minimizing execution time and failure probability of applications in heterogeneous computing

被引:117
作者
Dogan, A [1 ]
Özgüner, F [1 ]
机构
[1] Ohio State Univ, Dept Elect Engn, Columbus, OH 43210 USA
关键词
matching and scheduling; precedence-constrained tasks; heterogeneous computing; reliability; articulation points and bridges; DLS algorithm;
D O I
10.1109/71.993209
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
In a heterogeneous distributed computing system, machine and network failures are inevitable and can have an adverse effect on applications executing on the system. To reduce the effect of failures on an application executing on a failure-prone system, matching and scheduling algorithms which minimize not only the execution time but also the probability of failure of the application must be devised. However, because of the conflicting requirements, it is not possible to minimize both of the objectives at the same time. Thus, the goal of this paper is to develop matching and scheduling algorithms which account for both the execution time and the reliability of the application. This goal is achieved by modifying an existing matching and scheduling algorithm. The reliability of resources is taken into account using an incremental cost function proposed in this paper and the new algorithm is referred to as the reliable dynamic level scheduling algorithm. The incremental cost function can be defined based on one of the three cost functions developed here. These cost functions are unique in the sense that they are not restricted to tree-based networks and a specific matching and scheduling algorithm. The simulation results confirm that the proposed incremental cost function can be incorporated into matching and scheduling algorithms to produce schedules where the effect of failures of machines and network resources on the execution of the application is reduced and the execution time of the application is minimized as well.
引用
收藏
页码:308 / 323
页数:16
相关论文
共 27 条
[1]  
[Anonymous], THESIS OHIO STATE U
[3]   Generational scheduling for dynamic task management in heterogeneous computing systems [J].
Carter, BR ;
Watson, DW ;
Freund, RF ;
Keith, E ;
Mirabile, F ;
Siegel, HJ .
INFORMATION SCIENCES, 1998, 106 (3-4) :219-236
[4]   A cut-based method for terminal-pair reliability [J].
Chen, YG ;
Yuang, MC .
IEEE TRANSACTIONS ON RELIABILITY, 1996, 45 (03) :413-416
[5]  
Cormen T.H., 1997, Introduction to Algorithms
[6]  
Dogan A, 2000, PROCEEDINGS OF THE INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED PROCESSING TECHNIQUES AND APPLICATIONS, VOLS I-V, P549
[7]  
DOGAN A, 2001, P INT PAR DISTR PROC
[8]   Globus: A metacomputing infrastructure toolkit [J].
Foster, I ;
Kesselman, C .
INTERNATIONAL JOURNAL OF SUPERCOMPUTER APPLICATIONS AND HIGH PERFORMANCE COMPUTING, 1997, 11 (02) :115-128
[9]  
FREUND RF, 1993, COMPUTER, V26, P13
[10]  
*INT WORK GROUP IN, 2000, FY2001 BLUE BOOK