Cost-efficient task scheduling for executing large programs in the cloud

被引:212
作者
Su, Sen [1 ]
Li, Jian [1 ]
Huang, Qingjia [1 ]
Huang, Xiao [1 ]
Shuang, Kai [1 ]
Wang, Jie [2 ]
机构
[1] Beijing Univ Posts & Telecommun, State Key Lab Networking & Switching Technol, Beijing 100088, Peoples R China
[2] Univ Massachusetts, Dept Comp Sci, Lowell, MA 01854 USA
基金
中国国家自然科学基金; 美国国家科学基金会;
关键词
Cloud computing; Cost-efficient scheduling; Parallel task scheduling; Directed acyclic graph; GRAPHS;
D O I
10.1016/j.parco.2013.03.002
中图分类号
TP301 [理论、方法];
学科分类号
080201 [机械制造及其自动化];
摘要
Executing a large program using clouds is a promising approach, as this class of programs may be decomposed into multiple sequences of tasks that can be executed on multiple virtual machines (VMS) in a cloud. Such sequences of tasks can be represented as a directed acyclic graph (DAG), where nodes are tasks and edges are precedence constraints between tasks. Cloud users pay for what their programs actually use according to the pricing models of the cloud providers. Early task scheduling algorithms are focused on minimizing make-span, without mechanisms to reduce the monetary cost incurred in the setting of clouds. We present a cost-efficient task-scheduling algorithm using two heuristic strategies. The first strategy dynamically maps tasks to the most cost-efficient VMs based on the concept of Pareto dominance. The second strategy, a complement to the first strategy, reduces the monetary costs of non-critical tasks. We carry out extensive numerical experiments on large DAGs generated at random as well as on real applications. The simulation results show that our algorithm can substantially reduce monetary costs while producing make-span as good as the best known task-scheduling algorithm can provide. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:177 / 188
页数:12
相关论文
共 32 条
[1]
[Anonymous], P 2010 ACM SIGMOD IN, DOI [DOI 10.1145/1807167.1807184, 10.1145/1807167.1807184]
[2]
[Anonymous], 2010, Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing
[3]
Armbrust M., 2009, Technical Report UCB/EECS-2009-28
[4]
Irnproving scheduling of tasks in a heterogeneous environment [J].
Bajaj, R ;
Agrawal, DP .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2004, 15 (02) :107-118
[5]
Dynamic scheduling of a batch of parallel task jobs on heterogeneous clusters [J].
Barbosa, Jorge G. ;
Moreira, Belmiro .
PARALLEL COMPUTING, 2011, 37 (08) :428-438
[6]
Bharathi Shishir., 2008, 3 WORKSHOP WORKFLOWS
[7]
DAG Scheduling Using a Lookahead Variant of the Heterogeneous Earliest Finish Time Algorithm [J].
Bittencourt, Luiz F. ;
Sakellariou, Rizos ;
Madeira, Edmundo R. M. .
PROCEEDINGS OF THE 18TH EUROMICRO CONFERENCE ON PARALLEL, DISTRIBUTED AND NETWORK-BASED PROCESSING, 2010, :27-34
[8]
Bozdag D., 2006, Proceedings. 20th International Parallel and Distributed Processing Symposium (IEEE Cat. No.06TH8860), DOI 10.1109/IPDPS.2006.1639389
[9]
Compaction of Schedules and a Two-Stage Approach for Duplication-Based DAG Scheduling [J].
Bozdag, Doruk ;
Oezguener, Fuesun ;
Catalyurek, Umit V. .
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2009, 20 (06) :857-871
[10]
A comparison of eleven static heuristics for mapping a class of independent tasks onto heterogeneous distributed computing systems [J].
Braun, TD ;
Siegel, HJ ;
Beck, N ;
Bölöni, LL ;
Maheswaran, M ;
Reuther, AI ;
Robertson, JP ;
Theys, MD ;
Yao, B ;
Hensgen, D ;
Freund, RF .
JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2001, 61 (06) :810-837