Scalability of parallel scientific applications on the cloud

被引:16
作者
Srirama, Satish Narayana [1 ]
Batrashev, Oleg [1 ]
Jakovits, Pelle [1 ]
Vainikko, Eero [1 ]
机构
[1] Univ Tartu, Inst Comp Sci, Distributed Syst Grp, Tartu, Estonia
关键词
Scientific computing; cloud computing; MapReduce; benchmarking; iterative solvers; parallel programming;
D O I
10.1155/2011/361854
中图分类号
TP31 [计算机软件];
学科分类号
081205 [计算机软件];
摘要
Cloud computing, with its promise of virtually infinite resources, seems to suit well in solving resource greedy scientific computing problems. To study the effects of moving parallel scientific applications onto the cloud, we deployed several benchmark applications like matrix-vector operations and NAS parallel benchmarks, and DOUG (Domain decomposition On Unstructured Grids) on the cloud. DOUG is an open source software package for parallel iterative solution of very large sparse systems of linear equations. The detailed analysis of DOUG on the cloud showed that parallel applications benefit a lot and scale reasonable on the cloud. We could also observe the limitations of the cloud and its comparison with cluster in terms of performance. However, for efficiently running the scientific applications on the cloud infrastructure, the applications must be reduced to frameworks that can successfully exploit the cloud resources, like the MapReduce framework. Several iterative and embarrassingly parallel algorithms are reduced to the MapReduce model and their performance is measured and analyzed. The analysis showed that Hadoop MapReduce has significant problems with iterative methods, while it suits well for embarrassingly parallel algorithms. Scientific computing often uses iterative methods to solve large problems. Thus, for scientific computing on the cloud, this paper raises the necessity for better frameworks or optimizations for MapReduce.
引用
收藏
页码:91 / 105
页数:15
相关论文
共 41 条
[1]
Amazon Inc, HIGH PERF COMP US AM
[2]
[Anonymous], OPENNEBULA OP SOURC
[3]
[Anonymous], 2009, CLOUDS BERKELEY VIEW
[4]
[Anonymous], MAPSCALE CLOUD ENV S
[5]
[Anonymous], P 2010 ACM SIGMOD IN, DOI [DOI 10.1145/1807167.1807184, 10.1145/1807167.1807184]
[6]
[Anonymous], AM EL COMP CLOUD AM
[7]
[Anonymous], 2004, Springer Texts in Statistics
[8]
[Anonymous], NAS Parallel Benchmarks
[9]
[Anonymous], HADOOP
[10]
[Anonymous], 1996, Iterative Methods for Sparse Linear Systems