Some useful strategies for unstructured edge-based solvers on shared memory machines

Cited by: 15
Authors
Aubry, R. [1]
Houzeaux, G. [1]
Vazquez, M. [1]
Cela, J. M. [1]
Affiliation
[1] BSC CNS, Barcelona 08034, Spain
Keywords
edge-based solver; shared memory machines; renumbering schemes; cc-NUMA; profile and bandwidth minimizer; RENUMBERING STRATEGIES; PERFORMANCE; MIGRATION; GRIDS
DOI
10.1002/nme.2973
Chinese Library Classification
T [Industrial Technology]
Discipline classification code
08
Abstract
Three strategies for shared memory parallel edge-based solvers are proposed. Each guarantees that nodes belonging to one thread are not accessed by other threads for vertex-centered discretizations (replace nodes by cells in the case of cell-centered discretizations). The algorithms reorder the edges into groups so that the parallelization takes place at the level of the edge loop, possibly through multiple passes; this loop constitutes the bulk of the work in an edge-based solver. The strategies are presented in increasing order of programming effort, and their performances are compared. Various renumbering algorithms are considered. Results and timings are given for a classical Computational Fluid Dynamics compressible edge-based solver and a Numerical Weather Prediction compressible dynamic solver for dry air, together with computational details that illustrate the efficiency of the proposed approach. The influence of the point renumbering on the final edge grouping and efficiency is also studied through numerical results. Copyright (C) 2010 John Wiley & Sons, Ltd.
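The idea of grouping edges so that concurrent work never touches the same node can be illustrated with a small sketch. The code below is not one of the three strategies from the paper, whose details the abstract does not give; it is a generic greedy edge coloring, swapped in for illustration. The function names `group_edges` and `is_conflict_free` and the five-edge mesh are hypothetical. Within each resulting group no two edges share a node, so the edges of one group could be distributed among threads in any way without write conflicts on nodal data, and the groups would then be processed one after another as separate passes.

```python
from collections import defaultdict


def group_edges(edges):
    """Greedy edge coloring (illustrative assumption, not the paper's method).

    Each edge goes into the lowest-numbered group in which neither of its
    endpoints has been used yet, so edges in one group share no node.
    """
    used = defaultdict(set)  # node -> indices of groups already touching it
    groups = []
    for a, b in edges:
        g = 0
        while g in used[a] or g in used[b]:
            g += 1
        if g == len(groups):
            groups.append([])  # open a new group (an extra pass)
        groups[g].append((a, b))
        used[a].add(g)
        used[b].add(g)
    return groups


def is_conflict_free(groups):
    """Check that no node appears twice within any single group."""
    for group in groups:
        seen = set()
        for a, b in group:
            if a in seen or b in seen:
                return False
            seen.update((a, b))
    return True


# Hypothetical mesh: a square (nodes 0..3) with one diagonal.
edges = [(0, 1), (1, 2), (2, 3), (3, 0), (0, 2)]
groups = group_edges(edges)
for index, group in enumerate(groups):
    print(f"pass {index}: {group}")
```

The number of groups, and how many edges end up in each, depends on the order in which nodes and edges are numbered, which is one plausible reason the point renumbering affects the final edge grouping as the abstract reports.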
Pages: 537-561
Number of pages: 25