Analysing astronomy algorithms for graphics processing units and beyond

被引:28
作者
Barsdell, B. R. [1 ]
Barnes, D. G. [1 ]
Fluke, C. J. [1 ]
机构
[1] Swinburne Univ Technol, Ctr Astrophys & Supercomp, Hawthorn, Vic 3122, Australia
关键词
gravitational lensing: micro; methods: data analysis; pulsars: general; IMPLEMENTATION; SIMULATIONS; PERFORMANCE; LENS;
D O I
10.1111/j.1365-2966.2010.17257.x
中图分类号
P1 [天文学];
学科分类号
0704 ;
摘要
Astronomy depends on ever-increasing computing power. Processor clock rates have plateaued, and increased performance is now appearing in the form of additional processor cores on a single chip. This poses significant challenges to the astronomy software community. Graphics processing units (GPUs), now capable of general-purpose computation, exemplify both the difficult learning curve and the significant speedups exhibited by massively parallel hardware architectures. We present a generalized approach to tackling this paradigm shift, based on the analysis of algorithms. We describe a small collection of foundation algorithms relevant to astronomy and explain how they may be used to ease the transition to massively parallel computing architectures. We demonstrate the effectiveness of our approach by applying it to four well-known astronomy problems: Hogbom clean, inverse ray-shooting for gravitational lensing, pulsar dedispersion and volume rendering. Algorithms with well-defined memory access patterns and high arithmetic intensity stand to receive the greatest performance boost from massively parallel architectures, while those that involve a significant amount of decision-making may struggle to take advantage of the available processing power.
引用
收藏
页码:1936 / 1944
页数:9
相关论文
共 28 条
  • [1] Amdahl, 1967, AFIPS C P, P483, DOI DOI 10.1145/1465482.1465560
  • [2] [Anonymous], 1965, ELECTRONICS
  • [3] [Anonymous], 2006, Tech. rep.
  • [4] [Anonymous], 1998, SORTING SEARCHING
  • [5] A View of the Parallel Computing Landscape
    Asanovic, Krste
    Bodik, Rastislav
    Demmel, James
    Keaveny, Tony
    Keutzer, Kurt
    Kubiatowicz, John
    Morgan, Nelson
    Patterson, David
    Sen, Koushik
    Wawrzynek, John
    Wessel, David
    Yelick, Katherine
    [J]. COMMUNICATIONS OF THE ACM, 2009, 52 (10) : 56 - 67
  • [6] Computational advances in gravitational microlensing: A comparison of CPU, GPU, and parallel, large data codes
    Bate, N. F.
    Fluke, C. J.
    Barsdell, B. R.
    Garsden, H.
    Lewis, G. F.
    [J]. NEW ASTRONOMY, 2010, 15 (08) : 726 - 734
  • [7] High performance direct gravitational N-body simulations on graphics processing units II:: An implementation in CUDA
    Belleman, Robert G.
    Bedorf, Jeroen
    Portegies Zwart, Simon
    [J]. NEW ASTRONOMY, 2008, 13 (02) : 103 - 112
  • [8] Programming parallel algorithms
    Blelloch, GE
    [J]. COMMUNICATIONS OF THE ACM, 1996, 39 (03) : 85 - 97
  • [9] Briggs D. S., 1995, PhD thesis
  • [10] A performance study of general-purpose applications on graphics processors using CUDA
    Che, Shuai
    Boyer, Michael
    Meng, Jiayuan
    Tarjan, David
    Sheaffer, Jeremy W.
    Skadron, Kevin
    [J]. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING, 2008, 68 (10) : 1370 - 1380