GPU-based iterative transmission reconstruction in 3D ultrasound computer tomography

被引:39
作者
Birk, Matthias [1 ,3 ]
Dapp, Robin [1 ,3 ]
Ruiter, N. V. [1 ,3 ]
Becker, J. [2 ,3 ]
机构
[1] Karlsruhe Inst Technol, Inst Data Proc & Elect IPE, D-76334 Eggenstein Leopoldshafen, Germany
[2] Karlsruhe Inst Technol, Inst Informat Proc Technol ITIV, D-76131 Karlsruhe, Germany
[3] Karlsruhe Inst Technol, D-76021 Karlsruhe, Germany
关键词
GPU; Ultrasound imaging; Application acceleration; SpMV; Numerical optimization; ALGORITHMS;
D O I
10.1016/j.jpdc.2013.09.007
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
As today's standard screening methods frequently fail to detect breast cancer before metastases have developed, early diagnosis is still a major challenge. With the promise of high-quality volume images, three-dimensional ultrasound computer tomography is likely to improve this situation, but has high computational needs. In this work, we investigate the acceleration of the ray-based transmission reconstruction by a GPU-based implementation of the iterative numerical optimization algorithm TVAL3. We identified the regular and transposed sparse-matrix-vector multiply as the performance limiting operations. For accelerated reconstruction we propose two different concepts and devise a hybrid scheme as optimal configuration. In addition we investigate multi-GPU scalability and derive the optimal number of devices for our two primary use-cases: a fast preview mode and a high-resolution mode. In order to achieve a fair estimation of the speedup, we compare our implementation to an optimized CPU version of the algorithm. Using our accelerated implementation we reconstructed a preview 3D volume with 24,576 unknowns, a voxel size of (8 mm)(3) and approximately 200,000 equations in 0.5 s. A high-resolution volume with 1,572,864 unknowns, a voxel size of (2mm)(3) and approximately 1.6 million equations was reconstructed in 23 s. This constitutes an acceleration of over one order of magnitude in comparison to the optimized CPU version. (C) 2013 Elsevier Inc. All rights reserved.
引用
收藏
页码:1730 / 1743
页数:14
相关论文
共 34 条
  • [1] [Anonymous], MSRTR201295
  • [2] [Anonymous], 2008, NVR2008004
  • [3] [Anonymous], 2012, CUDA C Best Practices Guide
  • [4] B\\aerentzen JA., 2001, On the implementation of fast marching methods for 3D lattices
  • [5] Bell N, 2009, PROCEEDINGS OF THE CONFERENCE ON HIGH PERFORMANCE COMPUTING NETWORKING, STORAGE AND ANALYSIS
  • [6] Birk M., 2012, J REAL-TIME IMAGE PR, P1
  • [7] GPU accelerated greedy algorithms for compressed sensing
    Blanchard J.D.
    Tanner J.
    [J]. Mathematical Programming Computation, 2013, 5 (3) : 267 - 304
  • [8] Sparse matrix solvers on the GPU:: Conjugate gradients and multigrid
    Bolz, J
    Farmer, I
    Grinspun, E
    Schröder, P
    [J]. ACM TRANSACTIONS ON GRAPHICS, 2003, 22 (03): : 917 - 924
  • [9] ALGORITHM FOR COMPUTER CONTROL OF A DIGITAL PLOTTER
    BRESENHAM, JE
    [J]. IBM SYSTEMS JOURNAL, 1965, 4 (01) : 25 - 30
  • [10] Robust uncertainty principles:: Exact signal reconstruction from highly incomplete frequency information
    Candès, EJ
    Romberg, J
    Tao, T
    [J]. IEEE TRANSACTIONS ON INFORMATION THEORY, 2006, 52 (02) : 489 - 509