Parallel implementation of the 2D discrete wavelet transform on Graphics Processing Units:: Filter Bank versus Lifting

被引:80
作者
Tenllado, Christian [1 ]
Setoain, Javier [1 ]
Prieto, Manuel [1 ]
Pinuel, Luis [1 ]
Tirado, Francisco [1 ]
机构
[1] Univ Complutense Madrid, Fac Ciencias Fis, Dept Comp Architecture, ArTeCS Grp, E-28040 Madrid, Spain
关键词
graphics processors; parallel processing; parallel algorithms; parallel and vector implementations; wavelets and fractals; SIMD processors; optimization; parallel discrete wavelet transform; lifting; filter bank; GPU; stream processors;
D O I
10.1109/TPDS.2007.70716
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The widespread usage of the discrete wavelet transform (DWT) has motivated the development of fast DWT algorithms and their tuning on all sorts of computer systems. Several studies have compared the performance of the most popular schemes, known as Filter Bank Scheme (FBS) and Lifting Scheme (LS), and have always concluded that LS is the most efficient option. However, there is no such study on streaming processors such as modern Graphics Processing Units (GPUs). Current trends have transformed these devices into powerful stream processors with enough flexibility to perform intensive and complex floating-point calculations. The opportunities opened up by these platforms, as well as the growing popularity of the DWT within the computer graphics field, make a new performance comparison of great practical interest. Our study indicates that FBS outperforms LS in current-generation GPUs. In our experiments, the actual FBS gains range between 10 percent and 140 percent, depending on the problem size and the type and length of the wavelet filter. Moreover, design trends suggest higher gains in future-generation GPUs.
引用
收藏
页码:299 / 310
页数:12
相关论文
共 49 条
[1]   A survey on lifting-based Discrete Wavelet Transform architectures [J].
Acharya, T ;
Chakrabarti, C .
JOURNAL OF VLSI SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2006, 42 (03) :321-339
[2]  
[Anonymous], P ACM SIGGRAPH EUROG
[3]   Image coding using wavelet transform [J].
Antonini, Marc ;
Barlaud, Michel ;
Mathieu, Pierre ;
Daubechies, Ingrid .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 1992, 1 (02) :205-220
[4]   Texture segmentation using wavelet transform [J].
Arivazhagan, S ;
Ganesan, L .
PATTERN RECOGNITION LETTERS, 2003, 24 (16) :3197-3203
[5]   An efficient architecture for lifting-based two-dimensional discrete wavelet transforms [J].
Barua, S ;
Carletta, JE ;
Kotteri, KA ;
Bell, AE .
INTEGRATION-THE VLSI JOURNAL, 2005, 38 (03) :341-352
[6]   Brook for GPUs: Stream computing on graphics hardware [J].
Buck, I ;
Foley, T ;
Horn, D ;
Sugerman, J ;
Fatahalian, K ;
Houston, M ;
Hanrahan, P .
ACM TRANSACTIONS ON GRAPHICS, 2004, 23 (03) :777-786
[7]   Cache-efficient wavelet lifting in JPEG 2000 [J].
Chatterjee, S ;
Brooks, CD .
IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOL I AND II, PROCEEDINGS, 2002, :797-800
[8]  
CHAVER D, 2002, P 2002 INT C HIGH PE, P9
[9]  
CHAVER D, 2002, P 16 INT PAR DISTR P
[10]  
CHAVER D, 2003, P 17 IEEE INT PAR DI