On the scalability of 2-D discrete wavelet transform algorithms

被引:18
作者
Fridman, J
Manolakos, ES
机构
[1] Northeastern University,Communications and Digital Signal Processing (CDSP), Center for Research and Graduate Studies, Electrical and Computer Engineering Department
关键词
wavelets; scalability; parallel processing;
D O I
10.1023/A:1008229209464
中图分类号
TP301 [理论、方法];
学科分类号
081202 ;
摘要
The ability of a parallel algorithm to make efficient use of increasing computational resources is known as its scalability. In this paper, we develop four parallel algorithms for the 2-dimensional Discrete Wavelet Transform algorithm (2-D DWT), and derive their scalability properties on Mesh and Hypercube interconnection networks. We consider two versions of the 2-D DWT algorithm, known as the Standard (S) and Non-standard (NS) forms, mapped onto P processors under two data partitioning schemes, namely checkerboard (CP) and stripped (SP) partitioning. The two checkerboard partitioned algorithms on the cut-through-routed (CT-routed) Mesh are scalable as M(2) = Omega(P log P) (Non-standard form, NS-CP), and as M(2) = Omega(P log(2) P) (Standard form, S-CP); while on the store-and-forward-routed (SF-routed) Mesh and Hypercube they are scalable as M(2) = Omega(P-3/3-gamma) (NS-CP), and as M(2) = Omega (P-2/2-gamma) (S-CP), respectively, where M(2) is the number of elements in the input matrix, and gamma is an element of (0, 1) is a parameter relating M to the number of desired octaves J as J = [gamma log M]. On the CT-routed Hypercube, scalability of the NS-form algorithms shows similar behavior as on the CT-routed Mesh. The Standard form algorithm with stripped partitioning (S-SP) is scalable on the CT-routed Hypercube as M(2) = Omega(P-2), and it is unscalable on the CT-routed Mesh. Although asymptotically the stripped partitioned algorithm S-SP on the CT-routed Hypercube would appear to be inferior to its checkerboard counterpart S-CP, detailed analysis based on the proportionality constants of the isoefficiency function shows that S-SP is actually more efficient than S-CP over a realistic range of machine and problem sizes. A milder form of this result holds on the CT- and SF-routed Mesh, where S-SP would, asymptotically, appear to be altogether unscalable.
引用
收藏
页码:185 / 217
页数:33
相关论文
共 34 条
[1]  
ADELSON EH, 1987, VISUAL COMMUNICATION, V2, P50
[2]   WAVELET-LIKE BASES FOR THE FAST SOLUTION OF 2ND-KIND INTEGRAL-EQUATIONS [J].
ALPERT, B ;
BEYLKIN, G ;
COIFMAN, R ;
ROKHLIN, V .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1993, 14 (01) :159-184
[3]   A CLASS OF BASES IN L2 FOR THE SPARSE REPRESENTATION OF INTEGRAL-OPERATORS [J].
ALPERT, BK .
SIAM JOURNAL ON MATHEMATICAL ANALYSIS, 1993, 24 (01) :246-262
[4]  
ANDREW JP, 1994, P INT C AC SPEECH SI
[5]  
BACRY E, 1992, RAIRO-MATH MODEL NUM, V26, P793
[6]  
BELL T, 1995, IEEE SPECTRUM, P25
[7]   FAST WAVELET TRANSFORMS AND NUMERICAL ALGORITHMS .1. [J].
BEYLKIN, G ;
COIFMAN, R ;
ROKHLIN, V .
COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 1991, 44 (02) :141-183
[8]   EFFICIENT REALIZATIONS OF THE DISCRETE AND CONTINUOUS WAVELET TRANSFORMS - FROM SINGLE-CHIP IMPLEMENTATIONS TO MAPPINGS ON SIMD ARRAY COMPUTERS [J].
CHAKRABARTI, C ;
VISHWANATH, M .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1995, 43 (03) :759-771
[9]  
Daubechies I., 1992, 10 LECT WAVELETS
[10]   IMAGE COMPRESSION THROUGH WAVELET TRANSFORM CODING [J].
DEVORE, RA ;
JAWERTH, B ;
LUCIER, BJ .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1992, 38 (02) :719-746