The design and implementation of FFTW3

Cited by: 3257
Authors
Frigo, M [1]
Johnson, SG [2]
Affiliations
[1] IBM Corp, Austin Res Lab, Austin, TX 78758 USA
[2] MIT, Cambridge, MA 02139 USA
Keywords
adaptive software; cosine transform; fast Fourier transform (FFT); Fourier transform; Hartley transform; I/O tensor
DOI
10.1109/JPROC.2004.840301
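Chinese Library Classification (CLC)
TM [Electrical engineering]; TN [Electronics and communication technology]
Discipline classification codes
0808; 0809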
Abstract
FFTW is an implementation of the discrete Fourier transform (DFT) that adapts to the hardware in order to maximize performance. This paper shows that such an approach can yield an implementation that is competitive with hand-optimized libraries, and describes the software structure that makes our current FFTW3 version flexible and adaptive. We further discuss a new algorithm for real-data DFTs of prime size, a new way of implementing DFTs by means of machine-specific single-instruction, multiple-data (SIMD) instructions, and how a special-purpose compiler can derive optimized implementations of the discrete cosine and sine transforms automatically from a DFT algorithm.
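
The idea of a transform that "adapts to the hardware" can be illustrated with a toy sketch: time several interchangeable DFT routines on the current machine and keep the fastest one for later use. This is an illustration only, not FFTW's planner or API. The names `naive_dft`, `radix2_fft`, and `plan_dft` are hypothetical, and the two candidate algorithms are textbook ones chosen for brevity.

```python
import cmath
import time


def naive_dft(x):
    """Direct O(n^2) evaluation of the DFT definition."""
    n = len(x)
    return [
        sum(x[j] * cmath.exp(-2j * cmath.pi * j * k / n) for j in range(n))
        for k in range(n)
    ]


def radix2_fft(x):
    """Recursive radix-2 Cooley-Tukey FFT; falls back to naive_dft on odd sizes."""
    n = len(x)
    if n == 1:
        return list(x)
    if n % 2:
        return naive_dft(x)
    even = radix2_fft(x[0::2])
    odd = radix2_fft(x[1::2])
    half = n // 2
    # Multiply the odd-indexed sub-transform by the twiddle factors.
    tw = [cmath.exp(-2j * cmath.pi * k / n) * odd[k] for k in range(half)]
    return [even[k] + tw[k] for k in range(half)] + [
        even[k] - tw[k] for k in range(half)
    ]


def plan_dft(n, candidates=(naive_dft, radix2_fft), trials=3):
    """Hypothetical planner: measure each candidate at size n, return the fastest."""
    sample = [complex(i % 7, 0.0) for i in range(n)]
    best, best_time = None, float("inf")
    for candidate in candidates:
        for _ in range(trials):
            start = time.perf_counter()
            candidate(sample)
            elapsed = time.perf_counter() - start
            if elapsed < best_time:
                best, best_time = candidate, elapsed
    return best


if __name__ == "__main__":
    transform = plan_dft(256)          # pay the measurement cost once
    spectrum = transform([1.0] * 256)  # reuse the chosen routine many times
    print(transform.__name__, abs(spectrum[0]))
```

The design choice shown is the separation between a one-time measurement step and repeated execution; which candidate wins depends on the machine and the transform size, so the sketch makes no claim about the outcome.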
Pages: 216-231
Page count: 16