Evaluating MMX technology using DSP and multimedia applications

Cited by: 36

Authors:

Bhargava, R [1]

John, LK [1]

Evans, BL [1]

Radhakrishnan, R [1]

Affiliations:

[1] Univ Texas, Dept Elect & Comp Engn, Austin, TX 78712 USA

Source:

31ST ANNUAL ACM/IEEE INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE, PROCEEDINGS | 1998

Keywords:

digital signal processing; machine measurement; MMX; performance monitoring; workload characterization

DOI:

10.1109/MICRO.1998.742767

Chinese Library Classification (CLC) number:

TP3 [Computing technology, computer technology]

Discipline classification code:

0812

Abstract:

Many current general-purpose processors use extensions to the instruction set architecture to enhance the performance of digital signal processing (DSP) and multimedia applications. In this paper, we evaluate the X86 architecture's multimedia extension (MMX) instruction set on a set of benchmarks. Our benchmark suite includes kernels (filtering, fast Fourier transforms, and vector arithmetic) and applications (JPEG compression, Doppler radar processing, imaging, and G.722 speech encoding). Each benchmark has at least one non-MMX version in C and an MMX version that makes calls to an MMX assembly library. The versions differ in the implementation of filtering, vector arithmetic, and other relevant kernels. The observed speedup for the MMX versions of the suite ranges from less than 1.0 to 6.1. In addition to quantifying the speedup, we perform detailed instruction-level profiling using Intel's VTune profiling tool. Using VTune, we profile static and dynamic instructions, microarchitecture operations, and data references to isolate the specific reasons for speedup or lack thereof. This analysis allows one to understand which aspects of native signal processing instruction sets are most useful, what their current limitations are, and how they can be utilized most efficiently.

Pages: 37-46

Page count: 10