Perceptual coders and perceptual metrics

被引：9

作者：

Chen, JQ ^{[1
]}

Pappas, TN ^{[1
]}

机构：

[1] Northwestern Univ, Dept Elect & Comp Engn, Evanston, IL 60208 USA

来源：

HUMAN VISION AND ELECTRONIC IMAGING VI | 2001年 / 4299卷

关键词：

perceptual model; perceptually lossless compression; human visual system; perceptual subband; image coder; SPIHT; JPEG; EZW; perceptual PSNR;

D O I：

10.1117/12.429485

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

We examine perceptual metrics and use them to evaluate the quality of still image coders. We show that mean-squared-error based metrics (such as PSNR) fail to predict image quality when one compares artifacts generated by different types of image coders (e.g., block-based, subband, and wavelet coders). We consider three different types of coders: JPEG, the Safranek-Johnston perceptual subband coder (PIC), and the Said-Pearlman SPIHT algorithm with perceptually weighted subband quantization, based on the Watson et al. visual thresholds. We show that incorporating perceptual weighting in the SPIHT algorithm results in significant improvement in visual quality. The metrics we consider are based on the same image decompositions (subband, wavelet, DCT) as the corresponding compression algorithms. Such metrics are computationally efficient and considerably simpler than more elaborate metrics (e.g., by Daly, Lubin, and Teo and Heeger). However, since each of the metrics is used for the optimization of a coder, one expects that they would be biased towards that coder. We use the metrics to evaluate the performance of the compression techniques for a wide range of bit rates. Our experiments indicate that the PIC metric provides the best correlation with subjective evaluations. It predicts that at very low bit rates the SPIHT algorithm and the 8 x 8 PIC coder perform the best, while at high bit rates the 4 x 4 PIC coder is the best. More importantly, we show that the relative algorithm performance depends on image content, with the subband and DCT coders performing best for images with a lot of high frequency content, and the wavelet coders performing best for smoother images.

引用

页码：150 / 162

页数：13

共 30 条

[1]

[Anonymous], SOC INFORM DISPLAY D

[2]

Daly S., 1993, The visible differences predictor, P179

[3] Perceptual quality metrics applied to still image compression [J].

Eckert, MP ;

Bradley, AP .

SIGNAL PROCESSING, 1998, 70 (03) :177-200

[4]

Hahn PJ, 1998, 1998 INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL 3, P404, DOI 10.1109/ICIP.1998.999030

[5] Wavelet coefficient quantization to produce equivalent visual distortions in complex stimuli [J].

Hemami, SS ;

Ramos, MG .

HUMAN VISION AND ELECTRONIC IMAGING V, 2000, 3959 :200-210

[6]

Hontsch I, 1997, INTERNATIONAL CONFERENCE ON IMAGE PROCESSING - PROCEEDINGS, VOL I, P37, DOI 10.1109/ICIP.1997.647378

[7]

HONTSCH I, 1997, P IEEE INT C IM PROC, V1, P41

[8] Image coding by perceptual pruning with a cortical snapshot indistinguishability criterion [J].

Horowitz, MJ ;

Neuhoff, DL .

HUMAN VISION AND ELECTRONIC IMAGING III, 1998, 3299 :330-339

[9] SIGNAL COMPRESSION BASED ON MODELS OF HUMAN PERCEPTION [J].

JAYANT, N ;

JOHNSTON, J ;

SAFRANEK, R .

PROCEEDINGS OF THE IEEE, 1993, 81 (10) :1385-1422

[10] Perceptual quality measure using a spatio-temporal model of the human visual system [J].

Lambrecht, CJV ;

Verscheure, O .

DIGITAL VIDEO COMPRESSION: ALGORITHMS AND TECHNOLOGIES 1996, 1996, 2668 :450-461

← 1 2 3 →