SIGNAL COMPRESSION BASED ON MODELS OF HUMAN PERCEPTION

被引:483
作者
JAYANT, N
JOHNSTON, J
SAFRANEK, R
机构
[1] Signal Processing Research Department, AT&T Bell Laboratories, Murray Hill, NJ
关键词
D O I
10.1109/5.241504
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
The problem of signal compression is to achieve a low bit rate in the digital representation of an input signal with minimum perceived loss of signal quality. In compressing signals such as speech, audio, image, and video, the ultimate criterion of signal quality is usually that judged or measured by the human receiver. As we seek lower bit rates in the digital representations of these signals, it is imperative that we design the compression (or coding) algorithm to minimize perceptually meaningful measures of signal distortion, rather than more traditional and tractable criteria such as the mean squared difference between the waveforms at the input and output of the coding system. This paper develops the notion of perceptual coding based on the concept of distortion masking by the signal being compressed, and describes how the field has progressed as a result of advances in classical coding theory, modeling of human perception, and digital signal processing. We propose that fundamental limits in the science can be expressed by the semi-quantitative concepts of perceptual entropy and the perceptual distortion-rate function, and we examine current compression technology with respect to that framework. We conclude with a summary of future challenges and research directions.
引用
收藏
页码:1385 / 1422
页数:38
相关论文
共 177 条
  • [1] DISCRETE COSINE TRANSFORM
    AHMED, N
    NATARAJAN, T
    RAO, KR
    [J]. IEEE TRANSACTIONS ON COMPUTERS, 1974, C 23 (01) : 90 - 93
  • [2] AHUMADA AJ, 1987, J OPT SOC AM A, V4, P2372, DOI 10.1364/JOSAA.4.002372
  • [3] AHUMADA AJ, 1992, P SPIE C HUMAN VISIO, V3, P365
  • [4] Aizawa K., 1989, Signal Processing: Image Communication, V1, P139, DOI 10.1016/0923-5965(89)90006-4
  • [5] [Anonymous], 1975, DELTA MODULATION SYS
  • [6] [Anonymous], 1971, RATE DISTORTION THEO
  • [7] [Anonymous], 1992, SPRINGER INT
  • [8] ATAL BS, 1986, P ICASSP, P1681
  • [9] Barnsley MF., 2014, FRACTALS EVERYWHERE
  • [10] BASERI R, 1992, P ICASSP, V3, P365