Perceptually-Friendly H.264/AVC Video Coding Based on Foveated Just-Noticeable-Distortion Model

被引:167
作者
Chen, Zhenzhong [1 ]
Guillemot, Christine [2 ]
机构
[1] INRIA IRISA, F-35042 Rennes, France
[2] Inst Natl Rech Informat & Automat, F-35042 Rennes, France
关键词
Bit allocation; foveation model; H264/advanced video coding (AVC); human visual system (HVS); just-noticeable-distortion (JND); video coding; BIT ALLOCATION; MOTION ESTIMATION; VISUAL-ATTENTION; REGION; CODER; QUANTIZATION; COMMUNICATION; IMAGES;
D O I
10.1109/TCSVT.2010.2045912
中图分类号
TM [电工技术]; TN [电子技术、通信技术];
学科分类号
0808 ; 0809 ;
摘要
Traditional video compression methods remove spatial and temporal redundancy based on the signal statistical correlation. However, to reach higher compression ratios without perceptually degrading the reconstructed signal, the properties of the human visual system (HVS) need to be better exploited. Research effort has been dedicated to modeling the spatial and temporal just-noticeable-distortion (JND) based on the sensitivity of the HVS to luminance contrast, and accounting for spatial and temporal masking effects. This paper describes a foveation model as well as a foveated JND (FJND) model in which the spatial and temporal JND models are enhanced to account for the relationship between visibility and eccentricity. Since the visual acuity decreases when the distance from the fovea increases, the visibility threshold increases with increased eccentricity. The proposed FJND model is then used for macroblock (MB) quantization adjustment in H.264/advanced video coding (AVC). For each MB, the quantization parameter is optimized based on its FJND information. The Lagrange multiplier in the rate-distortion optimization is adapted so that the MB noticeable distortion is minimized. The performance of the FJND model has been assessed with various comparisons and subjective visual tests. It has been shown that the proposed FJND model can increase the visual quality versus rate performance of the H.264/AVC video coding scheme.
引用
收藏
页码:806 / 819
页数:14
相关论文
共 74 条
[31]  
*ITU T, 2000, H263 ITUT
[32]   SIGNAL COMPRESSION BASED ON MODELS OF HUMAN PERCEPTION [J].
JAYANT, N ;
JOHNSTON, J ;
SAFRANEK, R .
PROCEEDINGS OF THE IEEE, 1993, 81 (10) :1385-1422
[33]   Estimating just-noticeable distortion for video [J].
Jia, Yuting ;
Lin, Weisi ;
Kassim, Ashraf A. .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2006, 16 (07) :820-829
[34]   Frame bit allocation for the H.264/AVC video coder via Cauchy-density-based rate and distortion models [J].
Kamaci, N ;
Altunbasak, Y ;
Mersereau, RM .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2005, 15 (08) :994-1006
[36]   Retinally reconstructed images: Digital images having a resolution match with the human eye [J].
Kuyel, T ;
Geisler, W ;
Ghosh, J .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 1999, 29 (02) :235-243
[37]   A coherent computational approach to model bottom-up visual attention [J].
Le Meur, O ;
Le Callet, P ;
Barba, D ;
Thoreau, D .
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2006, 28 (05) :802-817
[38]   Objective video quality assessment [J].
Lee, C ;
Cho, S ;
Choe, J ;
Jeong, T ;
Ahn, W ;
Lee, E .
OPTICAL ENGINEERING, 2006, 45 (01)
[39]   Adaptive rate control for H.264 [J].
Li, Z. G. ;
Gao, W. ;
Pan, F. ;
Ma, S. W. ;
Lim, K. P. ;
Feng, G. N. ;
Lin, X. ;
Rahardja, S. ;
Lu, H. Q. ;
Lu, Y. .
JOURNAL OF VISUAL COMMUNICATION AND IMAGE REPRESENTATION, 2006, 17 (02) :376-406
[40]  
Lin W., 2006, DIGITAL VIDEO IMAGE