Real-time foveation techniques for low bit rate video coding

被引:22
作者
Sheikh, HR [1 ]
Evans, BL [1 ]
Bovik, AC [1 ]
机构
[1] Univ Texas, Dept Elect & Comp Engn, Lab Image & Video Engn, Austin, TX 78712 USA
关键词
D O I
10.1016/S1077-2014(02)00116-X
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Lossy video compression methods often rely on modeling the abilities and limitations of the intended receiver, the human visual system (HVS), to achieve the highest possible compression with as little effect on perceived quality as possible. Foveation, which is non-uniform resolution perception of the visual stimulus by the FINS due to the non-uniform density of photoreceptor cells in the eye, has been demonstrated to be useful for reducing bit rates beyond the abilities of uniform resolution video coders. In this work, we present real-time foveation techniques for low bit rate video coding. First, we develop an approximate model for foveation. Then, we demonstrate that foveation, as described by this model, can be incorporated into standard motion compensation and discrete cosine transform (DCT)-based video coding techniques for low bit rate video coding, such as the H.263 or MPEG-4 video coding standards, without incurring prohibitive complexity overhead. We demonstrate that foveation in the DCT domain can actually result in computational speed-Ups. The techniques presented can be implemented using the baseline modes in the video coding standards and do not require any modification to, or post-processing at, the decoder. (C) 2003 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:27 / 40
页数:14
相关论文
共 28 条
[1]  
[Anonymous], 1999, VIDEO CODING INTRO S
[2]  
[Anonymous], THESIS U TEXAS AUSTI
[3]  
*APPL SCI LAB, CHOOS EY TRACK SYST
[4]  
Arai Y., 1988, Transactions of the Institute of Electronics, Information and Communication Engineers E, VE71, P1095
[5]   PERIPHERAL SPATIAL VISION - LIMITS IMPOSED BY OPTICS, PHOTORECEPTORS, AND RECEPTOR POOLING [J].
BANKS, MS ;
SEKULER, AB ;
ANDERSON, SJ .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1991, 8 (11) :1775-1787
[6]   Enhancing videoconferencing using spatially varying sensing [J].
Basu, A ;
Wiebe, KJ .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART A-SYSTEMS AND HUMANS, 1998, 28 (02) :137-148
[7]   Prediction and tracking of moving objects in image sequences [J].
Bors, AG ;
Pitas, I .
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2000, 9 (08) :1441-1445
[8]  
*DSC, TMS320DSC21 TEX INST
[9]  
Dudgeon D. E., 1984, MULTIDIMENSIONAL DIG
[10]  
ENG A, 2000, P IEEE INT C IM PROC, V3, P758