Foveation scalable video coding with automatic fixation selection

Cited by: 147
Authors
Wang, Z [1]
Lu, LG
Bovik, AC
Affiliations
[1] Univ Texas, LIVE, Austin, TX 78712 USA
[2] NYU, LCV, New York, NY 10003 USA
[3] IBM Corp, Thomas J Watson Res Ctr, Yorktown Hts, NY 10598 USA
Keywords
foveation; human visual system; image and video quality; rate scalable coding; video coding; wavelet
DOI
10.1109/TIP.2003.809015
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Image and video coding is an optimization problem. A successful image and video coding algorithm delivers a good tradeoff between visual quality and other coding performance measures, such as compression, complexity, scalability, robustness, and security. In this paper, we follow two recent trends in image and video coding research. One is to incorporate human visual system (HVS) models to improve the current state-of-the-art of image and video coding algorithms by better exploiting the properties of the intended receiver. The other is to design rate scalable image and video codecs, which allow the extraction of coded visual information at continuously varying bit rates from a single compressed bitstream. Specifically, we propose a foveation scalable video coding (FSVC) algorithm which supplies good quality-compression performance as well as effective rate scalability. The key idea is to organize the encoded bitstream to provide the best decoded video at an arbitrary bit rate in terms of foveated visual quality measurement. A foveation-based HVS model plays an important role in the algorithm. The algorithm is adaptable to different applications, such as knowledge-based video coding and video communications over time-varying, multiuser and interactive networks.
Pages: 243-254
Page count: 12