A real-time foveated multiresolution system for low-bandwidth video communication

Cited by: 230
Authors
Geisler, WS [1]
Perry, JS [1]
Affiliations
[1] Univ Texas, Ctr Vis & Image Sci, Austin, TX 78712 USA
Source
HUMAN VISION AND ELECTRONIC IMAGING III | 1998 / Vol. 3299
Keywords
foveation; foveated imaging; multiresolution pyramid; video; motion compensation; zero-tree coding; human vision; eye tracking; video compression;
DOI
10.1117/12.320120
Chinese Library Classification
TP3 [Computing technology, computer technology];
Discipline classification code
0812;
Abstract
Foveated imaging exploits the fact that the spatial resolution of the human visual system decreases dramatically away from the point of gaze. Because of this fact, large bandwidth savings are obtained by matching the resolution of the transmitted image to the fall-off in resolution of the human visual system. We have developed a foveated multiresolution pyramid (FMP) video coder/decoder which runs in real-time on a general purpose computer (i.e., a Pentium with the Windows 95/NT OS). The current system uses a foveated multiresolution pyramid to code each image into 5 or 6 regions of varying resolution. The user-controlled foveation point is obtained from a pointing device (e.g., a mouse or an eyetracker). Spatial edge artifacts between the regions created by the foveation are eliminated by raised-cosine blending across levels of the pyramid, and by "foveation point interpolation" within levels of the pyramid. Each level of the pyramid is then motion compensated, multiresolution pyramid coded, and thresholded/quantized based upon human contrast sensitivity as a function of spatial frequency and retinal eccentricity. The final lossless coding includes zero-tree coding. Optimal use of foveated imaging requires eye tracking; however, there are many useful applications which do not require eye tracking.
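The raised-cosine blending mentioned in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the `inner` and `outer` radii of the transition band, and the use of plain pixel distance from the foveation point are all assumptions made for illustration. The idea is only that the weight given to the finer pyramid level falls smoothly from 1 to 0 across a band, so no hard edge appears between regions of different resolution.

```python
import math


def raised_cosine_weight(d, inner, outer):
    """Weight of the finer level at distance d from the foveation point.

    Hypothetical parameters: the weight is 1 inside `inner`, 0 beyond
    `outer`, and follows a raised-cosine roll-off in between.
    """
    if d <= inner:
        return 1.0
    if d >= outer:
        return 0.0
    t = (d - inner) / (outer - inner)  # 0 at inner edge, 1 at outer edge
    return 0.5 * (1.0 + math.cos(math.pi * t))


def blend_pixel(fine, coarse, d, inner, outer):
    """Mix a fine-level and a coarse-level pixel value at distance d."""
    w = raised_cosine_weight(d, inner, outer)
    return w * fine + (1.0 - w) * coarse


def distance(x, y, fx, fy):
    """Euclidean distance of pixel (x, y) from foveation point (fx, fy)."""
    return math.hypot(x - fx, y - fy)
```

For example, with an assumed band from 10 to 20 pixels, a pixel 15 pixels from the foveation point receives equal contributions from the fine and coarse levels.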
Pages: 294-305
Page count: 12