An information theory of visual communication

被引:35
作者
Huck, FO [1 ]
Fales, CL [1 ]
Rahman, Z [1 ]
机构
[1] SCI & TECHNOL CORP, HAMPTON, VA 23666 USA
来源
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES | 1996年 / 354卷 / 1716期
关键词
D O I
10.1098/rsta.1996.0098
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The fundamental problem of visual communication is that of producing the best possible picture at the lowest data rate. We address this problem by extending information theory to the assessment of the visual communication channel as a whole, from image gathering to display. The extension unites two disciplines, the electrooptical design of image gathering and display devices and the digital processing for image coding and restoration. The mathematical development leads to several intuitively attractive figures of merit for assessing the visual communication channel as a function of the critical limiting factors that constrain its performance. Multiresolution decomposition is included in the mathematical development to optimally combine the economical encoding of the transmitted signal with image gathering and restoration. Quantitative and qualitative assessments demonstrate that a visual communication channel ordinarily can be expected to produce the best possible picture at the lowest data rate only if the image-gathering device produces the maximum-realizable information rate and the image-restoration algorithm properly accounts for the critical limiting factors that constrain the visual communication. These assessments encompass (a) the electro-optical design of the image-gathering device in terms of the trade-off between blurring and aliasing in the presence of photodetector and quantization noises, (b) the compression of data transmission by redundancy reduction, (c) the robustness of the image restoration to uncertainties in the statistical properties of the captured radiance field, and (d) the enhancement of particular features or, more generally, of the visual quality of the observed image. The 'best visual quality' in this context normally implies a compromise among maximum-realizable fidelity, sharpness, and clarity which depends on the characteristics of the scene and the purpose of the visual communication (e.g. diagnosis versus entertainment).
引用
收藏
页码:2193 / 2248
页数:56
相关论文
共 54 条
[1]  
Akansu A.N., 1992, Multiresolution Signal Decomposition: Transforms, Subbands, and Wavelets
[2]  
ALTERGARTENBERG R, 1992, COMPUTER VISION IMAG
[3]  
Andrews HC, 1977, DIGITAL IMAGE RESTOR
[4]  
[Anonymous], FUNDAMENTALS ELECT I
[5]   THE FERRIER LECTURE, 1980 - CRITICAL LIMITING FACTORS IN THE DESIGN OF THE EYE AND VISUAL-CORTEX [J].
BARLOW, HB .
PROCEEDINGS OF THE ROYAL SOCIETY SERIES B-BIOLOGICAL SCIENCES, 1981, 212 (1186) :1-34
[6]  
BOWKER DE, 1985, NASA REF PUBL, V1139
[7]  
Boyd R.W., 1983, Radiometry and the detection of optical radiation
[8]   THE LAPLACIAN PYRAMID AS A COMPACT IMAGE CODE [J].
BURT, PJ ;
ADELSON, EH .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1983, 31 (04) :532-540
[9]   THE WAVELET TRANSFORM, TIME-FREQUENCY LOCALIZATION AND SIGNAL ANALYSIS [J].
DAUBECHIES, I .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1990, 36 (05) :961-1005
[10]   ORTHONORMAL BASES OF COMPACTLY SUPPORTED WAVELETS [J].
DAUBECHIES, I .
COMMUNICATIONS ON PURE AND APPLIED MATHEMATICS, 1988, 41 (07) :909-996