DEMODULATION, PREDICTIVE CODING, AND SPATIAL VISION

Cited by: 52
Authors
DAUGMAN, JG
DOWNING, CJ
Affiliation
[1] The Computer Laboratory, University of Cambridge, Pembroke Street, Cambridge CB2 3QG
Source
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION | 1995 / Vol. 12 / No. 04
DOI
10.1364/JOSAA.12.000641
Chinese Library Classification number
O43 [Optics];
Discipline classification code
070207 ; 0803 ;
Abstract
We argue that some aspects of human spatial vision, particularly for textured patterns and scenes, can be described in terms of demodulation and predictive coding. Such nonlinear processes encode a pattern into local phasors that represent it completely as a modulation, in phase and amplitude, of a prediction associated with the image structure in some region by its predominant undulation(s). The demodulation representation of a pattern is an anisotropic, second-order form of predictive coding, and it offers a particularly efficient way to analyze and encode textures, as it identifies and exploits their underlying redundancies. In addition, self-consistent domains of redundancy in image structure provide a basis for image segmentation. We first provide an algorithm for computing the three elements of a complete demodulation transform of any image, and we illustrate such decompositions for both natural and synthetic images. We then present psychophysical evidence from spatial masking experiments, as well as illustrations of perceptual organization, that suggest a possible role for such underlying representations in human vision. In psychophysical experiments employing masks with more than two oriented Fourier components, we find that peaks of threshold elevation occur at locations in the Fourier plane remote from the orientations and frequencies of the actual mask components. Rather, as would occur from demodulation, these peaks in the frequency plane are related to the vector difference frequencies between the actual masking components and their spectral centers of mass. We offer a neural interpretation of demodulation coding, and finally we demonstrate a practical application of this process in a system for automatic visual recognition of personal identity by demodulation of a facial feature.
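The masking prediction in the abstract (threshold peaks at the vector differences between the mask components and their spectral center of mass) can be illustrated with a small calculation. The following is a minimal sketch, not the authors' algorithm. It assumes that the spectral center of mass is the weighted mean of the component frequency vectors, with the weights standing in for component amplitude or energy. The function names and the three-component mask are hypothetical, and only one half of the Fourier plane is listed, since the conjugate-symmetric half mirrors it.

```python
from typing import List, Tuple

# One Fourier component: (fx, fy, weight). The frequencies are in arbitrary
# units (for example cycles/degree). The weight is an ASSUMED stand-in for
# the component's amplitude or energy; the abstract does not specify it.
Component = Tuple[float, float, float]


def spectral_centroid(components: List[Component]) -> Tuple[float, float]:
    """Weighted mean frequency vector of the components (assumed definition)."""
    total = sum(w for _, _, w in components)
    if total <= 0:
        raise ValueError("total weight must be positive")
    cx = sum(fx * w for fx, _, w in components) / total
    cy = sum(fy * w for _, fy, w in components) / total
    return cx, cy


def difference_frequencies(components: List[Component]) -> List[Tuple[float, float]]:
    """Vector difference between each component and the spectral centroid."""
    cx, cy = spectral_centroid(components)
    return [(fx - cx, fy - cy) for fx, fy, _ in components]


# Hypothetical mask with three equally weighted oriented components.
mask = [(6.0, 0.0, 1.0), (8.0, 2.0, 1.0), (10.0, -2.0, 1.0)]

if __name__ == "__main__":
    print("centroid:", spectral_centroid(mask))
    print("difference frequencies:", difference_frequencies(mask))
```

In this toy mask the difference vectors are much lower in frequency than any actual component, which is the sense in which the abstract describes the predicted masking peaks as remote from the mask components themselves.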
Pages: 641-660
Page count: 20