WHAT IS THE GOAL OF SENSORY CODING

被引:805
作者
FIELD, DJ
机构
关键词
D O I
10.1162/neco.1994.6.4.559
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A number of recent attempts have been made to describe early sensory coding in terms of a general information processing strategy. In this paper, two strategies are contrasted. Both strategies take advantage of the redundancy in the environment to produce more effective representations. The first is described as a ''compact'' coding scheme. compact code performs a transform that allows the input to be represented with a reduced number of vectors (cells) with minimal RMS error. This approach has recently become popular in the neural network literature and is related to a process called Principal Components Analysis (PCA). A number of recent papers have suggested that the optimal ''compact'' code for representing natural scenes will have units with receptive field profiles much like those found in the retina and primary visual cortex. However, in this paper, it is proposed that compact coding schemes are insufficient to account for the receptive field properties of cells in the mammalian visual pathway. In contrast, it is proposed that the visual system is near to optimal in representing natural scenes only if optimality is defined in terms of ''sparse distributed'' coding. In a sparse distributed code, all cells in the code have an equal response probability across the class of images but have a low response probability for any single image. In such a code, the dimensionality is not reduced. Rather, the redundancy of the input is transformed into the redundancy of the firing pattern of cells. It is proposed that the signature for a sparse code is found in the fourth moment of the response distribution (i.e., the kurtosis). In measurements with 55 calibrated natural scenes, the kurtosis was found to peak when the bandwidths of the visual code matched those of cells in the mammalian visual cortex. Codes resembling ''wavelet transforms'' are proposed to be effective because the response histograms of such codes are sparse (i.e., show high kurtosis) when presented with natural scenes. It is proposed that the structure of the image that allows sparse coding is found in the phase spectrum of the image. It is suggested that natural scenes, to a first approximation, can be considered as a sum of self-similar local functions (the inverse of a wavelet). Possible reasons for why sensory systems would evolve toward sparse coding are presented.
引用
收藏
页码:559 / 601
页数:43
相关论文
共 71 条
[1]  
ADELSON E., 1987, SPIE VISUAL COMMUN I, VII, P845
[2]   WHAT DOES THE RETINA KNOW ABOUT NATURAL SCENES [J].
ATICK, JJ ;
REDLICH, AN .
NEURAL COMPUTATION, 1992, 4 (02) :196-210
[3]  
ATICK JJ, 1992, NETWORK-COMP NEURAL, V3, P213, DOI [10.1088/0954-898X/3/2/009, 10.3109/0954898X.2011.638888]
[4]   WHAT DOES POSTADAPTATION COLOR APPEARANCE REVEAL ABOUT CORTICAL COLOR REPRESENTATION [J].
ATICK, JJ ;
LI, ZP ;
REDLICH, AN .
VISION RESEARCH, 1993, 33 (01) :123-129
[5]   A STATISTICAL-ANALYSIS OF NATURAL IMAGES MATCHES PSYCHOPHYSICALLY DERIVED ORIENTATION TUNING CURVES [J].
BADDELEY, RJ ;
HANCOCK, PJB .
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 1991, 246 (1317) :219-223
[6]  
BARLOW H, 1989, COMP NEUR S, P54
[7]   Finding Minimum Entropy Codes [J].
Barlow, H. B. ;
Kaushal, T. P. ;
Mitchison, G. J. .
NEURAL COMPUTATION, 1989, 1 (03) :412-423
[8]   Unsupervised Learning [J].
Barlow, H. B. .
NEURAL COMPUTATION, 1989, 1 (03) :295-311
[9]  
Barlow H B, 1972, Perception, V1, P371, DOI 10.1068/p010371
[10]  
Barlow H. B., 1961, CURRENT PROBLEMS ANI