Sparse representations for the cocktail party problem

被引:37
作者
Asari, Hiroki
Pearlmutter, Barak A.
Zador, Anthony M.
机构
[1] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
[2] Watson Sch Biol Sci, Cold Spring Harbor, NY 11724 USA
[3] Natl Univ Ireland, Hamilton Inst, Maynooth, Kildare, Ireland
关键词
auditory processing; optimality; receptive field; sparse coding; stream segregation; cortical representation; BLIND SOURCE SEPARATION; RECEPTIVE-FIELDS; AUDITORY-CORTEX; SOUND LOCALIZATION; RESPONSES; STATISTICS; NEURONS; DECOMPOSITION; SEQUENCES; FEATURES;
D O I
10.1523/JNEUROSCI.1563-06.2006
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
A striking feature of many sensory processing problems is that there appear to be many more neurons engaged in the internal representations of the signal than in its transduction. For example, humans have similar to 30,000 cochlear neurons, but at least 1000 times as many neurons in the auditory cortex. Such apparently redundant internal representations have sometimes been proposed as necessary to overcome neuronal noise. We instead posit that they directly subserve computations of interest. Here we provide an example of how sparse overcomplete linear representations can directly solve difficult acoustic signal processing problems, using as an example monaural source separation using solely the cues provided by the differential filtering imposed on a source by its path from its origin to the cochlea [the head-related transfer function (HRTF)]. In contrast to much previous work, the HRTF is used here to separate auditory streams rather than to localize them in space. The experimentally testable predictions that arise from this model, including a novel method for estimating the optimal stimulus of a neuron using data from a multineuron recording experiment, are generic and apply to a wide range of sensory computations.
引用
收藏
页码:7477 / 7490
页数:14
相关论文
共 69 条
[41]   Analysis of sparse representation and blind source separation [J].
Li, YQ ;
Cichocki, A ;
Amari, S .
NEURAL COMPUTATION, 2004, 16 (06) :1193-1234
[42]   Spectrotemporal structure of receptive fields in areas AI and AAF of mouse auditory cortex [J].
Linden, JF ;
Liu, RC ;
Sahani, M ;
Schreiner, CE ;
Merzenich, MM .
JOURNAL OF NEUROPHYSIOLOGY, 2003, 90 (04) :2660-2675
[43]  
LINSKER R, 2001, Patent No. 6317703
[44]   Linearity of cortical receptive fields measured with natural sounds [J].
Machens, CK ;
Wehr, MS ;
Zador, AM .
JOURNAL OF NEUROSCIENCE, 2004, 24 (05) :1089-1100
[45]   Perceptual organization of tone sequences in the auditory cortex of awake Macaques [J].
Micheyl, C ;
Tian, B ;
Carlyon, RP ;
Rauschecker, JP .
NEURON, 2005, 48 (01) :139-148
[46]   Responses of auditory-cortex neurons to structural features of natural sounds [J].
Nelken, I ;
Rotman, Y ;
Bar Yosef, O .
NATURE, 1999, 397 (6715) :154-157
[47]  
NISHINO T, 2001, IEICE T A, V84, P260
[48]   Sparse coding of sensory inputs [J].
Olshausen, BA ;
Field, DJ .
CURRENT OPINION IN NEUROBIOLOGY, 2004, 14 (04) :481-487
[49]   Sparse coding with an overcomplete basis set: A strategy employed by V1? [J].
Olshausen, BA ;
Field, DJ .
VISION RESEARCH, 1997, 37 (23) :3311-3325
[50]   Emergence of simple-cell receptive field properties by learning a sparse code for natural images [J].
Olshausen, BA ;
Field, DJ .
NATURE, 1996, 381 (6583) :607-609