Sparse representations for the cocktail party problem

被引:37
作者
Asari, Hiroki
Pearlmutter, Barak A.
Zador, Anthony M.
机构
[1] Cold Spring Harbor Lab, Cold Spring Harbor, NY 11724 USA
[2] Watson Sch Biol Sci, Cold Spring Harbor, NY 11724 USA
[3] Natl Univ Ireland, Hamilton Inst, Maynooth, Kildare, Ireland
关键词
auditory processing; optimality; receptive field; sparse coding; stream segregation; cortical representation; BLIND SOURCE SEPARATION; RECEPTIVE-FIELDS; AUDITORY-CORTEX; SOUND LOCALIZATION; RESPONSES; STATISTICS; NEURONS; DECOMPOSITION; SEQUENCES; FEATURES;
D O I
10.1523/JNEUROSCI.1563-06.2006
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
A striking feature of many sensory processing problems is that there appear to be many more neurons engaged in the internal representations of the signal than in its transduction. For example, humans have similar to 30,000 cochlear neurons, but at least 1000 times as many neurons in the auditory cortex. Such apparently redundant internal representations have sometimes been proposed as necessary to overcome neuronal noise. We instead posit that they directly subserve computations of interest. Here we provide an example of how sparse overcomplete linear representations can directly solve difficult acoustic signal processing problems, using as an example monaural source separation using solely the cues provided by the differential filtering imposed on a source by its path from its origin to the cochlea [the head-related transfer function (HRTF)]. In contrast to much previous work, the HRTF is used here to separate auditory streams rather than to localize them in space. The experimentally testable predictions that arise from this model, including a novel method for estimating the optimal stimulus of a neuron using data from a multineuron recording experiment, are generic and apply to a wide range of sensory computations.
引用
收藏
页码:7477 / 7490
页数:14
相关论文
共 69 条
[1]   Adaptive blind signal processing - Neural network approaches [J].
Amari, SI ;
Cichocki, A .
PROCEEDINGS OF THE IEEE, 1998, 86 (10) :2026-2048
[2]  
Attias H, 1997, ADV NEUR IN, V9, P27
[3]   Responses of neurons in primary and inferior temporal visual cortices to natural scenes [J].
Baddeley, R ;
Abbott, LF ;
Booth, MCA ;
Sengpiel, F ;
Freeman, T ;
Wakeman, EA ;
Rolls, ET .
PROCEEDINGS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 1997, 264 (1389) :1775-1783
[4]   Auditory cortical responses elicited in awake primates by random spectrum stimuli [J].
Barbour, DL ;
Wang, XQ .
JOURNAL OF NEUROSCIENCE, 2003, 23 (18) :7194-7206
[5]  
Barlow HB., 1961, SENS COMMUN, V1, P217, DOI DOI 10.7551/MITPRESS/9780262518420.003.0013
[6]   AN INFORMATION MAXIMIZATION APPROACH TO BLIND SEPARATION AND BLIND DECONVOLUTION [J].
BELL, AJ ;
SEJNOWSKI, TJ .
NEURAL COMPUTATION, 1995, 7 (06) :1129-1159
[7]   The ''independent components'' of natural scenes are edge filters [J].
Bell, AJ ;
Sejnowski, TJ .
VISION RESEARCH, 1997, 37 (23) :3327-3338
[8]   A blind source separation technique using second-order statistics [J].
Belouchrani, A ;
AbedMeraim, K ;
Cardoso, JF ;
Moulines, E .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1997, 45 (02) :434-444
[9]   READING A NEURAL CODE [J].
BIALEK, W ;
RIEKE, F ;
VANSTEVENINCK, RRD ;
WARLAND, D .
SCIENCE, 1991, 252 (5014) :1854-1857
[10]   Underdetermined blind source separation using sparse representations [J].
Bofill, P ;
Zibulevsky, M .
SIGNAL PROCESSING, 2001, 81 (11) :2353-2362