Modulation spectra of natural sounds and ethological theories of auditory processing

被引:340
作者
Singh, NC
Theunissen, FE
机构
[1] Univ Calif Berkeley, Dept Psychol, Berkeley, CA 94720 USA
[2] Univ Calif Berkeley, Inst Neurosci, Berkeley, CA 94720 USA
关键词
D O I
10.1121/1.1624067
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The modulation statistics of natural sound ensembles were analyzed by calculating the probability distributions of the amplitude envelope of the sounds and their time-frequency correlations given by the modulation spectra. These modulation spectra were obtained by calculating the two-dimensional Fourier transform of the autocorrelation matrix of the sound stimulus in its spectrographic representation. Since temporal bandwidth and spectral bandwidth are conjugate variables, it is shown that the joint modulation spectrum of sound occupies a restricted space: sounds cannot have rapid temporal and spectral modulations simultaneously. Within this restricted space, it is shown that natural sounds have a characteristic signature. Natural sounds, in general, are low-passed, showing most of their modulation energy for low temporal and spectral modulations. Animal vocalizations and human speech are further characterized by the fact that most of the spectral modulation power is found only for low temporal modulation. Similarly, the distribution of the amplitude envelopes also exhibits characteristic shapes for natural sounds, reflecting the high probability of epochs with no sound, systematic differences across frequencies, and a relatively uniform distribution for the log of the amplitudes for vocalizations. It is postulated that the auditory system as well as engineering applications may exploit these statistical properties to obtain an efficient representation of behaviorally relevant sounds. To test such a hypothesis we show how to create synthetic sounds with first and second order envelope statistics identical to those found in natural sounds. (C) 2003 Acoustical Society of America.
引用
收藏
页码:3394 / 3411
页数:18
相关论文
共 55 条
[21]   Selectivity for conspecific song in the zebra finch auditory forebrain [J].
Grace, JA ;
Amin, N ;
Singh, NC ;
Theunissen, FE .
JOURNAL OF NEUROPHYSIOLOGY, 2003, 89 (01) :472-487
[22]  
Green D., 1986, AUDITORY FREQUENCY S, P351
[23]   SIGNAL ESTIMATION FROM MODIFIED SHORT-TIME FOURIER-TRANSFORM [J].
GRIFFIN, DW ;
LIM, JS .
IEEE TRANSACTIONS ON ACOUSTICS SPEECH AND SIGNAL PROCESSING, 1984, 32 (02) :236-243
[24]   Robust spectrotemporal reverse correlation for the auditory system: Optimizing stimulus design [J].
Klein D.J. ;
Depireux D.A. ;
Simon J.Z. ;
Shamma S.A. .
Journal of Computational Neuroscience, 2000, 9 (01) :85-111
[25]   Efficient coding of natural sounds [J].
Lewicki, MS .
NATURE NEUROSCIENCE, 2002, 5 (04) :356-363
[26]   Detection of changes in timbre and harmonicity in complex sounds by zebra finches (Taeniopygia guttata) and budgerigars (Melopsittacus undulatus) [J].
Lohr, B ;
Dooling, RJ .
JOURNAL OF COMPARATIVE PSYCHOLOGY, 1998, 112 (01) :36-47
[27]   Representation of acoustic communication signals by insect auditory receptor neurons [J].
Machens, CK ;
Stemmler, MB ;
Prinz, P ;
Krahe, R ;
Ronacher, B ;
Herz, AVM .
JOURNAL OF NEUROSCIENCE, 2001, 21 (09) :3215-3227
[28]  
MARGOLIASH D, 1983, J NEUROSCI, V3, P1039
[29]   Spectrotemporal receptive fields in the lemniscal auditory thalamus and cortex [J].
Miller, LM ;
Escabí, MA ;
Read, HL ;
Schreiner, CE .
JOURNAL OF NEUROPHYSIOLOGY, 2002, 87 (01) :516-527
[30]  
Newman J., 1978, BRAIN RES, V54, P287