Multiresolution spectrotemporal analysis of complex sounds

Cited by: 443
Authors
Chi, T [1]
Ru, PW [1]
Shamma, SA [1]
Affiliations
[1] Univ Maryland, Ctr Auditory & Acoust Res, Syst Res Inst, Dept Elect & Comp Engn, College Pk, MD 20742 USA
DOI
10.1121/1.1945807
Chinese Library Classification number
O42 [Acoustics];
Discipline classification code
070206 ; 082403 ;
Abstract
A computational model of auditory analysis is described that is inspired by psychoacoustical and neurophysiological findings in early and central stages of the auditory system. The model provides a unified multiresolution representation of the spectral and temporal features likely critical in the perception of sound. Simplified, more specifically tailored versions of this model have already been validated by successful application in the assessment of speech intelligibility [Elhilali et al., Speech Commun. 41(2-3), 331-348 (2003); Chi et al., J. Acoust. Soc. Am. 106, 2719-2732 (1999)] and in explaining the perception of monaural phase sensitivity [R. Carlyon and S. Shamma, J. Acoust. Soc. Am. 114, 333-348 (2003)]. Here we provide a more complete mathematical formulation of the model, illustrating how complex signals are transformed through various stages of the model, and relating it to comparable existing models of auditory processing. Furthermore, we outline several reconstruction algorithms to resynthesize the sound from the model output so as to evaluate the fidelity of the representation and contribution of different features and cues to the sound percept. © 2005 Acoustical Society of America.
Pages: 887-906
Page count: 20
Related papers
94 in total
[21]   DERIBAUPIERRE F, 1981, J PHYSIOL-LONDON, V318, pP23
[22]   Temporal envelope and fine-structure cues for speech-intelligibility [J].
Drullman, R .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1995, 97 (01) :585-592
[23]   Effect of temporal envelope smearing on speech reception [J].
Drullman, R ;
Festen, JM ;
Plomp, R .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 95 (02) :1053-1064
[24]   Distribution of combination-sensitive neurons in the ventral fringe area of the auditory-cortex of the mustached bat [J].
Edamatsu, H ;
Kawasaki, M ;
Suga, N .
JOURNAL OF NEUROPHYSIOLOGY, 1989, 61 (01) :202-207
[25]   Temporal modulation transfer functions in cat primary auditory cortex: Separating stimulus effects from neural mechanisms [J].
Eggermont, JJ .
JOURNAL OF NEUROPHYSIOLOGY, 2002, 87 (01) :305-321
[26]   Dynamics of precise spike timing in primary auditory cortex [J].
Elhilali, M ;
Fritz, JB ;
Klein, DJ ;
Simon, JZ ;
Shamma, SA .
JOURNAL OF NEUROSCIENCE, 2004, 24 (05) :1159-1172
[27]   A spectro-temporal modulation index (STMI) for assessment of speech intelligibility [J].
Elhilali, M ;
Chi, T ;
Shamma, SA .
SPEECH COMMUNICATION, 2003, 41 (2-3) :331-348
[28]   Characterizing frequency selectivity for envelope fluctuations [J].
Ewert, SD ;
Dau, T .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2000, 108 (03) :1181-1196
[29]   Phase retrieval algorithms - a comparison [J].
Fienup, JR .
APPLIED OPTICS, 1982, 21 (15) :2758-2769
[30]   Phase-retrieval stagnation problems and solutions [J].
Fienup, JR ;
Wackerman, CC .
JOURNAL OF THE OPTICAL SOCIETY OF AMERICA A-OPTICS IMAGE SCIENCE AND VISION, 1986, 3 (11) :1897-1907