A spectro-temporal modulation index (STMI) for assessment of speech intelligibility

Cited by: 174
Authors
Elhilali, M [1]
Chi, T [1]
Shamma, SA [1]
Affiliations
[1] Univ Maryland, Dept Elect & Comp Engn, Syst Res Inst, College Pk, MD 20742 USA
Keywords
modulation transfer function; spectro-temporal modulations; speech intelligibility; STMI
DOI
10.1016/S0167-6393(02)00134-6
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification code
070206; 082403
Abstract
We present a biologically motivated method for assessing the intelligibility of speech recorded or transmitted under various types of distortions. The method employs an auditory model to analyze the effects of noise, reverberations, and other distortions on the joint spectro-temporal modulations present in speech, and on the ability of a channel to transmit these modulations. The effects are summarized by a spectro-temporal modulation index (STMI). The index is validated by comparing its predictions to those of the classical STI and to error rates reported by human subjects listening to speech contaminated with combined noise and reverberation. We further demonstrate that the STMI can handle difficult and nonlinear distortions such as phase-jitter and shifts, to which the STI is not sensitive. © 2002 Published by Elsevier B.V.
Pages: 331-348
Page count: 18