A spectro-temporal modulation index (STMI) for assessment of speech intelligibility

Cited by: 174
Authors
Elhilali, M [1]
Chi, T [1]
Shamma, SA [1]
Affiliations
[1] Univ Maryland, Dept Elect & Comp Engn, Syst Res Inst, College Pk, MD 20742 USA
Keywords
modulation transfer function; spectro-temporal modulations; speech intelligibility; STMI
DOI
10.1016/S0167-6393(02)00134-6
Chinese Library Classification (CLC) number
O42 [Acoustics]
Subject classification codes
070206; 082403
Abstract
We present a biologically motivated method for assessing the intelligibility of speech recorded or transmitted under various types of distortions. The method employs an auditory model to analyze the effects of noise, reverberations, and other distortions on the joint spectro-temporal modulations present in speech, and on the ability of a channel to transmit these modulations. The effects are summarized by a spectro-temporal modulation index (STMI). The index is validated by comparing its predictions to those of the classical STI and to error rates reported by human subjects listening to speech contaminated with combined noise and reverberation. We further demonstrate that the STMI can handle difficult and nonlinear distortions such as phase-jitter and shifts, to which the STI is not sensitive. © 2002 Published by Elsevier B.V.
Pages: 331-348
Page count: 18
Related papers
28 in total
[1]  
[Anonymous], 1994, DIGITAL COMMUNICATIO
[2]  
ANSI, 1969, S351969 ANSI
[3]  
Arai T, 1996, ICSLP 96 - FOURTH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, PROCEEDINGS, VOLS 1-4, P2490, DOI 10.1109/ICSLP.1996.607318
[4]  
ATLAS L, 2001, ICASSP 2001
[5]  
Bellamy J., 2000, WILEY SERIES TELECOM
[6]   PREDICTORS OF SPEECH-INTELLIGIBILITY IN ROOMS [J].
BRADLEY, JS .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1986, 80 (03) :837-845
[7]   Spectro-temporal modulation transfer functions and speech intelligibility [J].
Chi, TS ;
Gao, YJ ;
Guyton, MC ;
Ru, PW ;
Shamma, S .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 106 (05) :2719-2732
[8]   A quantitative model of the "effective" signal processing in the auditory system. I. Model structure [J].
Dau, T ;
Puschel, D ;
Kohlrausch, A .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1996, 99 (06) :3615-3622
[9]   Spectro-temporal response field characterization with dynamic ripples in ferret primary auditory cortex [J].
Depireux, DA ;
Simon, JZ ;
Klein, DJ ;
Shamma, SA .
JOURNAL OF NEUROPHYSIOLOGY, 2001, 85 (03) :1220-1234
[10]   EFFECT OF TEMPORAL ENVELOPE SMEARING ON SPEECH RECEPTION [J].
DRULLMAN, R ;
FESTEN, JM ;
PLOMP, R .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1994, 95 (02) :1053-1064