Determination of the potential benefit of time-frequency gain manipulation

Cited by: 83
Authors
Anzalone, Michael C.
Calandruccio, Lauren
Doherty, Karen A.
Carney, Laurel H.
Affiliations
[1] Syracuse Univ, Dept Biomed & Chem Engn, Syracuse, NY 13244 USA
[2] Syracuse Univ, Dept Speech & Commun Disorders, Syracuse, NY 13244 USA
[3] Syracuse Univ, Inst Sensory Res, Dept Comp Sci, Syracuse, NY 13244 USA
DOI
10.1097/01.aud.0000233891.86809.df
Chinese Library Classification number
R36 [Pathology]; R76 [Otorhinolaryngology];
Discipline classification code
100104 ; 100213 ;
Abstract
Objective: The purpose of this study was to determine the maximum benefit provided by a time-frequency gain-manipulation algorithm for noise reduction (NR) based on an ideal detector of speech energy. The amount of detected energy necessary to show benefit using this type of NR algorithm was examined, as well as the necessary speed and frequency resolution of the gain manipulation.
Design: NR was performed using time-frequency gain manipulation, wherein the gains of individual frequency bands depended on the absence or presence of speech energy within each band. Three experiments were performed: (I) NR with ideal detectors, (II) NR with nonideal detectors, and (III) NR with ideal detectors and different processing speeds and frequency resolutions. All experiments were performed using the Hearing in Noise Test (HINT). A total of 6 listeners with normal hearing and 14 listeners with hearing loss were tested.
Results: HINT thresholds improved for all listeners with NR based on the ideal detectors used in Experiment I. The nonideal detectors of Experiment II required detection of at least 90% of the speech energy before an improvement was seen in HINT thresholds. The results of Experiment III demonstrated that relatively high temporal resolution (< 100 msec) was required by the NR algorithm to improve HINT thresholds.
Conclusions: The results indicated that a single-microphone NR system based on time-frequency gain manipulation improved the HINT thresholds of listeners. However, to obtain benefit in speech intelligibility, the detectors used in such a strategy were required to detect an unrealistically high percentage of the speech energy and to perform the gain manipulations on a fast temporal basis.
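The general idea of time-frequency gain manipulation with an ideal detector can be illustrated with a minimal sketch. It is not the authors' algorithm: the frame length, hop size, 0 dB detection threshold, -20 dB attenuation floor, the synthetic gated tone standing in for speech, and all function names are assumptions chosen for illustration. The detector is "ideal" only in the sense that it sees the speech and the noise separately before they are mixed.

```python
import numpy as np

def stft(x, frame_len=256, hop=128):
    """Short-time Fourier transform with a Hann window (frames x bins)."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(x) - frame_len) // hop
    frames = np.stack([x[i * hop : i * hop + frame_len] * window
                       for i in range(n_frames)])
    return np.fft.rfft(frames, axis=1)

def ideal_gain_mask(S, N, threshold_db=0.0, floor_db=-20.0):
    """Gain of 1 where speech energy dominates a time-frequency cell,
    otherwise a fixed attenuation. Threshold and floor are assumed values."""
    eps = 1e-12  # avoids division by zero and log of zero
    local_snr_db = 10 * np.log10((np.abs(S) ** 2 + eps) / (np.abs(N) ** 2 + eps))
    floor_gain = 10 ** (floor_db / 20)
    return np.where(local_snr_db > threshold_db, 1.0, floor_gain)

def snr_db(S, N):
    """Overall SNR computed from the speech and noise spectrograms."""
    return 10 * np.log10(np.sum(np.abs(S) ** 2) / np.sum(np.abs(N) ** 2))

# Synthetic stand-ins (assumptions): a 500 Hz tone gated on and off at 4 Hz
# plays the role of speech, and white Gaussian noise is the masker.
fs = 16000
t = np.arange(fs) / fs
gate = (np.sin(2 * np.pi * 4 * t) > 0).astype(float)
speech = gate * np.sin(2 * np.pi * 500 * t)
noise = np.random.default_rng(0).standard_normal(fs)

S, N = stft(speech), stft(noise)
mask = ideal_gain_mask(S, N)

# The gain is applied to the mixture; because it is linear, its effect on
# the speech and noise components can be evaluated separately.
snr_before = snr_db(S, N)
snr_after = snr_db(mask * S, mask * N)
print(f"SNR before: {snr_before:.1f} dB, after: {snr_after:.1f} dB")
```

With a hop of 128 samples at 16 kHz, the gains in this sketch update every 8 ms. Lengthening the frame and hop would be one way to explore the dependence on temporal resolution that the abstract describes, although the sketch measures only SNR and says nothing about HINT thresholds or intelligibility.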
Pages: 480-492
Number of pages: 13