Speech recognition with altered spectral distribution of envelope cues

Cited by: 162
Authors
Shannon, RV [1]
Zeng, FG [1]
Wygonski, J [1]
Affiliation
[1] House Ear Inst, Los Angeles, CA 90057 USA
DOI
10.1121/1.423774
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Recognition of consonants, vowels, and sentences was measured in conditions of reduced spectral resolution and distorted spectral distribution of temporal envelope cues. Speech materials were processed through four bandpass filters (analysis bands), half-wave rectified, and low-pass filtered to extract the temporal envelope from each band. The envelope from each speech band modulated a band-limited noise (carrier bands). Analysis and carrier bands were manipulated independently to alter the spectral distribution of envelope cues. Experiment I demonstrated that the location of the cutoff frequencies defining the bands was not a critical parameter for speech recognition, as long as the analysis and carrier bands were matched in frequency extent. Experiment II demonstrated a dramatic decrease in performance when the analysis and carrier bands did not match in frequency extent, which resulted in a warping of the spectral distribution of envelope cues. Experiment III demonstrated a large decrease in performance when the carrier bands were shifted in frequency, mimicking the basal position of electrodes in a cochlear implant. Experiment IV showed a relatively minor effect of the overlap in the noise carrier bands, simulating the overlap in neural populations responding to adjacent electrodes in a cochlear implant. Overall, these results show that, for four bands, the frequency alignment of the analysis bands and carrier bands is critical for good performance, while the exact frequency divisions and overlap in carrier bands are not as critical. (C) 1998 Acoustical Society of America. [S0001-4966(98)02210-3].
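The processing chain described in the abstract (bandpass analysis, half-wave rectification, low-pass envelope extraction, and modulation of band-limited noise) can be illustrated with a minimal sketch. This is not the authors' implementation: the filter types (a second-order bandpass and a one-pole low-pass), the band edges, the 160 Hz envelope cutoff, and the names `biquad_bandpass`, `envelope`, and `noise_vocoder` are all assumptions chosen for illustration. Passing different edge lists for the analysis and carrier bands mimics the independent manipulation of the two band sets.

```python
import math
import random


def biquad_bandpass(x, fs, f_lo, f_hi):
    """Second-order band-pass filter (RBJ cookbook form, 0 dB peak gain).

    Assumption: Q is approximated as centre frequency / bandwidth.
    """
    f0 = math.sqrt(f_lo * f_hi)              # geometric centre frequency
    q = f0 / (f_hi - f_lo)
    w0 = 2.0 * math.pi * f0 / fs
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha
    b0, b2 = alpha / a0, -alpha / a0         # b1 is zero for this filter
    a1, a2 = -2.0 * math.cos(w0) / a0, (1.0 - alpha) / a0
    y = []
    x1 = x2 = y1 = y2 = 0.0
    for xn in x:
        yn = b0 * xn + b2 * x2 - a1 * y1 - a2 * y2
        x2, x1 = x1, xn
        y2, y1 = y1, yn
        y.append(yn)
    return y


def envelope(x, fs, cutoff_hz):
    """Half-wave rectify, then smooth with a one-pole low-pass filter."""
    a = math.exp(-2.0 * math.pi * cutoff_hz / fs)
    env, state = [], 0.0
    for xn in x:
        state = (1.0 - a) * max(xn, 0.0) + a * state
        env.append(state)
    return env


def noise_vocoder(speech, fs, analysis_edges, carrier_edges,
                  env_cutoff_hz=160.0, seed=0):
    """Sum of noise bands, each modulated by one analysis-band envelope.

    analysis_edges and carrier_edges are lists of N+1 cutoff frequencies
    defining N bands; they may differ to warp or shift the envelope cues.
    The default envelope cutoff is an assumed value.
    """
    rng = random.Random(seed)
    noise = [rng.gauss(0.0, 1.0) for _ in speech]
    out = [0.0] * len(speech)
    analysis = zip(analysis_edges[:-1], analysis_edges[1:])
    carrier = zip(carrier_edges[:-1], carrier_edges[1:])
    for (a_lo, a_hi), (c_lo, c_hi) in zip(analysis, carrier):
        env = envelope(biquad_bandpass(speech, fs, a_lo, a_hi),
                       fs, env_cutoff_hz)
        band_noise = biquad_bandpass(noise, fs, c_lo, c_hi)
        for i, (e, n) in enumerate(zip(env, band_noise)):
            out[i] += e * n
    return out


if __name__ == "__main__":
    fs = 16000
    # Stand-in for speech: a 1 kHz tone with slow amplitude modulation.
    signal = [math.sin(2 * math.pi * 1000 * n / fs)
              * (0.5 + 0.5 * math.sin(2 * math.pi * 4 * n / fs))
              for n in range(1600)]
    edges = [100.0, 800.0, 1500.0, 2500.0, 4000.0]         # hypothetical
    shifted = [200.0, 1200.0, 2200.0, 3500.0, 6000.0]      # hypothetical
    matched_out = noise_vocoder(signal, fs, edges, edges)
    shifted_out = noise_vocoder(signal, fs, edges, shifted)
    print(len(matched_out), len(shifted_out))
```

In this sketch, the matched condition of Experiment I corresponds to passing the same edge list twice, while the mismatched and shifted conditions of Experiments II and III correspond to a carrier edge list that differs from the analysis edge list. Steeper filters would be needed to control band overlap as in Experiment IV.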
Pages: 2467-2476
Number of pages: 10