Spectral and temporal cues in cochlear implant speech perception

被引:118
作者
Nie, K
Barco, A
Zeng, FG
机构
[1] Univ Calif Irvine, Dept Biomed Engn & Otolaryngol, Hearing & Speech Res Lab, Irvine, CA USA
[2] Univ Calif Irvine, Dept Head & Neck Surg, Irvine, CA USA
[3] MED EL Corp, Durham, NC USA
关键词
D O I
10.1097/01.aud.0000202312.31837.25
中图分类号
R36 [病理学]; R76 [耳鼻咽喉科学];
学科分类号
100104 ; 100213 ;
摘要
Objective: Taking advantage of the flexibility in the number of stimulating electrodes and the stimulation rate in a modern cochlear implant, the present study evaluated relative contributions of spectral and temporal cues to cochlear implant speech perception. Design: Four experiments were conducted by using a Research Interface Box in five MED-EL COMBI 40+ cochlear implant users. Experiment I varied the number of electrodes from four to twelve or the maximal number of available active electrodes while keeping a constant stimulation rate at 1000 Hz per electrode. Experiment 2 varied the stimulation rate from 1000 to 4000 Hz per electrode on four pairs of fixed electrodes. Experiment 3 covaried the number of stimulating electrodes and the stimulation rate to study the trade-off between spectral and temporal cues. Experiment 4 studied the effects of envelope extraction on speech perception and listening preference, including half-wave rectification, full-wave rectification, and the Hilbert transform. Vowels, consonants, and HINT sentences in quiet, as well as with a competing female voice served as test materials. Results: Experiment 1 found significant improvement in all speech tests with a higher number of stimulating electrodes. Experiment 2 found a significant advantage of the high stimulation rate only on consonant recognition and sentence recognition in noise. Experiment 3 found an almost linear tradeoff between the number of stimulation electrodes and the stimulation rate for consonant and sentence recognition in quiet, but not for vowel and sentence recognition in noise. Experiment 4 found significantly better performance with the Hilbert transform and the full-wave rectification than the half-wave rectification. In addition, envelope extraction with the Hilbert transform produced the highest rating on subjective judgment of sound quality. Conclusions: Consistent with previous studies, the present result from the five MED-EL subjects showed that (1) the temporal envelope cues from a limited number of channels are sufficient to support high levels of phoneme and sentence recognition in quiet but not for speech recognition in a competing voice, (2) consonant recognition relies more on temporal cues while vowel recognition relies more on spectral cues, (3) spectral and temporal cues can be traded to some degree to produce similar performance in cochlear implant speech recognition, and (4) the Hilbert envelope improves both speech intelligibility and quality in cochlear implants.
引用
收藏
页码:208 / 217
页数:10
相关论文
共 42 条
[1]  
Anderson Ilona, 2002, Ear Nose Throat J, V81, P229
[2]  
BRILL SM, 1997, AM J OTOL, V18, P104
[3]   Channel interactions with high-rate biphasic electrical stimulation in cochlear implant subjects [J].
de Balthasar, C ;
Boëx, C ;
Cosendai, G ;
Valentini, G ;
Sigrist, A ;
Pelizzone, M .
HEARING RESEARCH, 2003, 182 (1-2) :77-87
[4]   The identification of speech in noise by cochlear implant patients and normal-hearing listeners using 6-channel signal processors [J].
Dorman, MF ;
Loizou, PC ;
Fitzke, J .
EAR AND HEARING, 1998, 19 (06) :481-484
[5]   The recognition of sentences in noise by normal-hearing listeners using simulations of cochlear-implant signal processors with 6-20 channels [J].
Dorman, MF ;
Loizou, PC ;
Fitzke, J ;
Tu, ZM .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1998, 104 (06) :3583-3585
[6]   Speech recognition as a function of the number of electrodes used in the SPEAK cochlear implant speech processor [J].
Fishman, KE ;
Shannon, RV ;
Slattery, WH .
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 1997, 40 (05) :1201-1215
[7]   Speech recognition in noise as a function of the number of spectral channels: Comparison of acoustic hearing and cochlear implants [J].
Friesen, LM ;
Shannon, RV ;
Baskent, D ;
Wang, X .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2001, 110 (02) :1150-1163
[8]   Perceptual learning following changes in the frequency-to-electrode assignment with the Nucleus-22 cochlear implant [J].
Fu, QJ ;
Shannon, RV ;
Galvin, JJ .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2002, 112 (04) :1664-1674
[9]   Phoneme recognition by cochlear implant users as a function of signal-to-noise ratio and nonlinear amplitude mapping [J].
Fu, QJ ;
Shannon, RV .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1999, 106 (02) :L18-L23
[10]  
Garnham Carolyn, 2002, Ear and Hearing, V23, P540, DOI 10.1097/00003446-200212000-00005