Spectral and temporal cues to pitch in noise-excited vocoder simulations of continuous-interleaved-sampling cochlear implants

Cited by: 71
Authors
Green, T [1]
Faulkner, A [1]
Rosen, S [1]
Affiliations
[1] UCL, Dept Phonet & Linguist, London NW1 2HE, England
DOI
10.1121/1.1506688
Chinese Library Classification (CLC) number
O42 [Acoustics]
Discipline classification codes
070206; 082403
Abstract
Four-band and single-band noise-excited vocoders were used in acoustic simulations to investigate spectral and temporal cues to melodic pitch in the output of a cochlear implant speech processor. Noise carriers were modulated by amplitude envelopes extracted by half-wave rectification and low-pass filtering at 32 or 400 Hz. The four-band, but not the single-band processors, may preserve spectral correlates of fundamental frequency (F0). Envelope smoothing at 400 Hz preserves temporal correlates of F0, which are eliminated with 32-Hz smoothing. Inputs to the processors were sawtooth frequency glides, in which spectral variation is completely determined by F0, or synthetic diphthongal vowel glides, whose spectral shape is dominated by varying formant resonances. Normal listeners labeled the direction of pitch movement of the processed stimuli. For processed sawtooth waves, purely temporal cues led to decreasing performance with increasing F0. With purely spectral cues, performance was above chance despite the limited spectral resolution of the processors. For processed diphthongs, performance with purely spectral cues was at chance, showing that spectral envelope changes due to formant movement obscured spectral cues to F0. Performance with temporal cues was poorer for diphthongs than for sawtooths, with very limited discrimination at higher F0. These data suggest that, for speech signals through a typical cochlear implant processor, spectral cues to pitch are likely to have limited utility, while temporal envelope cues may be useful only at low F0. © 2002 Acoustical Society of America.
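The envelope-extraction step described in the abstract (half-wave rectification, low-pass smoothing at 32 or 400 Hz, then modulation of a noise carrier) can be illustrated with a minimal single-band sketch. This is not the authors' implementation: the sampling rate, the 150 Hz test F0, the cascade of four one-pole filters, the uniform-noise carrier, and all function names are assumptions chosen for illustration, and the band-pass analysis and output filters of a real vocoder are omitted.

```python
import math
import random

def sawtooth(f0, dur, fs):
    """Sawtooth wave in [-1, 1) with fundamental frequency f0 (Hz)."""
    n = int(dur * fs)
    return [2.0 * ((f0 * i / fs) % 1.0) - 1.0 for i in range(n)]

def one_pole_lowpass(x, cutoff, fs):
    """First-order recursive low-pass filter (assumed filter type)."""
    a = math.exp(-2.0 * math.pi * cutoff / fs)
    y, out = 0.0, []
    for v in x:
        y = (1.0 - a) * v + a * y
        out.append(y)
    return out

def envelope(x, cutoff, fs, passes=4):
    """Half-wave rectify, then smooth with `passes` cascaded one-pole filters.
    The filter order is an assumption; the abstract does not state it."""
    env = [max(v, 0.0) for v in x]
    for _ in range(passes):
        env = one_pole_lowpass(env, cutoff, fs)
    return env

def modulation_depth(env, fs, skip=0.1):
    """(max - min) / (max + min) of the envelope after a settling time of `skip` s."""
    tail = env[int(skip * fs):]
    hi, lo = max(tail), min(tail)
    return (hi - lo) / (hi + lo)

def vocode_single_band(x, cutoff, fs, seed=0):
    """Modulate a uniform white-noise carrier with the extracted envelope."""
    rng = random.Random(seed)
    return [e * rng.uniform(-1.0, 1.0) for e in envelope(x, cutoff, fs)]

fs = 16000                       # assumed sampling rate (Hz)
tone = sawtooth(150.0, 0.5, fs)  # assumed steady 150 Hz input
depth_400 = modulation_depth(envelope(tone, 400.0, fs), fs)
depth_32 = modulation_depth(envelope(tone, 32.0, fs), fs)
print("F0-rate modulation depth, 400 Hz smoothing:", round(depth_400, 3))
print("F0-rate modulation depth, 32 Hz smoothing:", round(depth_32, 3))
```

Under these assumptions, the 400 Hz envelope keeps a deep fluctuation at the F0 rate, while the 32 Hz envelope is nearly flat, which is the contrast between preserved and eliminated temporal correlates of F0 that the abstract describes.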
Pages: 2155-2164
Number of pages: 10
Related papers
35 items in total
[1]   INTONATION AND SPEAKER IDENTIFICATION [J].
ABBERTON, E ;
FOURCIN, AJ .
LANGUAGE AND SPEECH, 1978, 21 (OCT-) :305-318
[2]   PLAYED-AGAIN SAM - FURTHER OBSERVATIONS ON THE PITCH OF AMPLITUDE-MODULATED NOISE [J].
BURNS, EM ;
VIEMEISTER, NF .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1981, 70 (06) :1655-1660
[3]   NON-SPECTRAL PITCH [J].
BURNS, EM ;
VIEMEISTER, NF .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1976, 60 (04) :863-869
[4]   THE PERCEPTION OF TEMPORAL MODULATIONS BY COCHLEAR IMPLANT PATIENTS [J].
BUSBY, PA ;
TONG, YC ;
CLARK, GM .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1993, 94 (01) :124-131
[5]   DEMANY L, 1993, MUSIC PERCEPT, V11, P1
[6]   Speech intelligibility as a function of the number of channels of stimulation for signal processors using sine-wave and noise-band outputs [J].
Dorman, MF ;
Loizou, PC ;
Rainey, D .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 102 (04) :2403-2411
[7]   Effects of the salience of pitch and periodicity information on the intelligibility of four-channel vocoded speech: Implications for cochlear implants [J].
Faulkner, A ;
Rosen, S ;
Smith, C .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 2000, 108 (04) :1877-1887
[8]   A CROSS-LANGUAGE STUDY OF PROSODIC MODIFICATIONS IN MOTHERS AND FATHERS SPEECH TO PREVERBAL INFANTS [J].
FERNALD, A ;
TAESCHNER, T ;
DUNN, J ;
PAPOUSEK, M ;
DEBOYSSONBARDIES, B ;
FUKUI, I .
JOURNAL OF CHILD LANGUAGE, 1989, 16 (03) :477-501
[9]   Speech recognition as a function of the number of electrodes used in the SPEAK cochlear implant speech processor [J].
Fishman, KE ;
Shannon, RV ;
Slattery, WH .
JOURNAL OF SPEECH LANGUAGE AND HEARING RESEARCH, 1997, 40 (05) :1201-1215
[10]   FOURCIN A, 1984, ARCH OTOLARYNGOL, V110, P145