Speech recognition with altered spectral distribution of envelope cues

Cited by: 162
Authors
Shannon, RV [1 ]
Zeng, FG [1 ]
Wygonski, J [1 ]
Affiliation
[1] House Ear Inst, Los Angeles, CA 90057 USA
DOI
10.1121/1.423774
Chinese Library Classification number
O42 [Acoustics];
Discipline classification codes
070206 ; 082403 ;
Abstract
Recognition of consonants, vowels, and sentences was measured in conditions of reduced spectral resolution and distorted spectral distribution of temporal envelope cues. Speech materials were processed through four bandpass filters (analysis bands), half-wave rectified, and low-pass filtered to extract the temporal envelope from each band. The envelope from each speech band modulated a band-limited noise (carrier bands). Analysis and carrier bands were manipulated independently to alter the spectral distribution of envelope cues. Experiment I demonstrated that the location of the cutoff frequencies defining the bands was not a critical parameter for speech recognition, as long as the analysis and carrier bands were matched in frequency extent. Experiment II demonstrated a dramatic decrease in performance when the analysis and carrier bands did not match in frequency extent, which resulted in a warping of the spectral distribution of envelope cues. Experiment III demonstrated a large decrease in performance when the carrier bands were shifted in frequency, mimicking the basal position of electrodes in a cochlear implant. And experiment IV showed a relatively minor effect of the overlap in the noise carrier bands, simulating the overlap in neural populations responding to adjacent electrodes in a cochlear implant. Overall, these results show that, for four bands, the frequency alignment of the analysis bands and carrier bands is critical for good performance, while the exact frequency divisions and overlap in carrier bands are not as critical. (C) 1998 Acoustical Society of America. [S0001-4966(98)02210-3].
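The processing chain described in the abstract (band-pass analysis, half-wave rectification, low-pass envelope extraction, then modulation of band-limited noise) can be sketched roughly as follows. This is only an illustrative sketch, not the authors' implementation: the band edges, the 160 Hz envelope cutoff, the second-order filter design, and all function names (`bandpass`, `envelope`, `noise_vocoder`) are assumptions chosen for the example and are not taken from the record above.

```python
import math
import random

def bandpass(x, lo, hi, fs):
    """Second-order band-pass biquad (assumed design, not the paper's filters)."""
    f0 = math.sqrt(lo * hi)                  # geometric centre frequency
    q = f0 / (hi - lo)                       # quality factor from bandwidth
    w0 = 2 * math.pi * f0 / fs
    alpha = math.sin(w0) / (2 * q)
    b0, b2 = alpha, -alpha                   # b1 is zero for this filter type
    a0, a1, a2 = 1 + alpha, -2 * math.cos(w0), 1 - alpha
    y, x1, x2, y1, y2 = [], 0.0, 0.0, 0.0, 0.0
    for xn in x:
        yn = (b0 * xn + b2 * x2 - a1 * y1 - a2 * y2) / a0
        x2, x1 = x1, xn
        y2, y1 = y1, yn
        y.append(yn)
    return y

def envelope(x, cutoff, fs):
    """Half-wave rectify, then smooth with a one-pole low-pass filter."""
    a = math.exp(-2 * math.pi * cutoff / fs)
    env, state = [], 0.0
    for xn in x:
        state = (1 - a) * max(xn, 0.0) + a * state
        env.append(state)
    return env

def noise_vocoder(signal, fs, analysis_bands, carrier_bands,
                  env_cutoff=160.0, seed=0):
    """Sum of noise carriers, each modulated by one analysis-band envelope.

    analysis_bands and carrier_bands are lists of (low_hz, high_hz) pairs;
    passing different lists alters the spectral placement of the envelopes.
    """
    rng = random.Random(seed)
    out = [0.0] * len(signal)
    for (a_lo, a_hi), (c_lo, c_hi) in zip(analysis_bands, carrier_bands):
        env = envelope(bandpass(signal, a_lo, a_hi, fs), env_cutoff, fs)
        noise = [rng.uniform(-1.0, 1.0) for _ in signal]
        carrier = bandpass(noise, c_lo, c_hi, fs)
        for i, (e, c) in enumerate(zip(env, carrier)):
            out[i] += e * c
    return out

# Hypothetical four-band layout; these edges are illustrative only.
fs = 16000
bands = [(100, 800), (800, 1500), (1500, 2500), (2500, 4000)]
tone = [math.sin(2 * math.pi * 1000 * n / fs) for n in range(1600)]

matched = noise_vocoder(tone, fs, bands, bands)
# Carrier bands moved upward in frequency, analysis bands unchanged.
shifted_bands = [(lo * 1.5, hi * 1.5) for lo, hi in bands]
shifted = noise_vocoder(tone, fs, bands, shifted_bands)
```

Keeping the analysis and carrier band lists as separate arguments mirrors the independent manipulation described in the abstract: matched lists correspond to the aligned condition, while mismatched or shifted carrier lists relocate the same envelope cues to other frequency regions.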
Pages: 2467-2476
Page count: 10