SPEECH RECOGNITION WITH PRIMARILY TEMPORAL CUES

被引:2232
作者
SHANNON, RV [1 ]
ZENG, FG [1 ]
KAMATH, V [1 ]
WYGONSKI, J [1 ]
EKELID, M [1 ]
机构
[1] HOUSE EAR RES INST, 2100 W 3RD ST, LOS ANGELES, CA 90057 USA
关键词
D O I
10.1126/science.270.5234.303
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Nearly perfect speech recognition was observed under conditions of greatly reduced spectral information. Temporal envelopes of speech were extracted from broad frequency bands and were used to modulate noises of the same bandwidths. This manipulation preserved temporal envelope cues in each band but restricted the listener to severely degraded information on the distribution of spectral energy. The identification of consonants, vowels, and words in simple sentences improved markedly as the number of bands increased; high speech recognition performance was obtained with only three bands of modulated noise. Thus, the presentation of a dynamic temporal pattern in only a few broad spectral regions is sufficient for the recognition of speech.
引用
收藏
页码:303 / 304
页数:2
相关论文
共 26 条
[1]   How Do Humans Process and Recognize Speech? [J].
Allen, Jont B. .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (04) :567-577
[2]   CONSONANT RECOGNITION AS A FUNCTION OF THE NUMBER OF CHANNELS OF STIMULATION BY PATIENTS WHO USE THE SYMBION COCHLEAR IMPLANT [J].
DORMAN, M ;
DANKOWSKI, K ;
MCCANDLESS, G ;
SMITH, L .
EAR AND HEARING, 1989, 10 (05) :288-291
[3]   ACOUSTIC CUES FOR CONSONANT IDENTIFICATION BY PATIENTS WHO USE THE INERAID COCHLEAR IMPLANT [J].
DORMAN, MF ;
SOLI, S ;
DANKOWSKI, K ;
SMITH, LM ;
PARKIN, J ;
MCCANDLESS, G .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 88 (05) :2074-2079
[4]   AUDITORY PHONETIC CATEGORIZATION WITH THE SYMBION MULTICHANNEL COCHLEAR IMPLANT [J].
DORMAN, MF ;
HANNLEY, MT ;
MCCANDLESS, GA ;
SMITH, LM .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1988, 84 (02) :501-510
[5]   THE CODING OF VOWEL IDENTITY BY PATIENTS WHO USE THE INERAID COCHLEAR IMPLANT [J].
DORMAN, MF ;
SMITH, L ;
SMITH, M ;
PARKIN, J .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1992, 92 (06) :3428-3431
[6]   SPEECH-DISCRIMINATION IN DEAF SUBJECTS WITH COCHLEAR IMPLANTS [J].
EDDINGTON, DK .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1980, 68 (03) :885-891
[7]   EFFECT OF CONSONANT-VOWEL RATIO MODIFICATION ON AMPLITUDE ENVELOPE CUES FOR CONSONANT RECOGNITION [J].
FREYMAN, RL ;
NERBONNE, GP ;
COTE, HA .
JOURNAL OF SPEECH AND HEARING RESEARCH, 1991, 34 (02) :415-426
[8]   SPEECH RECOGNITION AS A FUNCTION OF CHANNEL CAPACITY IN A DISCRETE SET OF CHANNELS [J].
HILL, FJ ;
MCRAE, LP ;
MCCLELLAN, RP .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1968, 44 (01) :13-+
[9]  
Licklider J.C.R., 1951, HDB EXPT PSYCHOL, P1040
[10]   EFFECTS OF DIFFERENTIATION, INTEGRATION, AND INFINITE PEAK CLIPPING UPON THE INTELLIGIBILITY OF SPEECH [J].
LICKLIDER, JCR ;
POLLACK, I .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1948, 20 (01) :42-51