SPEECH RECOGNITION WITH PRIMARILY TEMPORAL CUES

被引:2232
作者
SHANNON, RV [1 ]
ZENG, FG [1 ]
KAMATH, V [1 ]
WYGONSKI, J [1 ]
EKELID, M [1 ]
机构
[1] HOUSE EAR RES INST, 2100 W 3RD ST, LOS ANGELES, CA 90057 USA
关键词
D O I
10.1126/science.270.5234.303
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Nearly perfect speech recognition was observed under conditions of greatly reduced spectral information. Temporal envelopes of speech were extracted from broad frequency bands and were used to modulate noises of the same bandwidths. This manipulation preserved temporal envelope cues in each band but restricted the listener to severely degraded information on the distribution of spectral energy. The identification of consonants, vowels, and words in simple sentences improved markedly as the number of bands increased; high speech recognition performance was obtained with only three bands of modulated noise. Thus, the presentation of a dynamic temporal pattern in only a few broad spectral regions is sufficient for the recognition of speech.
引用
收藏
页码:303 / 304
页数:2
相关论文
共 26 条
[11]   AN ANALYSIS OF PERCEPTUAL CONFUSIONS AMONG SOME ENGLISH CONSONANTS [J].
MILLER, GA ;
NICELY, PE .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1955, 27 (02) :338-352
[12]   SPEECH-PERCEPTION WITHOUT TRADITIONAL SPEECH CUES [J].
REMEZ, RE ;
RUBIN, PE ;
PISONI, DB ;
CARRELL, TD .
SCIENCE, 1981, 212 (4497) :947-950
[13]   PROSODIC AND SEGMENTAL ASPECTS OF SPEECH-PERCEPTION WITH THE HOUSE 3M SINGLE-CHANNEL IMPLANT [J].
ROSEN, S ;
WALLIKER, J ;
BRIMACOMBE, JA ;
EDGERTON, BJ .
JOURNAL OF SPEECH AND HEARING RESEARCH, 1989, 32 (01) :93-111
[14]   TEMPORAL INFORMATION IN SPEECH - ACOUSTIC, AUDITORY AND LINGUISTIC ASPECTS [J].
ROSEN, S .
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY OF LONDON SERIES B-BIOLOGICAL SCIENCES, 1992, 336 (1278) :367-373
[15]   VOICE PITCH AS AN AID TO LIPREADING [J].
ROSEN, SM ;
FOURCIN, AJ ;
MOORE, BCJ .
NATURE, 1981, 291 (5811) :150-152
[16]   VOCODERS - ANALYSIS AND SYNTHESIS OF SPEECH [J].
SCHROEDE.MR .
PROCEEDINGS OF THE INSTITUTE OF ELECTRICAL AND ELECTRONICS ENGINEERS, 1966, 54 (05) :720-&
[17]   TEMPORAL-MODULATION TRANSFER-FUNCTIONS IN PATIENTS WITH COCHLEAR IMPLANTS [J].
SHANNON, RV .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1992, 91 (04) :2156-2164
[19]   MULTICHANNEL ELECTRICAL-STIMULATION OF THE AUDITORY-NERVE IN MAN .1. BASIC PSYCHOPHYSICS [J].
SHANNON, RV .
HEARING RESEARCH, 1983, 11 (02) :157-189
[20]  
SHANNON RV, 1992, AUDITORY PROCESSING, P263