Accurate consonant perception without mid-frequency speech energy

被引:78
作者
Lippmann, RP
机构
[1] Lincoln Laboratory Massachsetts, Institute of Technology, Lexington
来源
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING | 1996年 / 4卷 / 01期
关键词
D O I
10.1109/TSA.1996.481454
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
The intelligibility of consonants remains high (roughly 90% correct) for untrained human listeners when speech energy in the mid-frequencies (800 to 4 kHz) is filtered out of random CVC nonsense syllables using sharp highpass and lowpass filters. These results suggest that humans use a process for speech recognition that is fundamentally different from the types of template matching performed in modern hidden Markov model speech recognition systems. Such recognizers are extremely sensitive to channel variability, filtering, and noise and require careful preprocessing and microphone placement to provide acceptable performance. Humans are able to achieve extremely accurate consonant recognition accuracy with almost no training under this highly unnatural condition using high-frequency speech cues that are normally not provided at the input to speech recognizers.
引用
收藏
页码:66 / 69
页数:4
相关论文
共 12 条
[1]   How Do Humans Process and Recognize Speech? [J].
Allen, Jont B. .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (04) :567-577
[2]  
BERLIN C I, 1985, Seminars in Hearing, V6, P389, DOI 10.1055/s-0028-1092017
[3]  
BRAIDA L, 1979, ASHA MONOGRAPHS, V19
[4]   FACTORS GOVERNING THE INTELLIGIBILITY OF SPEECH SOUNDS [J].
FRENCH, NR ;
STEINBERG, JC .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1947, 19 (01) :90-119
[5]   METHODS FOR CALCULATION AND USE OF ARTICULATION INDEX [J].
KRYTER, KD .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1962, 34 (11) :1689-&
[6]   VALIDATION OF ARTICULATION INDEX [J].
KRYTER, KD .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1962, 34 (11) :1698-&
[7]   STUDY OF MULTICHANNEL AMPLITUDE COMPRESSION AND LINEAR AMPLIFICATION FOR PERSONS WITH SENSORINEURAL HEARING-LOSS [J].
LIPPMANN, RP ;
BRAIDA, LD ;
DURLACH, NI .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1981, 69 (02) :524-534
[8]   AN ANALYSIS OF PERCEPTUAL CONFUSIONS AMONG SOME ENGLISH CONSONANTS [J].
MILLER, GA ;
NICELY, PE .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1955, 27 (02) :338-352
[9]  
MILNER P, 1984, ARTICULATION TESTING
[10]  
MORENO P, 1994, P IEEE ICASSP