On the number of channels needed to understand speech

被引:157
作者
Loizou, PC [1 ]
Dorman, M
Tu, ZM
机构
[1] Univ Texas Dallas, Dept Elect Engn, Richardson, TX 75083 USA
[2] Arizona State Univ, Dept Speech & Hearing Sci, Tempe, AZ 85287 USA
[3] Univ Arkansas, Dept Appl Sci, Little Rock, AR 72204 USA
关键词
D O I
10.1121/1.427954
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Recent studies have shown that high levels of speech understanding could be achieved when the speech spectrum was divided into four channels and then reconstructed as a sum of four noise bands or sine waves with frequencies equal to the center frequencies of the channels. In these studies speech understanding was assessed using sentences produced by a single male talker. The aim of experiment 1 was to assess the number of channels necessary for a high level of speech understanding when sentences were produced by multiple talkers. In experiment 1, sentences produced by 135 different talkers were processed through n (2 less than or equal to n less than or equal to 16) number of channels, synthesized as a sum of n sine waves with frequencies equal to the center frequencies of the filters, and presented to normal-hearing listeners for identification. A minimum of live channels was needed to achieve a high level (90%) of speech understanding. Asymptotic performance was achieved with eight channels, at least for the speech material used in this study. The outcome of experiment 1 demonstrated that the number of channels needed to reach asymptotic performance varies as a function of the recognition task and/or need for listeners to attend to fine phonetic detail. In experiment 2, sentences were processed through 6 and 16 channels and quantized into a small number of steps. The purpose of this experiment was to investigate whether listeners use across-channel differences in amplitude to code frequency information, particularly when speech is processed through a small number of channels. For sentences processed through six channels there was a significant reduction in speech understanding when the spectral amplitudes were quantized into a small number (<8) of steps. High levels (92%) of speech understanding were maintained for sentences processed through 16 channels and quantized into only 2 steps. The findings of experiment 2 suggest an inverse relationship between the importance of spectral amplitude resolution (number of steps) and spectral resolution (number of channels). (C) 1999 Acoustical Society of America. [S0001-4966(99)01810-X].
引用
收藏
页码:2097 / 2103
页数:7
相关论文
共 28 条
[1]  
[Anonymous], 1977, HASKINS LAB STATUS R
[2]   2 LEFT-HEMISPHERE MECHANISMS IN SPEECH-PERCEPTION [J].
CUTTING, JE .
PERCEPTION & PSYCHOPHYSICS, 1974, 16 (03) :601-612
[3]   NATURALNESS AND DISTORTION IN SPEECH-PROCESSING DEVICES [J].
DAVID, EE .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1956, 28 (04) :586-589
[4]   AN EXPERIMENTAL STUDY OF THE ACOUSTIC DETERMINANTS OF VOWEL COLOR; OBSERVATIONS ON ONE-AND TWO-FORMANT VOWELS SYNTHESIZED FROM SPECTROGRAPHIC PATTERNS [J].
Delattre, Pierre ;
Liberman, Alvin M. ;
Cooper, Franklin S. ;
Gerstman, Louis J. .
WORD-JOURNAL OF THE INTERNATIONAL LINGUISTIC ASSOCIATION, 1952, 8 (03) :195-210
[5]   Speech intelligibility as a function of the number of channels of stimulation for signal processors using sine-wave and noise-band outputs [J].
Dorman, MF ;
Loizou, PC ;
Rainey, D .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1997, 102 (04) :2403-2411
[6]   TEMPORAL ENVELOPE AND FINE-STRUCTURE CUES FOR SPEECH-INTELLIGIBILITY [J].
DRULLMAN, R .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1995, 97 (01) :585-592
[7]   Remaking speech [J].
Dudley, H .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1939, 11 (02) :169-177
[8]  
Fant G., 1973, Speech sounds and features
[9]  
Flanagan J. L., 1972, Speech Analysis Synthesis and Perception
[10]   Effects of noise and spectral resolution on vowel and consonant recognition: Acoustic and electric hearing [J].
Fu, QJ ;
Shannon, RV ;
Wang, XS .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1998, 104 (06) :3586-3596