SPEECH CODING IN THE AUDITORY-NERVE .2. PROCESSING SCHEMES FOR VOWEL-LIKE SOUNDS

被引:67
作者
DELGUTTE, B
机构
[1] MASSACHUSETTS EYE & EAR INFIRM, EATON PEABODY LAB AUDITORY PHYSIOL, BOSTON, MA 02114 USA
[2] MIT, DEPT ELECT ENGN & COMP SCI, CAMBRIDGE, MA 02139 USA
[3] MIT, ELECTR RES LAB, CAMBRIDGE, MA 02139 USA
关键词
D O I
10.1121/1.390597
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
Several processing schemes by which phonetically important information for vowels can be extracted from responses of [cat] auditory-nerve fibers are analyzed. The schemes are based on power spectra of period histograms obtained in response to a set of nine 2-formant, steady-state, vowel-like stimuli presented at 60 and 75 dB SPL [sound pressure level]. One class of local filtering schemes, originally proposed by Young and Sachs, consists of analyzing response patterns by filters centered at the characteristic frequencies (CF) of the fibers, so that a tonotopically arranged measure of synchronized response can be obtained. Various schemes in this class differ in the characteristics of the filter. For a wide range of filter bandwidths, formant frequencies correspond approximately to the CF for which the response measure is maximal. If in addition, the bandwidths of the analyzing filters are made compatible with psychophysical measures of frequency selectivity, low-frequency harmonics of the stimulus fundamental are resolved in the output profile, so that fundamental frequency can also be estimated. In a 2nd class of processing schemes, a dominant response component is defined for each fiber from a 1/6 octave spectral representation of the response pattern, and the formant frequencies are estimated from the most frequent values of the dominant component in the ensemble of auditory-nerve fibers. The local filtering schemes and the dominant component schemes can be related to place and periodicity models of [human] auditory processing, respectively.
引用
收藏
页码:879 / 886
页数:8
相关论文
共 43 条
  • [1] Carlson R, 1982, REPRESENTATION SPEEC, P109
  • [2] CARLSON R, 1980, QPSR34 ROYAL I TECHN, P84
  • [3] Carlson R., 1975, Auditory analysis and perception of speech, P55, DOI DOI 10.1016/B978-0-12-248550-3.50008-8
  • [4] CHISTOVICH L. A., 1957, BIOFIZIKA [TRANSL], V2, P714
  • [5] COOPER FS, 1952, J ACOUST SOC AM, V37, P318
  • [6] SPEECH CODING IN THE AUDITORY-NERVE .5. VOWELS IN BACKGROUND-NOISE
    DELGUTTE, B
    KIANG, NYS
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1984, 75 (03) : 908 - 918
  • [7] SPEECH CODING IN THE AUDITORY-NERVE .1. VOWEL-LIKE SOUNDS
    DELGUTTE, B
    KIANG, NYS
    [J]. JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1984, 75 (03) : 866 - 878
  • [8] Delgutte B., 1982, REPRESENTATION SPEEC, P131
  • [9] Fant C.G., 1973, Speech Sounds and Features
  • [10] Flanagan J.L., 1972, SPEECH ANAL SYNTHESI