ACOUSTIC INVARIANCE IN SPEECH PRODUCTION - EVIDENCE FROM MEASUREMENTS OF THE SPECTRAL CHARACTERISTICS OF STOP CONSONANTS

被引:277
作者
BLUMSTEIN, SE [1 ]
STEVENS, KN [1 ]
机构
[1] MIT,ELECTR RES LAB,CAMBRIDGE,MA 02139
关键词
D O I
10.1121/1.383319
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
On the basis of theoretical considerations and the results of experiments with synthetic consonant vowel syllables, it has been hypothesized that the short time spectrum sampled at the onset of a stop consonant should exhibit gross properties that uniquely specify the consonantal place of articulation independent of the following vowel. The aim of this paper is to test this hypothesis by measuring the spectrum sampled at the onsets and offsets of a large number of consonant vowel (CV) and vowel consonant (VC) syllables containing both voiced and voiceless stops produced by several speakers. Templates were devised in an attempt to capture three classes of spectral shapes: diffuse-rising, diffuse-falling, and compact, corresponding to alveolar, labial, and velar consonants, respectively. Spectra were derived from the utterances by sampling at the consonantal release of CV syllables and at the implosion and burst release of VC syllables, and these spectra (smoothed by a linear prediction algorithm) were matched against the templates. It was found that about 85% of the spectra at initial consonant release and at final burst release were correctly classified by the templates, although there was some variability across vowel contexts. The spectra sampled at the implosion were not consistently classified. A preliminary examination of spectra sampled at the release of nasal consonants in CV syllables showed a somewhat lower accuracy of classification by the same templates. Overall, the results support an hypothesis that, in natural speech, the acoustic characteristics of stop consonants, specified in terms of the gross spectral shape sampled at the discontinuity in the acoustic signal, show invariant properties independent of the adjacent vowel or of the voicing characteristics of the consonant. The implication is that the auditory system is endowed with detectors that are sensitive to these kinds of gross spectral shapes, and that the existence of these detectors helps the infant to organize the sounds of speech into their natural classes. © 1979, American Association of Physics Teachers. All rights reserved.
引用
收藏
页码:1001 / 1017
页数:17
相关论文
共 44 条
  • [1] Blumstein S.E., Stevens K.N., Perceptual invariance and onset spectra for stop consonants in different vowel environments, (1979)
  • [2] Carney A.E., Widin G.P., Viemeister N.F., Noncategorical perception of stop consonants differing in VOT, J. Acoust. Soc. Am. 62, pp. 961-970, (1977)
  • [3] Chomsky N., Halle M., The Sound Pattern of English, (1968)
  • [4] Cooper F.S., Delattre P.C., Liberman A.M., Borst J.M., Gerstman L.J., Some experiments on the perception of synthetic speech sounds, J. Acoust. Soc. Am. 24, pp. 597-606, (1952)
  • [5] Cutting J., Rosner B., Categories and boundaries in speech and music, Percept. Psychophys. 16, pp. 564-570, (1974)
  • [6] Delattre P.C., Liberman A.M., Cooper F.S., Acoustic loci and transitional cues for consonants, J. Acoust. Soc. Am. 27, pp. 769-773, (1955)
  • [7] Delgutte B., Codage des changements rapides d intensity dans nerf auditive: experiences avec des sons pures, (1978)
  • [8] Eimas P.D., Siqueland E.R., Speech perception in infants, Science 171, pp. 303-306, (1971)
  • [9] Eimas P.D., Auditory and linguistic processing of cues for place of articulation by infants, Percept. Psychophys. 16, pp. 513-521, (1974)
  • [10] Fant G., Acoustic Theory of Speech Production, (1960)