Linear correlates in the speech signal: The orderly output constraint

Cited by: 83
Authors
Sussman, HM [1]
Fruchter, D
Hilbert, J
Sirosh, J
Affiliations
[1] Univ Texas, Dept Linguist & Commun Sci & Disorders, Austin, TX 78712 USA
[2] Univ Texas, Dept Linguist, Austin, TX 78712 USA
[3] Univ Texas, Dept Comp Sci, Austin, TX 78712 USA
[4] HNC Software Inc, San Diego, CA 92121 USA
Keywords
acoustic; linearity; locus equations; neuroethology; noninvariance; perception; phoneme; place of articulation; sound categories; speech signal
DOI
10.1017/S0140525X98001174
Chinese Library Classification number
B84 [Psychology]
Discipline classification code
04; 0402
Abstract
Neuroethological investigations of mammalian and avian auditory systems have documented species-specific specializations for processing complex acoustic signals that could, viewed in abstract terms, have an intriguing and striking relevance for human speech sound categorization and representation. Each species forms biologically relevant categories based on combinatorial analysis of information-bearing parameters within the complex input signal. This target article uses known neural models from the mustached bat and barn owl to develop, by analogy, a conceptualization of human processing of consonant plus vowel sequences that offers a partial solution to the noninvariance dilemma - the nontransparent relationship between the acoustic waveform and the phonetic segment. Critical input sound parameters used to establish species-specific categories in the mustached bat and barn owl exhibit high correlation and linearity due to physical laws. A cue long known to be relevant to the perception of stop place of articulation is the second formant (F2) transition. This article describes an empirical phenomenon - the locus equations - that describes the relationship between the F2 of a vowel and the F2 measured at the onset of a consonant-vowel (CV) transition. These variables, F2 onset and F2 vowel within a given place category, are consistently and robustly linearly correlated across diverse speakers and languages, and even under perturbation conditions as imposed by bite blocks. A functional role for this category-level extreme correlation and linearity (the "orderly output constraint") is hypothesized based on the notion of an evolutionarily conserved auditory-processing strategy. High correlation and linearity between critical parameters in the speech signal that help to cue place of articulation categories might have evolved to satisfy a preadaptation by mammalian auditory systems for representing tightly correlated, linearly related components of acoustic signals.
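The locus-equation relationship described in the abstract can be read as a straight-line fit of F2 onset against F2 vowel within one place category. The following is a minimal sketch of such a fit, not the authors' procedure: the function name `fit_locus_equation` and the frequency values are hypothetical, and the onset values are generated from an assumed line (slope 0.7, intercept 500 Hz) purely so the result is predictable.

```python
from statistics import mean


def fit_locus_equation(f2_vowel, f2_onset):
    """Least-squares fit of F2_onset = slope * F2_vowel + intercept.

    Returns (slope, intercept, r), where r is the Pearson correlation.
    """
    if len(f2_vowel) != len(f2_onset) or len(f2_vowel) < 2:
        raise ValueError("need two equal-length sequences with at least 2 points")
    mx, my = mean(f2_vowel), mean(f2_onset)
    sxx = sum((x - mx) ** 2 for x in f2_vowel)
    syy = sum((y - my) ** 2 for y in f2_onset)
    sxy = sum((x - mx) * (y - my) for x, y in zip(f2_vowel, f2_onset))
    if sxx == 0 or syy == 0:
        raise ValueError("no variation in one of the variables")
    slope = sxy / sxx
    intercept = my - slope * mx
    r = sxy / (sxx * syy) ** 0.5
    return slope, intercept, r


# Hypothetical F2 values in Hz for one place category (illustration only,
# not measured data). Onsets lie exactly on an assumed line.
f2_vowel = [800.0, 1200.0, 1600.0, 2000.0, 2400.0]
f2_onset = [0.7 * x + 500.0 for x in f2_vowel]

slope, intercept, r = fit_locus_equation(f2_vowel, f2_onset)
print(f"slope={slope:.3f} intercept={intercept:.1f} Hz r={r:.3f}")
```

With real measurements, one such fit would be computed per place category and per speaker, and the slope and intercept would then be compared across categories.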
Pages: 241 / +
Page count: 24