Speech perception as pattern recognition

被引:91
作者
Nearey, TM
机构
[1] Department of Linguistics, University of Alberta, Edmonton
关键词
D O I
10.1121/1.418290
中图分类号
O42 [声学];
学科分类号
070206 ; 082403 ;
摘要
This work provides theoretical and empirical arguments in favor of an approach to phonetics that is called double-weak. It is so called because it assumes relatively weak constraints both on the articulatory gestures and on the auditory patterns that map phonological elements. This approach views speech production and perception as distinct but cooperative systems. Like the motor theory of speech perception, double-weak theory accepts that phonological units are modified by context in ways that are important to perception. It further agrees that many aspects of such context dependency have their origin in natural articulatory processes. However, double-weak theory sides with proponents of auditory theories of phonetics by accepting that the real-time objects of perception are well-defined auditory patterns. Because speakers find ways to obey ''orderly output conditions'' (Sussman et al., 1995), listeners are able to successfully decode speech using relatively simple pattern-recognition mechanisms. It is suggested that this situation has arisen through a stylization of gestural patterns to accommodate real-time limits of the perceptual system. Results from a new perceptual experiment, involving a four-dimensional stimulus continuum and a 10-category/hVC/response set, are shown to be largely compatible with this framework. (C) 1997 Acoustical Society of America.
引用
收藏
页码:3241 / 3254
页数:14
相关论文
共 68 条
[1]   How Do Humans Process and Recognize Speech? [J].
Allen, Jont B. .
IEEE TRANSACTIONS ON SPEECH AND AUDIO PROCESSING, 1994, 2 (04) :567-577
[2]  
BLUMSTEIN S, 1986, INVARIANCE VARIABILI, P465
[3]   PERCEPTUAL INVARIANCE AND ONSET SPECTRA FOR STOP CONSONANTS IN DIFFERENT VOWEL ENVIRONMENTS [J].
BLUMSTEIN, SE ;
STEVENS, KN .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1980, 67 (02) :648-662
[4]   MULTIPLICATIVE MODELS AND COHORT ANALYSIS [J].
BRESLOW, NE ;
LUBIN, JH ;
MAREK, P ;
LANGHOLZ, B .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1983, 78 (381) :1-12
[5]  
Chomsky Noam., 1968, The sound pattern of English
[6]   NATIVE LANGUAGE FACTORS AFFECTING USE OF VOCALIC CUES TO FINAL CONSONANT VOICING IN ENGLISH [J].
CROWTHER, CS ;
MANN, V .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1992, 92 (02) :711-722
[7]  
Diehl R.L., 1989, ECOL PSYCHOL, V1, P121, DOI DOI 10.1207/S15326969ECO0102_2
[8]  
Fant G., 1960, ACOUSTIC THEORY SPEE
[9]  
Finney D.J., 1977, PROBIT ANAL, VIII
[10]   SPECTRAL AND DURATION PROPERTIES OF FRONT VOWELS AS CUES TO FINAL STOP-CONSONANT VOICING [J].
FISCHER, RM ;
OHDE, RN .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1990, 88 (03) :1250-1259