PERCEPTION OF SYNTHESIZED AUDIBLE AND VISIBLE SPEECH

被引:57
作者
MASSARO, DW
COHEN, MM
机构
关键词
D O I
10.1111/j.1467-9280.1990.tb00068.x
中图分类号
B84 [心理学];
学科分类号
04 ; 0402 ;
摘要
The research reported in this paper uses novel stimuli to study how speech perception is influenced by information presented to ear and eye. Auditory and visual sources of information (syllables) were synthesized and presented in isolation or in factorial combination. A five-step continuum between the syllables ibal and idal was synthesized along both auditory and visual dimensions, by varying properties of the syllable at its onset. The onsets of the second and third formants were manipulated in the audible speech. For the visible speech, the shape of the lips and the jaw position at the onset of the syllable were manipulated. Subjects' identification judgments of the test syllables presented on videotape were influenced by both auditory and visual information. The results were used to test between a fuzzy logical model of speech perception (FLMP) and a categorical model of perception (CMP). These tests indicate that evaluation and integration of the two sources of information makes available continuous as opposed to just categorical information. In addition, the integration of the two sources appears to be nonadditive in that the least ambiguous source has the largest impact on the judgment. The two sources of information appear to be evaluated, integrated, and identified as described by the FLMP–an optimal algorithm for combining information from multiple sources. The research provides a theoretical framework for understanding the improvement in speech perception by hearing-impaired listeners when auditory speech is supplemented with other sources of information. © 1990, Institution of Mechanical Engineers. All rights reserved.
引用
收藏
页码:55 / 63
页数:9
相关论文
共 26 条
[11]  
Massaro D. M., 1987, SPEECH PERCEPTION EA
[12]   HEARING LIPS AND SEEING VOICES [J].
MCGURK, H ;
MACDONALD, J .
NATURE, 1976, 264 (5588) :746-748
[13]   PHYSICAL CHARACTERISTICS OF THE LIPS UNDERLYING VOWEL LIPREADING PERFORMANCE [J].
MONTGOMERY, AA ;
JACKSON, PL .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1983, 73 (06) :2134-2144
[14]  
MONTGOMERY AA, 1980, J ACOUSTICAL SOC S1, V68, pS58
[15]  
*NAT RES COUNC, 1987, SPEECH UND AG REP
[16]  
PARKE FI, 1982, IEEE COMPUT GRAPH, V2, P61
[17]  
PARKE FI, 1974, UTECCSC75047 U UT TE
[18]  
PARKE FI, 1975, COMPUTERS GRAPHICS J, V1, P1
[19]  
PEARCE A, 1986, GRAPHICS INTERFACE 8, P136