Factors influencing audiovisual fission and fusion illusions

被引:159
作者
Andersen, TS [1 ]
Tiippana, K [1 ]
Sams, M [1 ]
机构
[1] Helsinki Univ Technol, Lab Comp Engn, Espoo 02015, Finland
来源
COGNITIVE BRAIN RESEARCH | 2004年 / 21卷 / 03期
基金
芬兰科学院;
关键词
multisensory illusions; discontinuity hypothesis; modality appropriateness; information reliability; directed attention; illusory flashes;
D O I
10.1016/j.cogbrainres.2004.06.004
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Information processing in auditory and visual modalities interacts in many circumstances. Spatially and temporally coincident acoustic and visual information are often bound together to form multisensory percepts [B.E. Stein, M.A. Meredith, The Merging of the Senses, A Bradford Book, Cambridge, MA, (1993), 211 pp.; Psychol. Bull. 88 (1980) 638]. Shams et al. recently reported a multisensory fission illusion where a single flash is perceived as two flashes when two rapid tone beeps are presented concurrently [Nature 408 (2000) 788; Cogn. Brain Res. 14 (2002) 147]. The absence of a fusion illusion, where two flashes would fuse to one when accompanied by one beep, indicated a perceptual rather than cognitive nature of the illusion. Here we report both fusion and fission illusions using stimuli very similar to those used by Shams et al. By instructing subjects to count beeps rather than flashes and decreasing the sound intensity to near threshold, we also created a corresponding visually induced auditory illusion. We discuss our results in light of four hypotheses of multisensory integration, each advocating a condition for modality dominance. According to the discontinuity hypothesis [Cogn. Brain Res. 14 (2002) 147], the modality in which stimulation is discontinuous dominates. The modality appropriateness hypothesis [Psychol. Bull. 88 (1980) 638] states that the modality more appropriate for the task at hand dominates. The information reliability hypothesis [J.-L. Schwartz, J. Robert-Ribes, P. Escudier, Ten years after Summerfield: a taxonomy of models for audio-visual fusion in speech perception. In: R. Campbell (Ed.), Hearing by Eye: The Psychology of Lipreading, Lawrence Earlbaum Associates, Hove, UK, (1998), pp. 3-51] claims that the modality providing more reliable information dominates. In strong forms, none of these three hypotheses applies to our data. We re-state the hypotheses in weak forms so that discontinuity, modality appropriateness and information reliability are factors which increase a modality's tendency to dominate. All these factors are important in explaining our data. Finally, we interpret the effect of instructions in light of the directed attention hypothesis which states that the attended modality is dominant [Psychol. Bull. 88 (1980) 638]. (C) 2004 Elsevier B.V. All rights reserved.
引用
收藏
页码:301 / 308
页数:8
相关论文
共 16 条
[1]  
Andersen T. S., 2001, P 4 INT ESCA ETRW C, P172
[2]  
[Anonymous], 2011, Categorical data analysis
[3]   EFFECTS OF PHONETIC CONTEXT ON AUDIOVISUAL INTELLIGIBILITY OF FRENCH [J].
BENOI, C ;
MOHAMADI, T ;
KANDEL, S .
JOURNAL OF SPEECH AND HEARING RESEARCH, 1994, 37 (05) :1195-1203
[4]   INTEGRATING SPEECH INFORMATION ACROSS TALKERS, GENDER, AND SENSORY MODALITY - FEMALE FACES AND MALE VOICES IN THE MCGURK EFFECT [J].
GREEN, KP ;
KUHL, PK ;
MELTZOFF, AN ;
STEVENS, EB .
PERCEPTION & PSYCHOPHYSICS, 1991, 50 (06) :524-536
[5]  
Lumley T., 2003, RMETA PACKAGE
[6]   VISUAL INFLUENCES ON SPEECH-PERCEPTION PROCESSES [J].
MACDONALD, J ;
MCGURK, H .
PERCEPTION & PSYCHOPHYSICS, 1978, 24 (03) :253-257
[7]  
MACLEOD A, 1987, British Journal of Audiology, V21, P131, DOI 10.3109/03005368709077786
[8]   HEARING LIPS AND SEEING VOICES [J].
MCGURK, H ;
MACDONALD, J .
NATURE, 1976, 264 (5588) :746-748
[9]   McGurk effect in Finnish syllables, isolated words, and words in sentences:: Effects of word meaning and sentence context [J].
Sams, M ;
Manninen, P ;
Surakka, V ;
Helin, P ;
Kättö, R .
SPEECH COMMUNICATION, 1998, 26 (1-2) :75-87
[10]  
Schwartz J.-L., 1998, HEARING EYE PSYCHOL, P3