Multisensory integration sites identified by perception of spatial wavelet filtered visual speech gesture information

被引:81
作者
Callan, DE
Jones, JA
Munhall, K
Kroos, C
Callan, AM
Vatikiotis-Bateson, E
机构
[1] ATR Int, Kyoto, Japan
[2] Wilfrid Laurier Univ, Waterloo, ON N2L 3C5, Canada
[3] Queens Univ, Kingston, ON K7L 3N6, Canada
[4] Univ Munich, D-80539 Munich, Germany
[5] Univ British Columbia, Vancouver, BC V5Z 1M9, Canada
关键词
D O I
10.1162/089892904970771
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Perception of speech is improved when presentation of the audio signal is accompanied by concordant visual speech gesture information. This enhancement is most prevalent when the audio signal is degraded. One potential means by which the brain affords perceptual enhancement is thought to be through the integration of concordant information from multiple sensory channels in a common site of convergence, multisensory integration (MSI) sites. Some studies have identified potential sites in the superior temporal gyrus/sulcus (STG/S) that are responsive to multisensory information from the auditory speech signal and visual speech movement. One limitation of these studies is that they do not control for activity resulting from attentional modulation cued by such things as visual information signaling the onsets and offsets of the acoustic speech signal, as well as activity resulting from MSI of properties of the auditory speech signal with aspects of gross visual motion that are not specific to place of articulation information. This fMRI experiment uses spatial wavelet band-pass filtered Japanese sentences presented with background multispeaker audio noise to discern brain activity reflecting MSI induced by auditory and visual correspondence of place of articulation information that controls for activity resulting from the above-mentioned factors. The experiment consists of a low-frequency (LF) filtered condition containing gross visual motion of the lips, jaw, and head without specific place of articulation information, a midfrequency (MF) filtered condition containing place of articulation information, and an unfiltered (UF) condition. Sites of MSI selectively induced by auditory and visual correspondence of place of articulation information were determined by the presence of activity for both the MF and UF conditions relative to the LF condition. Based on these criteria, sites of MSI were found predominantly in the left middle temporal gyrus (MTG), and the left STG/S ( including the auditory cortex). By controlling for additional factors that could also induce greater activity resulting from visual motion information, this study identifies potential MSI sites that we believe are involved with improved speech perception intelligibility.
引用
收藏
页码:805 / 816
页数:12
相关论文
共 59 条
[1]   Social perception from visual cues: role of the STS region [J].
Allison, T ;
Puce, A ;
McCarthy, G .
TRENDS IN COGNITIVE SCIENCES, 2000, 4 (07) :267-278
[2]  
[Anonymous], [No title captured]
[3]  
[Anonymous], 2000, Brain mapping: The systems, DOI 10.1016/b978-012692545-6/50014-3
[4]   The functional anatomy of visual-tactile integration in man: a study using positron emission tomography [J].
Banati, RB ;
Goerres, GW ;
Tjoa, C ;
Aggleton, JP ;
Grasby, P .
NEUROPSYCHOLOGIA, 2000, 38 (02) :115-124
[5]   Visual speech perception without primary auditory cortex activation [J].
Bernstein, LE ;
Auer, ET ;
Moore, JK ;
Ponton, CW ;
Don, M ;
Singh, M .
NEUROREPORT, 2002, 13 (03) :311-315
[6]   Human temporal lobe activation by speech and nonspeech sounds [J].
Binder, JR ;
Frost, JA ;
Hammeke, TA ;
Bellgowan, PSF ;
Springer, JA ;
Kaufman, JN ;
Possing, ET .
CEREBRAL CORTEX, 2000, 10 (05) :512-528
[7]  
BUSHARA K, 2001, J NEUROSCI, V21, P200
[8]   Single-sweep EEG analysis of neural processes underlying perception and production of vowels [J].
Callan, DE ;
Callan, AM ;
Honda, K ;
Masaki, S .
COGNITIVE BRAIN RESEARCH, 2000, 10 (1-2) :173-176
[9]   Neural processes underlying perceptual enhancement by visual speech gestures [J].
Callan, DE ;
Jones, JA ;
Munhall, K ;
Callan, AM ;
Kroos, C ;
Vatikiotis-Bateson, E .
NEUROREPORT, 2003, 14 (17) :2213-2218
[10]   Multimodal contribution to speech perception revealed by independent component analysis: a single-sweep EEG case study [J].
Callan, DE ;
Callan, AM ;
Kroos, C ;
Vatikiotis-Bateson, E .
COGNITIVE BRAIN RESEARCH, 2001, 10 (03) :349-353