Who Is Saying "What"? Brain-Based Decoding of Human Voice and Speech

Cited by: 393
Authors
Formisano, Elia [1]
De Martino, Federico [1]
Bonte, Milene [1]
Goebel, Rainer [1]
Affiliations
[1] Univ Maastricht, Dept Cognit Neurosci, Fac Psychol & Neurosci, NL-6200 MD Maastricht, Netherlands
DOI
10.1126/science.1164318
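Chinese Library Classification
O [Mathematical sciences and chemistry]; P [Astronomy and earth sciences]; Q [Biological sciences]; N [General natural sciences]
Discipline classification codes
07; 0710; 09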
Abstract
Can we decipher speech content ("what" is being said) and speaker identity ("who" is saying it) from observations of brain activity of a listener? Here, we combine functional magnetic resonance imaging with a data-mining algorithm and retrieve what and whom a person is listening to from the neural fingerprints that speech and voice signals elicit in the listener's auditory cortex. These cortical fingerprints are spatially distributed and insensitive to acoustic variations of the input so as to permit the brain-based recognition of learned speech from unknown speakers and of learned voices from previously unheard utterances. Our findings unravel the detailed cortical layout and computational properties of the neural populations at the basis of human speech recognition and speaker identification.
Pages: 970-973
Number of pages: 4