Real-time decoding of question-and-answer speech dialogue using human cortical activity

被引:144
作者
Moses, David A. [1 ,2 ]
Leonard, Matthew K. [1 ,2 ]
Makin, Joseph G. [1 ,2 ]
Chang, Edward F. [1 ,2 ]
机构
[1] UC San Francisco, Dept Neurol Surg, 675 Nelson Rising Lane, San Francisco, CA 94158 USA
[2] UC San Francisco, Ctr Integrat Neurosci, 675 Nelson Rising Lane, San Francisco, CA 94158 USA
关键词
HUMAN SENSORIMOTOR CORTEX; BRAIN-COMPUTER INTERFACE; ERROR;
D O I
10.1038/s41467-019-10994-4
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Natural communication often occurs in dialogue, differentially engaging auditory and sensorimotor brain regions during listening and speaking. However, previous attempts to decode speech directly from the human brain typically consider listening or speaking tasks in isolation. Here, human participants listened to questions and responded aloud with answers while we used high-density electrocorticography (ECoG) recordings to detect when they heard or said an utterance and to then decode the utterance's identity. Because certain answers were only plausible responses to certain questions, we could dynamically update the prior probabilities of each answer using the decoded question likelihoods as context. We decode produced and perceived utterances with accuracy rates as high as 61% and 76%, respectively (chance is 7% and 20%). Contextual integration of decoded question likelihoods significantly improves answer decoding. These results demonstrate real-time decoding of speech in an interactive, conversational setting, which has important implications for patients who are unable to communicate.
引用
收藏
页数:14
相关论文
共 58 条
  • [1] [Anonymous], ICML WORKSH STAT MAC
  • [2] Bergstra J, 2013, INT C MACHINE LEARNI, P115
  • [3] Bergstra J, 2011, ADV NEURAL INFORM PR, P2546, DOI 10.5555/2986459.2986743
  • [4] Human temporal lobe activation by speech and nonspeech sounds
    Binder, JR
    Frost, JA
    Hammeke, TA
    Bellgowan, PSF
    Springer, JA
    Kaufman, JN
    Possing, ET
    [J]. CEREBRAL CORTEX, 2000, 10 (05) : 512 - 528
  • [5] Neuroperceptual differences in consonant and vowel discrimination: As revealed by direct cortical electrical interference
    Boatman, D
    Hall, C
    Goldstein, MH
    Lesser, R
    Gordon, B
    [J]. CORTEX, 1997, 33 (01) : 83 - 98
  • [6] Boersma P., 2001, GLOT INT, V5, P341
  • [7] Functional organization of human sensorimotor cortex for speech articulation
    Bouchard, Kristofer E.
    Mesgarani, Nima
    Johnson, Keith
    Chang, Edward F.
    [J]. NATURE, 2013, 495 (7441) : 327 - 332
  • [8] A survey on self-assessed well-being in a cohort of chronic locked-in syndrome patients: happy majority, miserable minority
    Bruno, Marie-Aurelie
    Bernheim, Jan L.
    Ledoux, Didier
    Pellas, Frederic
    Demertzi, Athena
    Laureys, Steven
    [J]. BMJ OPEN, 2011, 1 (01):
  • [9] Spatiotemporal dynamics of word processing in the human brain
    Canolty, Ryan T.
    Soltani, Maryam
    Dalal, Sarang S.
    Edwards, Erik
    Dronkers, Nina F.
    Nagarajan, Srikantan S.
    Kirsch, Heidi E.
    Barbaro, Nicholas M.
    Knight, Robert T.
    [J]. FRONTIERS IN NEUROSCIENCE, 2007, 1 (01): : 185 - 196
  • [10] Functional and Quantitative MRI Mapping of Somatomotor Representations of Human Supralaryngeal Vocal Tract
    Carey, Daniel
    Krishnan, Saloni
    Callaghan, Martina F.
    Sereno, Martin I.
    Dick, Frederic
    [J]. CEREBRAL CORTEX, 2017, 27 (01) : 265 - 278