Direct classification of all American English phonemes using signals from functional speech motor cortex

被引:136
作者
Mugler, Emily M. [1 ]
Patton, James L. [1 ]
Flint, Robert D. [2 ]
Wright, Zachary A. [2 ]
Schuele, Stephan U. [2 ]
Rosenow, Joshua [2 ]
Shih, Jerry J. [3 ]
Krusienski, Dean J. [4 ]
Slutzky, Marc W. [2 ]
机构
[1] Univ Illinois, Chicago, IL 60607 USA
[2] Northwestern Univ, Chicago, IL 60611 USA
[3] Mayo Clin, Jacksonville, FL 32224 USA
[4] Old Dominion Univ, Norfolk, VA 23529 USA
基金
美国国家科学基金会; 美国国家卫生研究院;
关键词
electrocorticography; speech production; phonemes; linear discriminant analysis; brain-computer interface; BRAIN-COMPUTER INTERFACES; MOVEMENT; COMMUNICATION; RESTORATION;
D O I
10.1088/1741-2560/11/3/035015
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
Objective. Although brain-computer interfaces (BCIs) can be used in several different ways to restore communication, communicative BCI has not approached the rate or efficiency of natural human speech. Electrocorticography (ECoG) has precise spatiotemporal resolution that enables recording of brain activity distributed over a wide area of cortex, such as during speech production. In this study, we sought to decode elements of speech production using ECoG. Approach. We investigated words that contain the entire set of phonemes in the general American accent using ECoG with four subjects. Using a linear classifier, we evaluated the degree to which individual phonemes within each word could be correctly identified from cortical signal. Main results. We classified phonemes with up to 36% accuracy when classifying all phonemes and up to 63% accuracy for a single phoneme. Further, misclassified phonemes follow articulation organization described in phonology literature, aiding classification of whole words. Precise temporal alignment to phoneme onset was crucial for classification success. Significance. We identified specific spatiotemporal features that aid classification, which could guide future applications. Word identification was equivalent to information transfer rates as high as 3.0 bits s(-1) (33.6 words min(-1)), supporting pursuit of speech articulation for BCI control.
引用
收藏
页数:8
相关论文
共 32 条
[1]   Movement related activity in the high gamma range of the human EEG [J].
Ball, Tonio ;
Demandt, Evariste ;
Mutschler, Isabella ;
Neitzel, Eva ;
Mehring, Carsten ;
Vogt, Klaus ;
Aertsen, Ad ;
Schulze-Bonhage, Andreas .
NEUROIMAGE, 2008, 41 (02) :302-310
[2]   A high-speed BCI based on code modulation VEP [J].
Bin, Guangyu ;
Gao, Xiaorong ;
Wang, Yijun ;
Li, Yun ;
Hong, Bo ;
Gao, Shangkai .
JOURNAL OF NEURAL ENGINEERING, 2011, 8 (02)
[3]   Brain-computer interfaces: communication and restoration of movement in paralysis [J].
Birbaumer, Niels ;
Cohen, Leonardo G. .
JOURNAL OF PHYSIOLOGY-LONDON, 2007, 579 (03) :621-636
[4]   Localization and classification of phonemes using high spatial resolution electrocorticography (ECoG) grids [J].
Blakely, Timothy ;
Miller, Kai J. ;
Rao, Rajesh P. N. ;
Holmes, Mark D. ;
Ojemann, Jeffrey G. .
2008 30TH ANNUAL INTERNATIONAL CONFERENCE OF THE IEEE ENGINEERING IN MEDICINE AND BIOLOGY SOCIETY, VOLS 1-8, 2008, :4964-+
[5]   Functional organization of human sensorimotor cortex for speech articulation [J].
Bouchard, Kristofer E. ;
Mesgarani, Nima ;
Johnson, Keith ;
Chang, Edward F. .
NATURE, 2013, 495 (7441) :327-332
[6]  
Brown A., 2013, The Encyclopedia of Applied Linguistics
[7]   Classification of intended phoneme production from chronic intracortical microelectrode recordings in speech-motor cortex [J].
Brumberg, Jonathan S. ;
Wright, E. Joe ;
Andreasen, Dinal S. ;
Guenther, Frank H. ;
Kennedy, Philip R. .
FRONTIERS IN NEUROSCIENCE, 2011, 5 :1-12
[8]   Accurate decoding of reaching movements from field potentials in the absence of spikes [J].
Flint, Robert D. ;
Lindberg, Eric W. ;
Jordan, Luke R. ;
Miller, Lee E. ;
Slutzky, Marc W. .
JOURNAL OF NEURAL ENGINEERING, 2012, 9 (04)
[9]   Local field potentials allow accurate decoding of muscle activity [J].
Flint, Robert D. ;
Ethier, Christian ;
Oby, Emily R. ;
Miller, Lee E. ;
Slutzky, Marc W. .
JOURNAL OF NEUROPHYSIOLOGY, 2012, 108 (01) :18-24
[10]   A Wireless Brain-Machine Interface for Real-Time Speech Synthesis [J].
Guenther, Frank H. ;
Brumberg, Jonathan S. ;
Wright, E. Joseph ;
Nieto-Castanon, Alfonso ;
Tourville, Jason A. ;
Panko, Mikhail ;
Law, Robert ;
Siebert, Steven A. ;
Bartels, Jess L. ;
Andreasen, Dinal S. ;
Ehirim, Princewill ;
Mao, Hui ;
Kennedy, Philip R. .
PLOS ONE, 2009, 4 (12)