Small-vocabulary speech recognition using surface electromyography

Cited by: 27
Authors
Betts, Bradley J.
Binsted, Kim
Jorgensen, Charles
Affiliations
[1] NASA, UH, Astrobiol Inst, Dept Informat & Comp Sci, Honolulu, HI 96744 USA
[2] NASA, Ames Res Ctr, QSS Grp Inc, Moffett Field, CA 94035 USA
[3] NASA, Ames Res Ctr, Neuro Engn Lab, Moffett Field, CA 94035 USA
Funding
National Aeronautics and Space Administration (NASA);
Keywords
electromyography; EMG; bioelectric; EMG speech recognition; first responder; pattern recognition; SCBA;
DOI
10.1016/j.intcom.2006.08.012
Chinese Library Classification number
TP3 [Computing technology, computer technology];
Discipline classification code
0812
Abstract
We present results of electromyographic (EMG) speech recognition on a small vocabulary of 15 English words. EMG speech recognition holds promise for mitigating the effects of high acoustic noise on speech intelligibility in communication systems, including those used by first responders (a focus of this work). We collected 150 examples per word of single-channel EMG data from a male subject, speaking normally while wearing a firefighter's self-contained breathing apparatus. The signal processing consisted of an activity detector, a feature extractor, and a neural network classifier. Testing produced an overall average correct classification rate on the 15 words of 74% with a 95% confidence interval of (71%, 77%). Once trained, the subject used a classifier as part of a real-time system to communicate to a cellular phone and to control a robotic device. These tasks were performed under an ambient noise level of approximately 95 decibels. We also describe ongoing work on phoneme-level EMG speech recognition. Crown Copyright (c) 2006 Published by Elsevier B.V. All rights reserved.
Pages: 1242-1259
Page count: 18