Coadaptive Brain-Machine Interface via Reinforcement Learning

被引:103
作者
DiGiovanna, Jack [1 ]
Mahmoudi, Babak [1 ]
Fortes, Jose [2 ]
Principe, Jose C. [2 ]
Sanchez, Justin C. [3 ]
机构
[1] Univ Florida, Dept Biomed Engn, Gainesville, FL 32608 USA
[2] Univ Florida, Dept Elect & Comp Engn, Gainesville, FL 32608 USA
[3] Univ Florida, Dept Pediat, Div Neurol, JHMHC, Gainesville, FL 32610 USA
基金
美国国家科学基金会;
关键词
Brain-machine interface (BMI); coadaptation; neuroprosthetic; reinforcement learning (RL); MOTOR CORTEX; CORTICAL CONTROL; PREDICTION; ALGORITHMS; EXTRACTION; NEURONS; DEVICES; MODELS; RAT;
D O I
10.1109/TBME.2008.926699
中图分类号
R318 [生物医学工程];
学科分类号
0831 ;
摘要
This paper introduces and demonstrates a novel brain-machine interface (BMI) architecture based on the concepts of reinforcement learning (RL), coadaptation, and shaping. RL allows the BMI control algorithm to learn to complete tasks from interactions with the environment, rather than an explicit training signal. Coadaption enables continuous, synergistic adaptation between the BMI control algorithm and BMI user working in changing environments. Shaping is designed to reduce the learning curve for BMI users attempting to control a prosthetic; Here, we present the theory and in vivo experimental paradigm to illustrate how this BMI learns to complete a reaching task using a prosthetic arm in a 3-D workspace based on the user's neuronal activity. This semisupervised learning framework does not require user movements. We quantify BMI performance in closed-loop brain control over six to ten days for three rats as a function of increasing task difficulty. All three subjects coadapted with their BMI control algorithms to control the prosthetic significantly above chance at each level of difficulty.
引用
收藏
页码:54 / 64
页数:11
相关论文
共 44 条
[1]  
Bishop Christopher M, 1995, Neural networks for pattern recognition
[2]  
Bower G.H., 1981, Theories of Learning
[3]   Multiple neural spike train data analysis: state-of-the-art and future challenges [J].
Brown, EN ;
Kass, RE ;
Mitra, PP .
NATURE NEUROSCIENCE, 2004, 7 (05) :456-461
[4]  
Buzsaki G., 2006, Rhythms of the Brain
[5]  
Chapin J.K., 1990, The Cerebral Cortex of the Rat, P341
[6]  
Chapin J.K., 2001, METHODS NEW FRONTIER
[7]   Real-time control of a robot arm using simultaneously recorded neurons in the motor cortex [J].
Chapin, JK ;
Moxon, KA ;
Markowitz, RS ;
Nicolelis, MAL .
NATURE NEUROSCIENCE, 1999, 2 (07) :664-670
[8]  
DEISENROTH MP, 2008, AM CONTR C SEATTL WA
[9]  
DIGIOVANNA J, 2007, P IEEE EMBS C NEUR E, P530
[10]  
DIGIOVANNA J, 2007, INT C COMP SCI BEIJ