Active learning: Learning a motor skill without a coach

被引：32

作者：

Huang, Vincent S. ^{[1
]}

Shadmehr, Reza ^{[1
]}

Diedrichsen, Joern ^{[2
]}

机构：

[1] Johns Hopkins Sch Med, Dept Biomed Engn, Lab Computat Motor Control, Baltimore, MD USA

[2] Bangor Univ, Sch Psychol, Bangor, Gwynedd, Wales

来源：

JOURNAL OF NEUROPHYSIOLOGY | 2008年 / 100卷 / 02期

关键词：

D O I：

10.1152/jn.01095.2007

中图分类号：

Q189 [神经科学];

学科分类号：

071006 [神经生物学];

摘要：

When we learn a new skill (e. g., golf) without a coach, we are "active learners": we have to choose the specific components of the task on which to train (e. g., iron, driver, putter, etc.). What guides our selection of the training sequence? How do choices that people make compare with choices made by machine learning algorithms that attempt to optimize performance? We asked subjects to learn the novel dynamics of a robotic tool while moving it in four directions. They were instructed to choose their practice directions to maximize their performance in subsequent tests. We found that their choices were strongly influenced by motor errors: subjects tended to immediately repeat an action if that action had produced a large error. This strategy was correlated with better performance on test trials. However, even when participants performed perfectly on a movement, they did not avoid repeating that movement. The probability of repeating an action did not drop below chance even when no errors were observed. This behavior led to suboptimal performance. It also violated a strong prediction of current machine learning algorithms, which solve the active learning problem by choosing a training sequence that will maximally reduce the learner's uncertainty about the task. While we show that these algorithms do not provide an adequate description of human behavior, our results suggest ways to improve human motor learning by helping people choose an optimal training sequence.

引用

页码：879 / 887

页数：9

共 19 条

[1]

Application of motor learning principles to complex surgical tasks: Searching for the optimal practice schedule [J].

Brydges, Ryan ;

Carnahan, Heather ;

Backstein, David ;

Dubrowski, Adam .

JOURNAL OF MOTOR BEHAVIOR, 2007, 39 (01) :40-48

[2]

The central nervous system stabilizes unstable dynamics by learning optimal impedance [J].

Burdet, E ;

Osu, R ;

Franklin, DW ;

Milner, TE ;

Kawato, M .

NATURE, 2001, 414 (6862) :446-449

[3]

Feedback after good trials enhances learning [J].

Chiviacowsky, Suzete ;

Wulf, Gabriele .

RESEARCH QUARTERLY FOR EXERCISE AND SPORT, 2007, 78 (02) :40-47

[4]

Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration [J].

Cohen, Jonathan D. ;

McClure, Samuel M. ;

Yu, Angela J. .

PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY B-BIOLOGICAL SCIENCES, 2007, 362 (1481) :933-942

[5]

Active learning with statistical models [J].

Cohn, DA ;

Ghahramani, Z ;

Jordan, MI .

JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :129-145

[6]

Cortical substrates for exploratory decisions in humans [J].

Daw, Nathaniel D. ;

O'Doherty, John P. ;

Dayan, Peter ;

Seymour, Ben ;

Dolan, Raymond J. .

NATURE, 2006, 441 (7095) :876-879

[7]

Donchin O, 2003, J NEUROSCI, V23, P9032

[8]

A bottom-up model of spatial attention predicts human error patterns in rapid scene recognition [J].

Einhaeuser, Wolfgang ;

Mundhenk, T. Nathan ;

Baldi, Pierre ;

Koch, Christof ;

Itti, Laurent .

JOURNAL OF VISION, 2007, 7 (10)

[9]

Evolution of motor memory during the seconds after observation of motor error [J].

Huang, Vincent S. ;

Shadmehr, Reza .

JOURNAL OF NEUROPHYSIOLOGY, 2007, 97 (06) :3976-3985

[10]

A gain-field encoding of limb position and velocity in the internal model of arm dynamics [J].

Hwang, EJ ;

Donchin, O ;

Smith, MA ;

Shadmehr, R .

PLOS BIOLOGY, 2003, 1 (02) :209-220

← 1 2 →