Application of Imitation Learning in Research on Robot Bionic Mechanisms

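Cited by: 6
Authors
Yu Jianjun (于建均)
Men Yusen (门玉森)
Ruan Xiaogang (阮晓钢)
Xu Congchi (徐骢驰)
Affiliation
[1] College of Electronic Information and Control Engineering, Beijing University of Technology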
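Funding
Specialized Research Fund for the Doctoral Program of Higher Education
Keywords
robot; bionics; imitation learning; behavior representation; reinforcement learning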
DOI
Not available
Chinese Library Classification
TP242 [Robots]
Discipline classification code
1111
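Abstract
This paper gives a fairly systematic review of the robot imitation learning process and discusses the key open issues in the field. Drawing on the biological mechanisms of imitation, it constructs an engineering application framework for robot imitation learning. Guided by this framework, it focuses on the behavior representation problem in imitation learning and the research progress on it, compares how imitation learning and reinforcement learning are applied to robot motor skill learning, and gives an outlook on future research in the field. The review indicates that robot imitation learning is an active topic in research on robot bionic mechanisms.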
Pages: 210-216
Page count: 7