A syntactic approach to robot imitation learning using probabilistic activity grammars

被引:51
作者
Lee, Kyuhwa [1 ]
Su, Yanyu [1 ]
Kim, Tae-Kyun [1 ]
Demiris, Yiannis [1 ]
机构
[1] Univ London Imperial Coll Sci Technol & Med, Dept Elect & Elect Engn, Personal Robot Lab, London SW7 2BT, England
关键词
Robot imitation learning; Probabilistic grammars; Activity representation; RECOGNITION; ROUTE; TASKS;
D O I
10.1016/j.robot.2013.08.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper describes a syntactic approach to imitation learning that captures important task structures in the form of probabilistic activity grammars from a reasonably small number of samples under noisy conditions. We show that these learned grammars can be recursively applied to help recognize unforeseen, more complicated tasks that share underlying structures. The grammars enforce an observation to be consistent with the previously observed behaviors which can correct unexpected, out-of-context actions due to errors of the observer and/or demonstrator. To achieve this goal, our method (1) actively searches for frequently occurring action symbols that are subsets of input samples to uncover the hierarchical structure of the demonstration, and (2) considers the uncertainties of input symbols due to imperfect low-level detectors. We evaluate the proposed method using both synthetic data and two sets of real-world humanoid robot experiments. In our Towers of Hanoi experiment, the robot learns the important constraints of the puzzle after observing demonstrators solving it. In our Dance Imitation experiment, the robot learns 3 types of dances from human demonstrations. The results suggest that under reasonable amount of noise, our method is capable of capturing the reusable task structures and generalizing them to cope with recursions. (C) 2013 Elsevier B.V. All rights reserved.
引用
收藏
页码:1323 / 1334
页数:12
相关论文
共 47 条
[1]  
Aloimonos Y, 2010, HUMAN-CENTRIC INTERFACES FOR AMBIENT INTELLIGENCE, P95, DOI 10.1016/B978-0-12-374708-2.00005-X
[2]  
[Anonymous], P IEEE INT C DEV LEA
[3]  
[Anonymous], 2003, Proceedings of the 2rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), DOI DOI 10.1145/860575.860614
[4]  
[Anonymous], 2008, SPRINGER HDB ROBOTIC
[5]  
Asada M, 2006, SPRINGER TRAC ADV RO, V21, P269
[6]   IMITATION LEARNING OF DUAL-ARM MANIPULATION TASKS IN HUMANOID ROBOTS [J].
Asfour, T. ;
Azad, P. ;
Gyarfas, F. ;
Dillmann, R. .
INTERNATIONAL JOURNAL OF HUMANOID ROBOTICS, 2008, 5 (02) :183-202
[7]   Discerning intentions in dynamic human action [J].
Baldwin, DA ;
Baird, JA .
TRENDS IN COGNITIVE SCIENCES, 2001, 5 (04) :171-178
[8]   Learning similar tasks from observation and practice [J].
Bentivegna, Darrin C. ;
Atkeson, Christopher G. ;
Cheng, Gordon .
2006 IEEE/RSJ INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS, VOLS 1-12, 2006, :2677-2683
[9]   Discriminative and adaptive imitation in uni-manual and bi-manual tasks [J].
Billard, Aude G. ;
Calinon, Sylvain ;
Guenter, Florent .
ROBOTICS AND AUTONOMOUS SYSTEMS, 2006, 54 (05) :370-384
[10]  
Bradski G, 2000, DR DOBBS J, V25, P120