Incremental Local Online Gaussian Mixture Regression for Imitation Learning of Multiple Tasks

被引：43

作者：

Cederborg, Thomas

Li, Ming

Baranes, Adrien

Oudeyer, Pierre-Yves

机构：

来源：

IEEE/RSJ 2010 INTERNATIONAL CONFERENCE ON INTELLIGENT ROBOTS AND SYSTEMS (IROS 2010) | 2010年

关键词：

D O I：

10.1109/IROS.2010.5652040

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Gaussian Mixture Regression has been shown to be a powerful and easy-to-tune regression technique for imitation learning of constrained motor tasks in robots. Yet, current formulations are not suited when one wants a robot to learn incrementally and online a variety of new context-dependant tasks whose number and complexity is not known at programming time, and when the demonstrator is not allowed to tell the system when he introduces a new task (but rather the system should infer this from the continuous sensorimotor context). In this paper, we show that this limitation can be addressed by introducing an Incremental, Local and Online variation of Gaussian Mixture Regression (ILO-GMR) which successfully allows a simulated robot to learn incrementally and online new motor tasks through modelling them locally as dynamical systems, and able to use the sensorimotor context to cope with the absence of categorical information both during demonstrations and when a reproduction is asked to the system. Moreover, we integrate a complementary statistical technique which allows the system to incrementally learn various tasks which can be intrinsically defined in different frames of reference, which we call framings, without the need to tell the system which particular framing should be used for each task: this is inferred automatically by the system.

引用

页码：267 / 274

页数：8

共 22 条

[1] Correspondence mapping induced state and action metrics for robotic imitation [J].

Alissandrakis, Aris ;

Nehaniv, Chrystopher L. ;

Dautenhahn, Kerstin .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02) :299-307

[2]

[Anonymous], 2000, Proceedings of the international conference on machine learning (ICML)

[3]

[Anonymous], 2003, Proceedings of the 2rd International Joint Conference on Autonomous Agents and Multiagent Systems (AAMAS), DOI DOI 10.1145/860575.860614

[4]

[Anonymous], P INT C ROB AUT ICRA

[5]

Billard A., 2008, Springer Handbook of robotics

[6]

Calinon S., 2009, P IEEE RAS INT C HUM

[7]

Calinon S., 2009, P INT C ADV ROB ICAR

[8]

Calinon S, 2007, IMITATION AND SOCIAL LEARNING IN ROBOTS, HUMANS AND ANIMALS: BEHAVIOURAL, SOCIAL AND COMMUNICATIVE DIMENSIONS, P153, DOI 10.1017/CBO9780511489808.012

[9] On learning, representing, and generalizing a task in a humanoid robot [J].

Calinon, Sylvain ;

Guenter, Florent ;

Billard, Aude .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02) :286-298

[10]

Calinon Sylvain, 2009, Robot programming by demonstration - a probabilistic approach, robot programming by demonstration - a probabilistic approach

← 1 2 3 →