Codevelopmental learning between human and humanoid robot using a dynamic neural-network model

被引:32
作者
Tani, Jun [1 ]
Nishimoto, Ryu [1 ]
Namikawa, Jun [1 ]
Ito, Masato [2 ]
机构
[1] Brain Sci Inst, RIKEN, Wako, Saitama 3510198, Japan
[2] Sony Corp, Tokyo 1698050, Japan
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 2008年 / 38卷 / 01期
关键词
compositionality; continuous-time recurrent neural network (CTRNN); development learning; humanoid robot;
D O I
10.1109/TSMCB.2007.907738
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper examines characteristics of interactive learning between human tutors and a robot having a dynamic neural-network model, which is inspired by human parietal cortex functions. A humanoid robot, with a recurrent neural network that has a hierarchical structure, learns to manipulate objects. Robots learn tasks in repeated self-trials with the assistance of human interaction, which provides physical guidance until the tasks are mastered and learning is consolidated within the neural networks. Experimental results and the analyses showed the following: 1) codevelopmental shaping of task behaviors stems from interactions between the robot and a tutor; 2) dynamic structures for articulating and sequencing of behavior primitives are self-organized in the hierarchically organized network; and 3) such structures can afford both generalization and context dependency in generating skilled behaviors.
引用
收藏
页码:43 / 59
页数:17
相关论文
共 51 条
[1]  
Arbib M.A., 2011, HDB PHYSL NERVOUS SY, P1449, DOI 10.1002/cphy.cp010233
[2]  
Atkeson C. G., 1997, P 14 INT C MACH LEAR, P12
[3]  
BERTA A, 1987, Bulletin of the Florida State Museum Biological Sciences, V31, P1
[4]   Evolving the neural controller for a robotic arm able to grasp objects on the basis of tactile sensors [J].
Bianco, R ;
Nolfi, S .
ADAPTIVE BEHAVIOR, 2004, 12 (01) :37-45
[5]   On learning, representing, and generalizing a task in a humanoid robot [J].
Calinon, Sylvain ;
Guenter, Florent ;
Billard, Aude .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2007, 37 (02) :286-298
[6]  
Demiris J, 2002, FROM ANIM ANIMAT, P327
[7]  
DEMIRIS Y, 2002, IMITATION ANIMALS AR, P1448
[8]   Multiple model-based reinforcement learning [J].
Doya, K ;
Samejima, K ;
Katagiri, K ;
Kawato, M .
NEURAL COMPUTATION, 2002, 14 (06) :1347-1369
[9]  
DOYA K, 1989, P 1989 INT JOINT C N, V1, P27
[10]   Ventral intraparietal area of the macaque: Congruent visual and somatic response properties [J].
Duhamel, JR ;
Colby, CL ;
Goldberg, ME .
JOURNAL OF NEUROPHYSIOLOGY, 1998, 79 (01) :126-136