Multimodal integration learning of robot behavior using deep neural networks

被引:128
作者
Noda, Kuniaki [1 ]
Arie, Hiroaki [1 ]
Suga, Yuki [1 ]
Ogata, Tetsuya [1 ]
机构
[1] Waseda Univ, Grad Sch Fundamental Sci & Engn, Dept Intermedia Art & Sci, Shinjuku Ku, Tokyo 1698555, Japan
关键词
Object manipulation; Multimodal integration; Cross-modal memory retrieval; Deep learning; MEMORY-IMPAIRED INDIVIDUALS;
D O I
10.1016/j.robot.2014.03.003
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
For humans to accurately understand the world around them, multimodal integration is essential because it enhances perceptual precision and reduces ambiguity. Computational models replicating such human ability may contribute to the practical use of robots in daily human living environments; however, primarily because of scalability problems that conventional machine learning algorithms suffer from, sensory-motor information processing in robotic applications has typically been achieved via modal-dependent processes. In this paper, we propose a novel computational framework enabling the integration of sensory-motor time-series data and the self-organization of multimodal fused representations based on a deep learning approach. To evaluate our proposed model, we conducted two behavior-learning experiments utilizing a humanoid robot; the experiments consisted of object manipulation and bell-ringing tasks. From our experimental results, we show that large amounts of sensory-motor information, including raw RGB images, sound spectrums, and joint angles, are directly fused to generate higher-level multimodal representations. Further, we demonstrated that our proposed framework realizes the following three functions: (1) cross-modal memory retrieval utilizing the information complementation capability of the deep autoencoder; (2) noise-robust behavior recognition utilizing the generalization capability of multimodal features; and (3) multimodal causality acquisition and sensory-motor prediction based on the acquired causality. (C) 2014 The Authors. Published by Elsevier B.V.
引用
收藏
页码:721 / 736
页数:16
相关论文
共 40 条
[1]  
[Anonymous], P IEEE INT C DEV LEA
[2]  
[Anonymous], 2010, P 10 INT C EP ROB LU
[3]  
[Anonymous], MEET GUY WHO HELPED
[4]  
[Anonymous], PERSONAL ROBOT 2 PR2
[5]  
[Anonymous], 2017, COMMUN ACM, DOI [DOI 10.1145/3065386, 10.1145/3065386]
[6]  
[Anonymous], 2011, ICML
[7]  
[Anonymous], NAO HUM
[8]  
[Anonymous], IMPROVING PHOTO SEAR
[9]  
[Anonymous], 2010, ICML
[10]  
[Anonymous], 2011, P 28 INT C MACH LEAR