Teachable robots: Understanding human teaching behavior to build more effective robot learners

被引:213
作者
Thomaz, Andrea L. [1 ]
Breazeal, Cynthia [1 ]
机构
[1] Georgia Inst Technol, Atlanta, GA 30332 USA
关键词
human-robot interaction; reinforcement learning; user studies;
D O I
10.1016/j.artint.2007.09.009
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
While Reinforcement Learning (RL) is not traditionally designed for interactive supervisory input from a human teacher, several works in both robot and software agents have adapted it for human input by letting a human trainer control the reward signal. In this work, we experimentally examine the assumption underlying these works, namely that the human-given reward is compatible with the traditional RL reward signal. We describe an experimental platform with a simulated RL robot and present an analysis of real-time human teaching behavior found in a study in which untrained subjects taught the robot to perform a new task. We report three main observations on how people administer feedback when teaching a Reinforcement Learning agent: (a) they use the reward channel not only for feedback, but also for future-directed guidance; (b) they have a positive bias to their feedback, possibly using the signal as a motivational channel; and (c) they change their behavior as they develop a mental model of the robotic learner. Given this, we made specific modifications to the simulated RL robot, and analyzed and evaluated its learning behavior in four follow-up experiments with human trainers. We report significant improvements on several learning measures. This work demonstrates the importance of understanding the human-teacher/robot-learner partnership in order to design algorithms that support how people want to teach and simultaneously improve the robot's learning behavior. (c) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:716 / 737
页数:22
相关论文
共 38 条
[1]  
ARGYLE M, 1973, SEMIOTICA, P19
[2]  
ARKIN R, 2003, P C ROB AUT SYST
[3]   THE ROLE OF EMOTION IN BELIEVABLE AGENTS [J].
BATES, J .
COMMUNICATIONS OF THE ACM, 1994, 37 (07) :122-125
[4]  
BLUMBERG B, 2002, P ACM SIGGRAPH
[5]  
Blumberg B.M., 1997, THESIS MIT
[6]  
BREAZEAL C, 2004, INT J HUMANOID ROBIT, V1
[7]  
Breazeal Cynthia, 2002, DESIGNING SOCIABLE R
[8]  
CLOUSE J, 1992, P 9 INT C MACH LEARN, P92
[9]  
COHN D, 1995, ADV NEURAL INFORM PR, V7
[10]  
Evans R., 2002, AI GAME PROGRAMMING, P567