Teachable robots: Understanding human teaching behavior to build more effective robot learners

被引：213

作者：

Thomaz, Andrea L. ^{[1
]}

Breazeal, Cynthia ^{[1
]}

机构：

[1] Georgia Inst Technol, Atlanta, GA 30332 USA

来源：

ARTIFICIAL INTELLIGENCE | 2008年 / 172卷 / 6-7期

关键词：

human-robot interaction; reinforcement learning; user studies;

D O I：

10.1016/j.artint.2007.09.009

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

While Reinforcement Learning (RL) is not traditionally designed for interactive supervisory input from a human teacher, several works in both robot and software agents have adapted it for human input by letting a human trainer control the reward signal. In this work, we experimentally examine the assumption underlying these works, namely that the human-given reward is compatible with the traditional RL reward signal. We describe an experimental platform with a simulated RL robot and present an analysis of real-time human teaching behavior found in a study in which untrained subjects taught the robot to perform a new task. We report three main observations on how people administer feedback when teaching a Reinforcement Learning agent: (a) they use the reward channel not only for feedback, but also for future-directed guidance; (b) they have a positive bias to their feedback, possibly using the signal as a motivational channel; and (c) they change their behavior as they develop a mental model of the robotic learner. Given this, we made specific modifications to the simulated RL robot, and analyzed and evaluated its learning behavior in four follow-up experiments with human trainers. We report significant improvements on several learning measures. This work demonstrates the importance of understanding the human-teacher/robot-learner partnership in order to design algorithms that support how people want to teach and simultaneously improve the robot's learning behavior. (c) 2007 Elsevier B.V. All rights reserved.

引用

页码：716 / 737

页数：22

共 38 条

[1]

ARGYLE M, 1973, SEMIOTICA, P19

[2]

ARKIN R, 2003, P C ROB AUT SYST

[3] THE ROLE OF EMOTION IN BELIEVABLE AGENTS [J].

BATES, J .

COMMUNICATIONS OF THE ACM, 1994, 37 (07) :122-125

[4]

BLUMBERG B, 2002, P ACM SIGGRAPH

[5]

Blumberg B.M., 1997, THESIS MIT

[6]

BREAZEAL C, 2004, INT J HUMANOID ROBIT, V1

[7]

Breazeal Cynthia, 2002, DESIGNING SOCIABLE R

[8]

CLOUSE J, 1992, P 9 INT C MACH LEARN, P92

[9]

COHN D, 1995, ADV NEURAL INFORM PR, V7

[10]

Evans R., 2002, AI GAME PROGRAMMING, P567

← 1 2 3 4 →