Rapid, safe, and incremental learning of navigation strategies

被引：52

作者：

Millan, JD

机构：

[1] Systems Engineering and Informatics, Joint Research Centre of the European Commission

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 1996年 / 26卷 / 03期

关键词：

D O I：

10.1109/3477.499792

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper we propose a reinforcement connectionist learning architecture that allows an autonomous robot to acquire efficient navigation strategies in a few trials. Besides rapid learning, the architecture has three further appealing features. First, the robot improves its performance incrementally as it interacts with an initially unknown environment, and it ends up learning to avoid collisions even in those situations in which its sensors cannot detect the obstacles. This is a definite advantage over nonlearning reactive robots. Second, since it learns from basic reflexes, the robot is operational from the very beginning and the learning process is safe. Third, the robot exhibits high tolerance to noisy sensory data and good generalization abilities. All these features make this learning robot's architecture very well suited to real-world applications. We report experimental results obtained with a real mobile robot in an indoor environment that demonstrate the appropriateness of our approach to real autonomous robot control.

引用

页码：408 / 420

页数：13

共 40 条

[1]

ALBUS JS, 1981, BRAINS BEHAVIOR ROBO

[2]

ALPAYDIN E, 1991, 91032 INT COMP SCI I

[3]

[Anonymous], 1988, SELF ORG ASS MEMORY

[4]

[Anonymous], 1989, THESIS CAMBRIDGE U E