Rapid, safe, and incremental learning of navigation strategies

被引:52
作者
Millan, JD
机构
[1] Systems Engineering and Informatics, Joint Research Centre of the European Commission
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 1996年 / 26卷 / 03期
关键词
D O I
10.1109/3477.499792
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper we propose a reinforcement connectionist learning architecture that allows an autonomous robot to acquire efficient navigation strategies in a few trials. Besides rapid learning, the architecture has three further appealing features. First, the robot improves its performance incrementally as it interacts with an initially unknown environment, and it ends up learning to avoid collisions even in those situations in which its sensors cannot detect the obstacles. This is a definite advantage over nonlearning reactive robots. Second, since it learns from basic reflexes, the robot is operational from the very beginning and the learning process is safe. Third, the robot exhibits high tolerance to noisy sensory data and good generalization abilities. All these features make this learning robot's architecture very well suited to real-world applications. We report experimental results obtained with a real mobile robot in an indoor environment that demonstrate the appropriateness of our approach to real autonomous robot control.
引用
收藏
页码:408 / 420
页数:13
相关论文
共 40 条
[1]  
ALBUS JS, 1981, BRAINS BEHAVIOR ROBO
[2]  
ALPAYDIN E, 1991, 91032 INT COMP SCI I
[3]  
[Anonymous], 1988, SELF ORG ASS MEMORY
[4]  
[Anonymous], 1989, THESIS CAMBRIDGE U E
[5]   Integrating behavioral, perceptual, and world knowledge in reactive navigation [J].
Arkin, Ronald C. .
Robotics and Autonomous Systems, 1990, 6 (1-2) :105-122
[6]   ROBOT MOTION PLANNING - A DISTRIBUTED REPRESENTATION APPROACH [J].
BARRAQUAND, J ;
LATOMBE, JC .
INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, 1991, 10 (06) :628-649
[7]   LEARNING TO ACT USING REAL-TIME DYNAMIC-PROGRAMMING [J].
BARTO, AG ;
BRADTKE, SJ ;
SINGH, SP .
ARTIFICIAL INTELLIGENCE, 1995, 72 (1-2) :81-138
[8]   NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[9]  
BARTO AG, 1989, 8995 COINS U MASS DE
[10]   REAL-TIME OBSTACLE AVOIDANCE FOR FAST MOBILE ROBOTS [J].
BORENSTEIN, J ;
KOREN, Y .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1989, 19 (05) :1179-1187