Living in a partially structured environment: How to bypass the limitations of classical reinforcement techniques

被引：10

作者：

Gaussier, P

Revel, A

Joulain, C

Zrehen, S

机构：

[1] UNIV SO CALIF, CTR NEURAL ENGN, LOS ANGELES, CA 90089 USA

[2] UNIV SO CALIF, ROBOT LAB, LOS ANGELES, CA 90089 USA

来源：

ROBOTICS AND AUTONOMOUS SYSTEMS | 1997年 / 20卷 / 2-4期

关键词：

neural networks; unsupervised learning; topological maps; reinforcement; autonomous robot; all-or-none learning;

D O I：

10.1016/S0921-8890(97)80708-7

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In this paper, we propose an unsupervised neural network allowing a robot to learn sensory-motor associations with a delayed reward. The robot task is to learn the ''meaning'' of pictograms in order to ''survive'' in a maze. First, we introduce a new neural conditioning rule probabilistic conditioning rule (PCR) allowing us to test hypotheses (associations between visual categories and movements) during a given time span. Second, we describe a real maze experiment with our mobile robot. We propose a neural architecture overcoming the difficulty to build visual categories dynamically while associating them to movements. Third, we propose to use our algorithm on a simulation in order to test it exhaustively. We give the results for different kinds of mazes and we compare our system to an adapted version of the Q-learning algorithm. Finally, we conclude by showing the limitations of approaches that do not take into account the intrinsic complexity of a reasoning based on image recognition.

引用

页码：225 / 250

页数：26

共 39 条

[1] OUTLINE FOR A THEORY OF INTELLIGENCE [J].