Living in a partially structured environment: How to bypass the limitations of classical reinforcement techniques

被引:10
作者
Gaussier, P
Revel, A
Joulain, C
Zrehen, S
机构
[1] UNIV SO CALIF, CTR NEURAL ENGN, LOS ANGELES, CA 90089 USA
[2] UNIV SO CALIF, ROBOT LAB, LOS ANGELES, CA 90089 USA
关键词
neural networks; unsupervised learning; topological maps; reinforcement; autonomous robot; all-or-none learning;
D O I
10.1016/S0921-8890(97)80708-7
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
In this paper, we propose an unsupervised neural network allowing a robot to learn sensory-motor associations with a delayed reward. The robot task is to learn the ''meaning'' of pictograms in order to ''survive'' in a maze. First, we introduce a new neural conditioning rule probabilistic conditioning rule (PCR) allowing us to test hypotheses (associations between visual categories and movements) during a given time span. Second, we describe a real maze experiment with our mobile robot. We propose a neural architecture overcoming the difficulty to build visual categories dynamically while associating them to movements. Third, we propose to use our algorithm on a simulation in order to test it exhaustively. We give the results for different kinds of mazes and we compare our system to an adapted version of the Q-learning algorithm. Finally, we conclude by showing the limitations of approaches that do not take into account the intrinsic complexity of a reasoning based on image recognition.
引用
收藏
页码:225 / 250
页数:26
相关论文
共 39 条
[1]   OUTLINE FOR A THEORY OF INTELLIGENCE [J].
ALBUS, JS .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1991, 21 (03) :473-509
[2]  
[Anonymous], NEURAL DARWINISM
[3]   ASSOCIATIVE SEARCH NETWORK - A REINFORCEMENT LEARNING ASSOCIATIVE MEMORY [J].
BARTO, AG ;
SUTTON, RS ;
BROUWER, PS .
BIOLOGICAL CYBERNETICS, 1981, 40 (03) :201-211
[4]   NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[5]   LANDMARK LEARNING - AN ILLUSTRATION OF ASSOCIATIVE SEARCH [J].
BARTO, AG ;
SUTTON, RS .
BIOLOGICAL CYBERNETICS, 1981, 42 (01) :1-8
[6]   A ROBUST LAYERED CONTROL-SYSTEM FOR A MOBILE ROBOT [J].
BROOKS, RA .
IEEE JOURNAL OF ROBOTICS AND AUTOMATION, 1986, 2 (01) :14-23
[7]  
CARPENTER GA, 1987, P NEURAL NETWORK, V2, P737
[8]  
CHANGEUX JP, 1985, NEURONAL MAN BIOL MI
[9]   Deliberation and reactivity in autonomous mobile robots [J].
Chatila, R .
ROBOTICS AND AUTONOMOUS SYSTEMS, 1995, 16 (2-4) :197-211
[10]  
Gallistel C. R., 1993, ORG LEARNING