Reinforcement learning for an ART-based fuzzy adaptive learning control network

被引:60
作者
Lin, CJ
Lin, CT
机构
[1] Department of Control Engineering, National Chiao-Tung University, Hsinchu
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 1996年 / 7卷 / 03期
关键词
D O I
10.1109/72.501728
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a reinforcement fuzzy adaptive learning control network (RFALCON) for solving various reinforcement learning problems, The proposed RFALCON is constructed by integrating two fuzzy adaptive learning control networks (FALCON's), each of which is a connectionist model with a feedforward multilayer network developed for the realization of a fuzzy controller, One FALCON performs as a critic network (fuzzy predictor), and the other as an action network (fuzzy controller), Using the temporal difference prediction method, the critic network can predict the external reinforcement signal and provide a more informative internal reinforcement signal to the action network, The action network performs a stochastic exploratory algorithm to adapt itself according to the internal reinforcement signal, An ART-based reinforcement structure/parameter-learning algorithm is developed for constructing the RFALCON dynamically, During the learning process, both structure learning and parameter learning are performed simultaneously in the two FALCON's, The proposed RFALCON can construct a fuzzy control system dynamically and automatically through a reward/penalty signal (i.e., a ''good'' or ''bad'' signal), It is best applied to the learning environment, where obtaining exact training data is expensive, The proposed RFALCON has two important features, First, it reduces the combinatorial demands placed by the standard methods for adaptive Linearization of a system, Second, the RFALCON is a highly autonomous system, Initially, there are no hidden nodes (i.e., no membership functions or Fuzzy rules), They are created and begin to grow as learning proceeds, The RFALCON can also dynamically partition the input-output spaces, tune activation (membership) functions, and find proper network connection types (fuzzy rules), Computer simulations have been conducted to illustrate the performance and applicability of the proposed learning scheme.
引用
收藏
页码:709 / 731
页数:23
相关论文
共 39 条
[1]  
ANDERSON CW, 1987, 4TH P INT WORKSH MAC, P103
[2]  
[Anonymous], 1992, NEURAL NETWORKS FUZZ
[3]  
[Anonymous], P IEEE INT C FUZZ SY
[4]   NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[5]   LANDMARK LEARNING - AN ILLUSTRATION OF ASSOCIATIVE SEARCH [J].
BARTO, AG ;
SUTTON, RS .
BIOLOGICAL CYBERNETICS, 1981, 42 (01) :1-8
[6]   PATTERN-RECOGNIZING STOCHASTIC LEARNING AUTOMATA [J].
BARTO, AG ;
ANANDAN, P .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1985, 15 (03) :360-375
[7]  
BARTO AG, P INT JOINT C NEUR N, V2, P629
[8]  
BENBRAHIM H, 1992, P INT JOINT C NEUR N, V1, P98
[9]   LEARNING AND TUNING FUZZY-LOGIC CONTROLLERS THROUGH REINFORCEMENTS [J].
BERENJI, HR ;
KHEDKAR, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1992, 3 (05) :724-740
[10]   FUZZY ART - FAST STABLE LEARNING AND CATEGORIZATION OF ANALOG PATTERNS BY AN ADAPTIVE RESONANCE SYSTEM [J].
CARPENTER, GA ;
GROSSBERG, S ;
ROSEN, DB .
NEURAL NETWORKS, 1991, 4 (06) :759-771