REINFORCEMENT STRUCTURE PARAMETER LEARNING FOR NEURAL-NETWORK-BASED FUZZY-LOGIC CONTROL-SYSTEMS

被引:192
作者
LIN, CT
LEE, CSG
机构
[1] NATL CHIAO TUNG UNIV,DEPT CONTROL ENGN,HSINCHU,TAIWAN
[2] PURDUE UNIV,SCH ELECT ENGN,W LAFAYETTE,IN 47907
基金
美国国家科学基金会;
关键词
D O I
10.1109/91.273126
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a reinforcement neural-network-based fuzzy logic control system (RNN-FLCS) for solving various reinforcement learning problems. The proposed RNN-FLCS is constructed by integrating two neural-network-based fuzzy logic controllers (NN-FLC's), each of which is a connectionist model with a feedforward multilayered, network developed for the realization of a fuzzy logic controller. One NN-FLC performs as a fuzzy predictor, and the other as a fuzzy controller. Using the temporal difference prediction method, the fuzzy predictor can predict the external reinforcement signal and provide a more informative internal reinforcement signal to the fuzzy controller. The fuzzy controller performs a stochastic exploratory algorithm to adapt itself according to the internal reinforcement signal. During the learning process, both structure learning and parameter learning are performed simultaneously in the two NN-FLC's using the fuzzy similarity measure. The proposed RNN-FLCS can construct a fuzzy logic control and decision-making system automatically and dynamically through a reward/penalty signal (i.e., a ''good'' or ''bad'' signal) or through very simple fuzzy information feedback such as ''high,'' ''too high,'' ''low,'' and ''too low.'' The proposed RNN-FLCS is best applied to the learning environment, where obtaining exact training data is expensive. The proposed RNN-FLCS also preserves the advantages of the original NN-FLC, such as the ability to find proper network structure and parameters simultaneously and dynamically and to avoid the rule-matching time of the inference engine in the traditional fuzzy logic systems. Computer simulations were conducted to illustrate the performance and applicability of the proposed RNN-FLCS.
引用
收藏
页码:46 / 63
页数:18
相关论文
共 33 条
[1]   MECHANISMS OF PLANNING AND PROBLEM-SOLVING IN THE BRAIN [J].
ALBUS, JS .
MATHEMATICAL BIOSCIENCES, 1979, 45 (3-4) :247-293
[2]  
ANDERSON CW, 1990, NEURAL NETWORKS CONT, pCHA
[3]  
ANDERSON CW, 1987, 4TH P INT WORKSH MAC, P103
[4]   NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[5]   LANDMARK LEARNING - AN ILLUSTRATION OF ASSOCIATIVE SEARCH [J].
BARTO, AG ;
SUTTON, RS .
BIOLOGICAL CYBERNETICS, 1981, 42 (01) :1-8
[6]   PATTERN-RECOGNIZING STOCHASTIC LEARNING AUTOMATA [J].
BARTO, AG ;
ANANDAN, P .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1985, 15 (03) :360-375
[7]  
BARTO AG, 1987 P INT JOINT C N, V2, P629
[8]  
Berenji H. R., 1992, International Journal of Approximate Reasoning, V6, P267, DOI 10.1016/0888-613X(92)90020-Z
[9]   LEARNING AND TUNING FUZZY-LOGIC CONTROLLERS THROUGH REINFORCEMENTS [J].
BERENJI, HR ;
KHEDKAR, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1992, 3 (05) :724-740
[10]  
CANNON RH, 1967, DYNAMICS PHYSICAL SY