REINFORCEMENT STRUCTURE PARAMETER LEARNING FOR NEURAL-NETWORK-BASED FUZZY-LOGIC CONTROL-SYSTEMS

被引：192

作者：

LIN, CT

LEE, CSG

机构：

[1] NATL CHIAO TUNG UNIV,DEPT CONTROL ENGN,HSINCHU,TAIWAN

[2] PURDUE UNIV,SCH ELECT ENGN,W LAFAYETTE,IN 47907

来源：

IEEE TRANSACTIONS ON FUZZY SYSTEMS | 1994年 / 2卷 / 01期

基金：

美国国家科学基金会;

关键词：

D O I：

10.1109/91.273126

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

This paper proposes a reinforcement neural-network-based fuzzy logic control system (RNN-FLCS) for solving various reinforcement learning problems. The proposed RNN-FLCS is constructed by integrating two neural-network-based fuzzy logic controllers (NN-FLC's), each of which is a connectionist model with a feedforward multilayered, network developed for the realization of a fuzzy logic controller. One NN-FLC performs as a fuzzy predictor, and the other as a fuzzy controller. Using the temporal difference prediction method, the fuzzy predictor can predict the external reinforcement signal and provide a more informative internal reinforcement signal to the fuzzy controller. The fuzzy controller performs a stochastic exploratory algorithm to adapt itself according to the internal reinforcement signal. During the learning process, both structure learning and parameter learning are performed simultaneously in the two NN-FLC's using the fuzzy similarity measure. The proposed RNN-FLCS can construct a fuzzy logic control and decision-making system automatically and dynamically through a reward/penalty signal (i.e., a ''good'' or ''bad'' signal) or through very simple fuzzy information feedback such as ''high,'' ''too high,'' ''low,'' and ''too low.'' The proposed RNN-FLCS is best applied to the learning environment, where obtaining exact training data is expensive. The proposed RNN-FLCS also preserves the advantages of the original NN-FLC, such as the ability to find proper network structure and parameters simultaneously and dynamically and to avoid the rule-matching time of the inference engine in the traditional fuzzy logic systems. Computer simulations were conducted to illustrate the performance and applicability of the proposed RNN-FLCS.

引用

页码：46 / 63

页数：18

共 33 条

[1] MECHANISMS OF PLANNING AND PROBLEM-SOLVING IN THE BRAIN [J].

ALBUS, JS .

MATHEMATICAL BIOSCIENCES, 1979, 45 (3-4) :247-293

[2]

ANDERSON CW, 1990, NEURAL NETWORKS CONT, pCHA

[3]

ANDERSON CW, 1987, 4TH P INT WORKSH MAC, P103

[4] NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].

BARTO, AG ;

SUTTON, RS ;

ANDERSON, CW .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846

[5] LANDMARK LEARNING - AN ILLUSTRATION OF ASSOCIATIVE SEARCH [J].

BARTO, AG ;

SUTTON, RS .

BIOLOGICAL CYBERNETICS, 1981, 42 (01) :1-8

[6] PATTERN-RECOGNIZING STOCHASTIC LEARNING AUTOMATA [J].

BARTO, AG ;

ANANDAN, P .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1985, 15 (03) :360-375

[7]

BARTO AG, 1987 P INT JOINT C N, V2, P629

[8]

Berenji H. R., 1992, International Journal of Approximate Reasoning, V6, P267, DOI 10.1016/0888-613X(92)90020-Z

[9] LEARNING AND TUNING FUZZY-LOGIC CONTROLLERS THROUGH REINFORCEMENTS [J].

BERENJI, HR ;

KHEDKAR, P .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 1992, 3 (05) :724-740

[10]

CANNON RH, 1967, DYNAMICS PHYSICAL SY

← 1 2 3 4 →