A parallel fuzzy inference model with distributed prediction scheme for reinforcement learning

被引：13

作者：

Kuo, YH ^{[1
]}

Hsu, JP ^{[1
]}

Wang, CW ^{[1
]}

机构：

[1] Natl Cheng Kung Univ, Inst Informat Engn, Tainan 70101, Taiwan

来源：

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 1998年 / 28卷 / 02期

关键词：

D O I：

10.1109/3477.662757

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

This paper proposes a three-layered parallel fuzzy inference model called reinforcement fuzzy neural network with distributed prediction scheme (RFNN-DPS), which performs reinforcement learning with a novel distributed prediction scheme, In RFNN-DPS, an additional predictor for predicting the external reinforcement signal is mot necessary, and the internal reinforcement information is distributed into fuzzy rules (rule nodes), Therefore, using RFNN-DPS, only one network is needed to construct a fuzzy logic system with the abilities of parallel inference and reinforcement learning, Basically, the information for prediction in RFNN-DPS is composed of credit values stored in fuzzy rule nodes, where each node holds a credit vector to represent the reliability of the corresponding fuzzy rule, The credit values are not only accessed for predicting external reinforcement signals, but also provide a more profitable internal reinforcement signal to each fuzzy rule itself, RFNN-DPS performs a credit-based exploratory algorithm to adjust its internal status according to the internal reinforcement signal, During learning, the RFNN-DPS network is constructed by a single-step or multistep reinforcement learning algorithm based on the ART concept, According to our experimental results, RFNN-DPS shows the advantages of simple network structure,fast learning speed, and explicit representation of rule reliability.

引用

页码：160 / 172

页数：13

共 18 条

[1]

Anderson C. W., 1987, Proceedings of the Fourth International Workshop on Machine Learning, P103

[2]

ANDERSON CW, 1990, NEURAL NETWORKS CONT

[3]

[Anonymous], 1992, NEURAL NETWORKS FUZZ

[4]

[Anonymous], 1991, FUZZY SET THEORY ITS

[5] NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].

BARTO, AG ;

SUTTON, RS ;

ANDERSON, CW .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846

[6] PATTERN-RECOGNIZING STOCHASTIC LEARNING AUTOMATA [J].

BARTO, AG ;

ANANDAN, P .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1985, 15 (03) :360-375

[7]

Franklin J. A., 1989, Proceedings. IEEE International Symposium on Intelligent Control 1989 (Cat. No.89TH0282-4), P115, DOI 10.1109/ISIC.1989.238708

[8]

Freeman J., 1992, NEURAL NETWORKS ALGO

[9] A STOCHASTIC REINFORCEMENT LEARNING ALGORITHM FOR LEARNING REAL-VALUED FUNCTIONS [J].

GULLAPALLI, V .

NEURAL NETWORKS, 1990, 3 (06) :671-692

[10]

LIN CJ, 1995, P 4 IEEE C FUZZ SYST, V3, P1299

← 1 2 →