A parallel fuzzy inference model with distributed prediction scheme for reinforcement learning

被引:13
作者
Kuo, YH [1 ]
Hsu, JP [1 ]
Wang, CW [1 ]
机构
[1] Natl Cheng Kung Univ, Inst Informat Engn, Tainan 70101, Taiwan
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 1998年 / 28卷 / 02期
关键词
D O I
10.1109/3477.662757
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper proposes a three-layered parallel fuzzy inference model called reinforcement fuzzy neural network with distributed prediction scheme (RFNN-DPS), which performs reinforcement learning with a novel distributed prediction scheme, In RFNN-DPS, an additional predictor for predicting the external reinforcement signal is mot necessary, and the internal reinforcement information is distributed into fuzzy rules (rule nodes), Therefore, using RFNN-DPS, only one network is needed to construct a fuzzy logic system with the abilities of parallel inference and reinforcement learning, Basically, the information for prediction in RFNN-DPS is composed of credit values stored in fuzzy rule nodes, where each node holds a credit vector to represent the reliability of the corresponding fuzzy rule, The credit values are not only accessed for predicting external reinforcement signals, but also provide a more profitable internal reinforcement signal to each fuzzy rule itself, RFNN-DPS performs a credit-based exploratory algorithm to adjust its internal status according to the internal reinforcement signal, During learning, the RFNN-DPS network is constructed by a single-step or multistep reinforcement learning algorithm based on the ART concept, According to our experimental results, RFNN-DPS shows the advantages of simple network structure,fast learning speed, and explicit representation of rule reliability.
引用
收藏
页码:160 / 172
页数:13
相关论文
共 18 条
[1]  
Anderson C. W., 1987, Proceedings of the Fourth International Workshop on Machine Learning, P103
[2]  
ANDERSON CW, 1990, NEURAL NETWORKS CONT
[3]  
[Anonymous], 1992, NEURAL NETWORKS FUZZ
[4]  
[Anonymous], 1991, FUZZY SET THEORY ITS
[5]   NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[6]   PATTERN-RECOGNIZING STOCHASTIC LEARNING AUTOMATA [J].
BARTO, AG ;
ANANDAN, P .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1985, 15 (03) :360-375
[7]  
Franklin J. A., 1989, Proceedings. IEEE International Symposium on Intelligent Control 1989 (Cat. No.89TH0282-4), P115, DOI 10.1109/ISIC.1989.238708
[8]  
Freeman J., 1992, NEURAL NETWORKS ALGO
[9]   A STOCHASTIC REINFORCEMENT LEARNING ALGORITHM FOR LEARNING REAL-VALUED FUNCTIONS [J].
GULLAPALLI, V .
NEURAL NETWORKS, 1990, 3 (06) :671-692
[10]  
LIN CJ, 1995, P 4 IEEE C FUZZ SYST, V3, P1299