多变量环境下基于递阶模糊神经网络的强化学习

被引：2

作者：

张文志

吕恬生

王乐天

机构：

[1] 上海交通大学机械与动力工程学院

[2] 上海交通大学机械与动力工程学院上海

[3] 上海

来源：

上海交通大学学报 | 2004年 / 09期

关键词：

模糊系统; 递阶模糊神经网络; 强化学习; 二级倒立摆;

D O I：

10.16183/j.cnki.jsjtu.2004.09.030

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

针对多变量连续空间学习问题的复杂性,给出了一种采用递阶模糊神经网络(HFNN)的强化学习方法,两个结构相同的HFNN分别同时完成模糊动作的合成以及值函数的逼近,网络参数通过梯度下降法在线调整.该方法有效地解决了在多变量环境下所遇到的规则组合爆炸问题,减少了运算量和存储量.HFNN前一阶的输出不再作为下一阶的前件,而直接用于其结论部分,克服了前一阶输出含义不明确或没有含义所带来的设计问题.通过仿真二级倒立摆验证表明,所给出方法是正确可行的.

引用

页码：1557 / 1561

页数：5

共 8 条

[1]

Reinforcement structure/parameter learning for neural-network-based fuzzy logic control sys-tems. LIN C T,LEE C S G. IEEE Transactions on Fuzzy Systems . 1994

[2]

Neural -like adaptive elements that can solve difficult learning control problem. Barto Andrew G,Sutton Richard S. IEEE Transactions on Systems Man and Cybernetics . 1983

[3]

Using fuzzy logic for performance evaluation in reinforcement learning. BERENJI H R,KHEDKAR P S. In-ternational Journal of Approximate Reasoning . 1998

[4]

Fuzzy inference system learning by reinforcement methods. Jouffe L. IEEE Transactions on Systems Man and Cybernetics . 1998

[5]

A note on universal approximation by hierarchical fuzzy systems. Chen W,Wang Li -xin. Journal of Information Science . 2000

[6]

Analysis and design of hierarchical fuzzy systems. Wang Li -xin. IEEE Transactions on Fuzzy Systems . 1999

[7]

Hierarchical fuzzy control. RAJU G V S,ZHOU G,KISNER R A. International Journal of Control . 1991

[8]

Universal approximation by hierarchical fuzzy system with constraints on the fuzzy rule. Joo Moon G,Lee Jin S. Fuzzy Sets and Systems . 2002

← 1 →