非线性离散时间系统带ε误差限的自适应动态规划

被引：3

作者：

林小峰

张衡

宋绍剑

宋春宁

机构：

[1] 广西大学电气工程学院

来源：

控制与决策 | 2011年 / 26卷 / 10期

关键词：

最优控制; 离散时间系统; 自适应动态规划; 神经网络; ε误差限;

D O I：

10.13195/j.cd.2011.10.149.linxf.019

中图分类号：

TP13 [自动控制理论];

学科分类号：

摘要：

为了获得非线性离散时间系统的最优控制策略,基于自适应动态规划的原理,提出了一种带误差限的自适应动态规划方法.对于一个任意的状态,用一个有限长度的控制序列近似最优控制序列,使性能指标与最优性能指标的误差在一个较小的范围内.选取一个非线性离散时间系统对算法的性能进行数值实验,结果验证了该算法的有效性,用较少的计算代价获得了近似最优的控制策略.

引用

页码：1586 / 1590+1595 +1595

页数：6

共 10 条

[1] Training Strategies for Critic and Action Neural Networks in Dual Heuristic Programming Method. Lendaris G G,Paintz C. Proc. of the IEEE International Conference on Neural Networks . 1997
[2] A menu of designs for reinforcement learningover time,in neural networks for control. Werbos P J. MIT Press . 1990
[3] ε-adaptive dynamic programming for discrete-time systems. Liu D,Jin N. Proceedings of the IEEE International Joint Conference on Neural Networks . 2008
[4] On-line learning control by association and reinforcement. Si J, Wang Y T. IEEE Transactions on Neural Networks . 2001
[5] Adaptive Critic Designs. Prokhorov D V,Wunsch D C. IEEE Transactions on Neural Networks . 1997
[6] Adaptive dynamicprogramming for finite-horizon optimal control of discrete-time nonlinear systems withε-error bound. Wang F Y,Jin N,Liu D R,et al. IEEE Transactions on Neural Networks . 2010
[7] Finite horizon discrete-time approximatedynamic programming. Liu D R,Jin N. Proc of the 2006 IEEE IntSymposium on Intelligent Control . 2006
[8] A novel infinite-time optimal tracking control scheme for a class ofdiscrete-time nonlinear systems via the greedy HDPiteration algorithm. Zhang H G,Wei Q L,Luo Y H. IEEE Trans on Systems,Man andCybernetics . 2008
[9] Adaptive dynamic programming fordiscrete-time systems with infinite horizon andε-errorbound in the performance cost. Liu D R,Jin N. Proc of Int Joint Confon Neural Networks . 2009
[10] Discrete-time non-linear HJB solution using approximate dynamic programming:con-vergence proof. TAMIMI A A,LEWIS F L,ABU-KHALAF M. IEEE Transactions on Systems,Man, and Cyber-netics,part B:Cybernetics . 2008

← 1 →