Adaptive dynamic programming with an ε-error bound for nonlinear discrete-time systems

Cited by: 3
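Authors
Lin Xiaofeng (林小峰)
Zhang Heng (张衡)
Song Shaojian (宋绍剑)
Song Chunning (宋春宁)
Affiliation
[1] School of Electrical Engineering, Guangxi University
Keywords
optimal control; discrete-time systems; adaptive dynamic programming; neural networks; ε-error bound
DOI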
10.13195/j.cd.2011.10.149.linxf.019
Chinese Library Classification (CLC) number
TP13 [Automatic control theory]
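Abstract
To obtain an optimal control policy for nonlinear discrete-time systems, an adaptive dynamic programming method with an ε-error bound is proposed, based on the principle of adaptive dynamic programming. For an arbitrary state, a finite-length control sequence is used to approximate the optimal control sequence, so that the error between the resulting performance index and the optimal performance index stays within a small bound. Numerical experiments on a nonlinear discrete-time system verify the effectiveness of the algorithm, which obtains a near-optimal control policy at a low computational cost.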
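The idea in the abstract (stop extending the control sequence once the performance index is close enough to optimal) can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the scalar plant `step`, the quadratic stage cost `utility`, the grids, and the use of tabular value iteration with linear interpolation in place of neural networks. The stopping test compares successive iterates, which is only a practical stand-in for the unknown gap to the optimal performance index and does not by itself guarantee that gap is below ε.

```python
import math

def step(x, u):
    # Hypothetical scalar nonlinear plant (assumption, not the paper's example).
    return 0.8 * math.sin(x) + u

def utility(x, u):
    # Assumed quadratic stage cost U(x, u) = x^2 + u^2.
    return x * x + u * u

def make_grid(lo, hi, n):
    # Uniform grid of n points on [lo, hi].
    return [lo + (hi - lo) * i / (n - 1) for i in range(n)]

def interp(values, lo, hi, x):
    # Piecewise-linear interpolation on a uniform grid, clamped to [lo, hi].
    n = len(values)
    t = (min(max(x, lo), hi) - lo) / (hi - lo) * (n - 1)
    j = min(int(t), n - 2)
    w = t - j
    return (1 - w) * values[j] + w * values[j + 1]

def eps_value_iteration(eps, x_lo=-2.0, x_hi=2.0, n_x=41, n_u=21, max_iter=500):
    """Value iteration from V_0 = 0 with a per-state stopping test.

    horizon[j] is the first i with V_{i+1}(x_j) - V_i(x_j) <= eps, i.e. the
    length of the control sequence after which one more step changes the
    cost at x_j by at most eps. Entries stay None if max_iter is reached.
    """
    xs = make_grid(x_lo, x_hi, n_x)
    us = make_grid(-1.0, 1.0, n_u)
    history = [[0.0] * n_x]          # V_0(x) = 0 for every grid state
    horizon = [None] * n_x
    for i in range(max_iter):
        v = history[-1]
        # V_{i+1}(x) = min_u [ U(x, u) + V_i(F(x, u)) ]
        v_next = [
            min(utility(x, u) + interp(v, x_lo, x_hi, step(x, u)) for u in us)
            for x in xs
        ]
        history.append(v_next)
        for j in range(n_x):
            if horizon[j] is None and v_next[j] - v[j] <= eps:
                horizon[j] = i
        if all(h is not None for h in horizon):
            break
    return xs, history, horizon

if __name__ == "__main__":
    xs, history, horizon = eps_value_iteration(eps=1e-3)
    for x, k, v in zip(xs[::10], horizon[::10], history[-1][::10]):
        print(f"x = {x:+.1f}  sequence length = {k}  cost = {v:.4f}")
```

In this sketch, states near the origin need shorter control sequences than states far from it, which is where the saving in computation comes from: iteration stops per state as soon as the test is met instead of running a fixed number of sweeps everywhere.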
Pages: 1586-1590, 1595
Number of pages: 6