Optimal control of unknown nonaffine nonlinear discrete-time systems based on adaptive dynamic programming

被引：341

作者：

Wang, Ding ^{[1
]}

Liu, Derong ^{[1
]}

Wei, Qinglai ^{[1
]}

Zhao, Dongbin ^{[1
]}

Jin, Ning ^{[2
]}

机构：

[1] Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China

[2] Univ Illinois, Dept Elect & Comp Engn, Chicago, IL 60607 USA

来源：

AUTOMATICA | 2012年 / 48卷 / 08期

基金：

北京市自然科学基金; 中国国家自然科学基金;

关键词：

Adaptive critic designs; Adaptive dynamic programming; Approximate dynamic programming; Globalized dual heuristic programming; Intelligent control; Neural network; Optimal control; FEEDBACK-CONTROL; REINFORCEMENT; ADP;

D O I：

10.1016/j.automatica.2012.05.049

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

An intelligent-optimal control scheme for unknown nonaffine nonlinear discrete-time systems with discount factor in the cost function is developed in this paper. The iterative adaptive dynamic programming algorithm is introduced to solve the optimal control problem with convergence analysis. Then, the implementation of the iterative algorithm via globalized dual heuristic programming technique is presented by using three neural networks, which will approximate at each iteration the cost function, the control law, and the unknown nonlinear system, respectively. In addition, two simulation examples are provided to verify the effectiveness of the developed optimal control approach. (C) 2012 Elsevier Ltd. All rights reserved.

引用

页码：1825 / 1832

页数：8

共 29 条

[1] Nearly optimal control laws for nonlinear systems with saturating actuators using a neural network HJB approach [J].

Abu-Khalaf, M ;

Lewis, FL .

AUTOMATICA, 2005, 41 (05) :779-791

[2] Discrete-time nonlinear HJB solution using approximate dynamic programming: Convergence proof [J].

Al-Tamimi, Asma ;

Lewis, Frank L. ;

Abu-Khalaf, Murad .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04) :943-949

[3]

[Anonymous], 1996, Neuro-dynamic programming

[4] Issues on stability of ADP feedback controllers for dynamical systems [J].

Balakrishnan, S. N. ;

Ding, Jie ;

Lewis, Frank L. .

IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS, 2008, 38 (04) :913-917

[5] Adaptive-critic-based neural networks for aircraft optimal control [J].

Balakrishnan, SN ;

Biega, V .

JOURNAL OF GUIDANCE CONTROL AND DYNAMICS, 1996, 19 (04) :893-898

[6] Galerkin approximations of the generalized Hamilton-Jacobi-Bellman equation [J].

Beard, RW ;

Saridis, GN ;

Wen, JT .

AUTOMATICA, 1997, 33 (12) :2159-2177

[7]

Bellman R. E., 1957, Dynamic programming. Princeton landmarks in mathematics

[8] On infinite-time nonlinear quadratic optimal control [J].

Chen, Y ;

Edgar, T ;

Manousiouthakis, V .

SYSTEMS & CONTROL LETTERS, 2004, 51 (3-4) :259-268

[9] Optimal control of unknown affine nonlinear discrete-time systems using offline-trained neural networks with proof of convergence [J].

Dierks, Travis ;

Thumati, Balaje T. ;

Jagannathan, S. .

NEURAL NETWORKS, 2009, 22 (5-6) :851-860

[10]

Jagannathan S., 2006, Neural Network Control of Nonlinear Discrete-Time Systems

← 1 2 3 →