Optimal control of terminal processes using neural networks

Cited by: 22

Author:

Plumer, ES

Affiliation:

[1] Los Alamos National Laboratory, Los Alamos

Source:

IEEE TRANSACTIONS ON NEURAL NETWORKS | 1996 / Vol. 7 / No. 2

Funding:

National Science Foundation (USA);

DOI:

10.1109/72.485676

Chinese Library Classification (CLC) number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

Feedforward neural networks are capable of approximating continuous multivariate functions and, as such, can implement nonlinear state-feedback controllers. Training methods such as backpropagation-through-time (BPTT), however, do not deal with terminal control problems in which the specified cost function includes the elapsed trajectory time. In this paper, an extension to BPTT is proposed which addresses this limitation. The controller design is reformulated as a constrained optimization problem that is defined over the entire field of extremals and that incorporates the set of trajectory times into the cost function. Necessary first-order stationary conditions are derived which correspond to standard BPTT with the addition of certain transversality conditions. The new gradient algorithm based on these conditions, called time-optimal backpropagation through time (TOBPTT), is tested on two benchmark minimum-time control problems.
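The idea of differentiating a cost that includes the trajectory time can be illustrated with a small sketch. This is not the paper's TOBPTT algorithm; it is a simplified stand-in that assumes a double-integrator plant, a single tanh unit as the state-feedback controller, a fixed number of Euler steps whose length is `T / n_steps`, and a quadratic terminal penalty in place of a hard terminal constraint. The function name `rollout_cost_and_grad` and all parameter names and default values are hypothetical. The backward pass is ordinary BPTT, with one extra accumulated term giving the derivative of the cost with respect to the final time `T`.

```python
import math


def rollout_cost_and_grad(w1, w2, b, T, n_steps=50, penalty=10.0, x0=1.0, v0=0.0):
    """Cost J = T + penalty * |final state|^2 and its gradient (illustrative only).

    Assumed plant: x' = v, v' = u, with u = tanh(w1*x + w2*v + b).
    Assumed discretization: forward Euler with step dt = T / n_steps.
    """
    dt = T / n_steps

    # Forward pass: simulate and store the trajectory for the backward pass.
    xs, vs, us = [x0], [v0], []
    for _ in range(n_steps):
        x, v = xs[-1], vs[-1]
        u = math.tanh(w1 * x + w2 * v + b)
        us.append(u)
        xs.append(x + dt * v)
        vs.append(v + dt * u)
    cost = T + penalty * (xs[-1] ** 2 + vs[-1] ** 2)

    # Backward pass: adjoints start from the terminal penalty.
    lam_x = 2.0 * penalty * xs[-1]
    lam_v = 2.0 * penalty * vs[-1]
    g_w1 = g_w2 = g_b = 0.0
    g_T = 1.0  # direct dependence of the cost on T
    for k in reversed(range(n_steps)):
        x, v, u = xs[k], vs[k], us[k]
        # Each Euler step depends on dt = T / n_steps, hence on T.
        g_T += (lam_x * v + lam_v * u) / n_steps
        # Sensitivity of the cost to the controller pre-activation at step k.
        d_a = lam_v * dt * (1.0 - u * u)
        g_w1 += d_a * x
        g_w2 += d_a * v
        g_b += d_a
        # Propagate the adjoints one step back in time.
        lam_x, lam_v = lam_x + d_a * w1, lam_v + lam_x * dt + d_a * w2
    return cost, (g_w1, g_w2, g_b, g_T)
```

A plain gradient-descent loop could then update `w1`, `w2`, `b`, and `T` together from the returned gradient. Treating `T` as one more trainable quantity is the design choice being illustrated here; the transversality conditions and the field-of-extremals formulation described in the abstract are not reproduced.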

Pages: 408-418

Page count: 11
