Optimal control of terminal processes using neural networks

被引:22
作者
Plumer, ES
机构
[1] Los Alamos National Laboratory, Los Alamos
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 1996年 / 7卷 / 02期
基金
美国国家科学基金会;
关键词
D O I
10.1109/72.485676
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feedforward neural networks are capable of approximating continuous multivariate functions and, as such, can implement nonlinear state-feedback controllers. Training methods such as backpropagation-through-time (BPTT), however, do not deal with terminal control problems in which the specified cost function includes the elapsed trajectory-time. In this paper, an extension to BPTT is proposed which addresses this limitation, The controller design is reformulated as a constrained optimization problem defined over the entire field of extremals and in which the set of trajectory times is incorporated into the cost function. Necessary first-order stationary conditions are derived which correspond to standard BPTT with the addition of certain transversality conditions. The new gradient algorithm based on these conditions, called time-optimal backpropagation through time (TOBPTT), is tested on two benchmark minimum-time control problems.
引用
收藏
页码:408 / 418
页数:11
相关论文
共 29 条
[1]  
[Anonymous], 2010, Dynamic programming
[2]  
[Anonymous], 1969, APPL OPTIMAL CONTROL
[3]   NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[4]  
Bell D.J., 1975, SINGULAR OPTIMAL CON
[5]  
BHAT NV, 1990, IEEE CONTROL SYS APR, P24
[6]  
Cun YL, 1988, P 1988 CONN MOD SUMM, P21
[7]  
GUEZ A, 1988, P IEEE INT C NEUR NE, V1, P617
[8]  
Hernandez E., 1990, Proceedings of the 1990 American Control Conference (IEEE Cat. No.90CH2896-9), P2454
[9]  
JACKSON DH, 1970, DIFFERENTIAL DYNAMIC
[10]  
JOSIN G, 1988, P INT JOINT C NEUR N, V2, P625