A FORWARD METHOD FOR OPTIMAL STOCHASTIC NONLINEAR AND ADAPTIVE-CONTROL

被引:32
作者
BAYARD, DS
机构
[1] Jet Propulsion Laboratory, California Institute of Technology
关键词
D O I
10.1109/9.83535
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A computational approach is taken to solve the optimal partially observed nonlinear stochastic control problem. The approach is to systematically solve the stochastic dynamic programming equations forward in time, using a nested stochastic approximation technique. Although computationally intensive, this provides a straightforward numerical solution for this class of problems and provides an alternative to the usual "curse of dimensionality" associated with solving the dynamic programming equations backwards in time. In particular, the "curse" is seen to take a new form, where the amount of computation depends on the amount of uncertainty in the problem and the length of the horizon. As a matter of more practical interest, it is shown that the cost degrades monotonically as the complexity of the algorithm is reduced. This provides a strategy for suboptimal control with clear performance/computation trade-offs. Due to a strong connection with the work of Bellman on policy iteration, the method is denoted as the iteration in policy space (IPS) algorithm. A numerical study focusing on a generic optimal stochastic adaptive control example is included to demonstrate the feasibility of the method.
引用
收藏
页码:1046 / 1053
页数:8
相关论文
共 31 条
[1]   NONLINEAR BAYESIAN ESTIMATION USING GAUSSIAN SUM APPROXIMATIONS [J].
ALSPACH, DL ;
SORENSON, HW .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1972, AC17 (04) :439-&
[2]  
[Anonymous], 1961, ADAPTIVE CONTROL PRO
[3]  
ASTROM KJ, 1971, J MATH ANAL APPL, V34
[4]  
ASTROM KJ, 1983, OUTILS MODELES MATH, V3, P741
[5]  
Bar-Shalom Y., 1976, CONTROL DYNAMIC SYST, V12, P99, DOI 10.1016/B978-0-12-012712-2.50009-3
[6]   IMPLICIT DUAL CONTROL FOR GENERAL STOCHASTIC-SYSTEMS [J].
BAYARD, DS ;
ESLAMI, M .
OPTIMAL CONTROL APPLICATIONS & METHODS, 1985, 6 (03) :265-279
[7]  
BAYARD DS, 1987, JPL D4902 INT DOC
[8]  
BAYARD DS, 1987, IEEE T AUTOMAT CONTR, V32
[9]  
BAYARD DS, 1984, THESIS STATE U NEW Y
[10]  
BAYARD DS, 1983, ADAPTIVE SYSTEMS CON