OPTIMALITY OF STRUCTURED POLICIES IN COUNTABLE STAGE DECISION PROCESSES

被引:36
作者
PORTEUS, EL [1 ]
机构
[1] STANFORD UNIV,GRAD SCH BUSINESS,STANFORD,CA 94305
关键词
MANAGEMENT SCIENCE - OPTIMIZATION - PROBABILITY;
D O I
10.1287/mnsc.22.2.148
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
Multistage decision processes are considered, in notation which is an outgrowth of that introduced by Denardo. Certain Markov decision processes, stochastic games, and ris-sensitive Markov decision processes can be formulated in this notation. Conditions are identified which are sufficient to prove that, in infinite horizon nonstationary processes, the optimal infinite horizon (present) value exists, is uniquely defined, is that is called ″structured,″ and can be found by solving Bellman's optimality equations; epsilon -optimal strategies exist; an optimal strategy can be found by applying Bellman's optimality criterion; and a specially identified kind of policy, called a ″structured″ policy is optimal in each stage.
引用
收藏
页码:148 / 157
页数:10
相关论文
共 16 条
[1]  
[Anonymous], 1957, GAMES DECIS
[2]  
Bellman R., 1957, DYNAMIC PROGRAMMING
[3]  
Blackwell D., 1965, ANN MATH STAT, V36, P226
[4]   CONTRACTION MAPPINGS IN THEORY UNDERLYING DYNAMIC PROGRAMMING [J].
DENARDO, EV .
SIAM REVIEW, 1967, 9 (02) :165-&
[5]  
DERMAN C, 1963, MATHEMATICAL OPTIMIZ
[6]  
HINDERER K, 1970, F NONSTATIONARY DYNA
[7]  
Hordijk A, 1974, DYNAMIC PROGRAMMING
[8]   RISK-SENSITIVE MARKOV DECISION PROCESSES [J].
HOWARD, RA ;
MATHESON, JE .
MANAGEMENT SCIENCE SERIES A-THEORY, 1972, 18 (07) :356-369
[9]  
JAQUETTE S, TO BE PUBLISHED
[10]  
Maitra A, 1968, SANKHYA A, V30, P211