OPTIMALITY OF STRUCTURED POLICIES IN COUNTABLE STAGE DECISION PROCESSES

被引：36

作者：

PORTEUS, EL ^{[1
]}

机构：

[1] STANFORD UNIV,GRAD SCH BUSINESS,STANFORD,CA 94305

来源：

MANAGEMENT SCIENCE | 1975年 / 22卷 / 02期

关键词：

MANAGEMENT SCIENCE - OPTIMIZATION - PROBABILITY;

D O I：

10.1287/mnsc.22.2.148

中图分类号：

C93 [管理学];

学科分类号：

12 ; 1201 ; 1202 ; 120202 ;

摘要：

Multistage decision processes are considered, in notation which is an outgrowth of that introduced by Denardo. Certain Markov decision processes, stochastic games, and ris-sensitive Markov decision processes can be formulated in this notation. Conditions are identified which are sufficient to prove that, in infinite horizon nonstationary processes, the optimal infinite horizon (present) value exists, is uniquely defined, is that is called ″structured,″ and can be found by solving Bellman's optimality equations; epsilon -optimal strategies exist; an optimal strategy can be found by applying Bellman's optimality criterion; and a specially identified kind of policy, called a ″structured″ policy is optimal in each stage.

引用

页码：148 / 157

页数：10

共 16 条

[1]

[Anonymous], 1957, GAMES DECIS

[2]

Bellman R., 1957, DYNAMIC PROGRAMMING

[3]

Blackwell D., 1965, ANN MATH STAT, V36, P226

[4] CONTRACTION MAPPINGS IN THEORY UNDERLYING DYNAMIC PROGRAMMING [J].

DENARDO, EV .

SIAM REVIEW, 1967, 9 (02) :165-&

[5]

DERMAN C, 1963, MATHEMATICAL OPTIMIZ

[6]

HINDERER K, 1970, F NONSTATIONARY DYNA

[7]

Hordijk A, 1974, DYNAMIC PROGRAMMING

[8] RISK-SENSITIVE MARKOV DECISION PROCESSES [J].

HOWARD, RA ;

MATHESON, JE .

MANAGEMENT SCIENCE SERIES A-THEORY, 1972, 18 (07) :356-369

[9]

JAQUETTE S, TO BE PUBLISHED

[10]

Maitra A, 1968, SANKHYA A, V30, P211

← 1 2 →