OPTIMAL CONTROL OF PARTIALLY OBSERVABLE MARKOV PROCESSES OVER A FINITE HORIZON

被引:755
作者
SMALLWOOD, RD
SONDIK, EJ
机构
[1] STANFORD UNIV, STANFORD, CA 94305 USA
[2] XEROX PALO ALTO RES CTR, PALO ALTO, CA 94300 USA
关键词
MAINTENANCE - MATHEMATICAL MODELS - PROBABILITY;
D O I
10.1287/opre.21.5.1071
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
The paper formulates the optimal control problem for a class of mathematical models in which the system to be controlled is characterized by a finite-state discrete-time Markov process. The formulation is illustrated by a simple machine-maintenance example, and other specific application areas are also discussed. The paper demonstrates that, if there are only a finite number of control intervals remaining, then the optimal payoff function is a piecewise-linear, convex function of the current state probabilities of the internal Markov process.
引用
收藏
页码:1071 / 1088
页数:18
相关论文
共 9 条
[1]  
DRAKE A, 1962, THESIS MASSACHUSETTS
[2]   OPTIMUM MAINTENANCE WITH INCOMPLETE INFORMATION [J].
ECKLES, JE .
OPERATIONS RESEARCH, 1968, 16 (05) :1058-&
[3]  
Howard R., 1960, DYNAMIC PROGRAMMING
[4]  
MATHESON J, 1964, CCS2 EES TECHN REP
[5]   A SIMPLE MODEL OF SEARCH FOR A MOVING TARGET [J].
POLLOCK, SM .
OPERATIONS RESEARCH, 1970, 18 (05) :883-&
[6]   ANALYSIS OF ECONOMIC TEACHING STRATEGIES FOR A SIMPLE LEARNING MODE [J].
SMALLWOOD, RD .
JOURNAL OF MATHEMATICAL PSYCHOLOGY, 1971, 8 (02) :285-+
[7]  
SMALLWOOD RD, 1967, NONR22584 STANF U DE
[8]  
SMALLWOOD RD, 1970, COMPUTER ASSISTED IN
[9]  
Sondik E. J., 1971, THESIS STANFORD U