OPTIMAL INFINITE-HORIZON UNDISCOUNTED CONTROL OF FINITE PROBABILISTIC SYSTEMS

被引:42
作者
PLATZMAN, LK [1 ]
机构
[1] BELL TEL LABS INC,NAPERVILLE,IL 60540
关键词
D O I
10.1137/0318028
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
A finite-point, finite-state, finite-output stochastic control problem with imperfect state observation and-classical information pattern is shown to be meaningful as the horizon increases without bound and the discount rate approaches unity. The plant model, a finite probabilistic system, includes the Markov decision and partially-observed Markov decision problems as special cases. Under conditions resembling controllability and observability in linear systems it is shown that: an optimal strategy exists, it may be realized by a stationary policy on the state estimate, its performance does not depend on the initial state distribution, and convergence rates for its finite-horizon and discounted performances are readily established.
引用
收藏
页码:362 / 380
页数:19
相关论文
共 33 条
[2]   OPTIMAL CONTROL OF MARKOV PROCESSES WITH INCOMPLETE STATE-INFORMATION .2. CONVEXITY OF LOSSFUNCTION [J].
ASTROM, KJ .
JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1969, 26 (02) :403-&
[4]   DUAL EFFECT, CERTAINTY EQUIVALENCE, AND SEPARATION IN STOCHASTIC CONTROL [J].
BARSHALOM, Y ;
TSE, E .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (05) :494-500
[5]   A MARKOVIAN DECISION PROCESS [J].
BELLMAN, R .
JOURNAL OF MATHEMATICS AND MECHANICS, 1957, 6 (05) :679-684
[6]  
Bertsekas D., 1976, DYNAMIC PROGRAMMING
[7]  
BROOKS DM, 1973, OPERATIONS RES, V21, P904
[8]   2-ARMED-BANDIT PROBLEM WITH TIME-INVARIANT FINITE MEMORY [J].
COVER, TM ;
HELLMAN, ME .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1970, 16 (02) :185-+
[9]  
DeGroot, 1970, OPTIMAL STAT DECISIO, V82
[10]  
Derman C., 1970, FINITE STATE MARKOVI