RECENT RESULTS ON CONDITIONS FOR THE EXISTENCE OF AVERAGE OPTIMAL STATIONARY POLICIES

被引:11
作者
Cavazos-Cadena, Rolando [1 ]
机构
[1] Univ Autonoma Agr Antonio Narro, Dept Estadist & Calculo, Buenavista 25315, Saltillo Coah, Mexico
关键词
Markov decision processes; average reward criterion; bounded solutions to the optimality equation; simultaneous Doeblin condition; equivalence of average reward criteria;
D O I
10.1007/BF02055572
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
This paper concerns countable state space Markov decision processes endowed with a (long-run expected) average reward criterion. For these models we summarize and, in some cases, extend some recent results on sufficient conditions to establish the existence of optimal stationary policies. The topics considered are the following: (i) the new assumptions introduced by Sennott in [20-23], (ii) necessary and sufficient conditions for the existence of a bounded solution to the optimality equation, and (iii) equivalence of average optimality criteria. Some problems are posed.
引用
收藏
页码:3 / 27
页数:25
相关论文
共 27 条
[1]  
Ash R. B., 2014, REAL ANAL PROBABILIT
[2]   2 COMPETING QUEUES WITH LINEAR COSTS AND GEOMETRIC SERVICE REQUIREMENTS - THE MU-C-RULE IS OFTEN OPTIMAL [J].
BARAS, JS ;
DORSEY, AJ ;
MAKOWSKI, AM .
ADVANCES IN APPLIED PROBABILITY, 1985, 17 (01) :186-209
[3]  
Borkar V.S., 1989, SIAM J CONTROL OPTIM, V27, P965
[4]   CONTROLLED MARKOV-CHAINS AND STOCHASTIC NETWORKS [J].
BORKAR, VS .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1983, 21 (04) :652-666
[5]   ON MINIMUM COST PER UNIT TIME CONTROL OF MARKOV-CHAINS [J].
BORKAR, VS .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1984, 22 (06) :965-978
[6]  
Cavazos-Cadena R., 1989, J APPL MATH OPTIMIZA, V19, P97
[7]  
Cavazos-Cadena R., COMP RECENT AS UNPUB
[8]   NECESSARY AND SUFFICIENT CONDITIONS FOR A BOUNDED SOLUTION TO THE OPTIMALITY EQUATION IN AVERAGE REWARD MARKOV DECISION CHAINS [J].
CAVAZOSCADENA, R .
SYSTEMS & CONTROL LETTERS, 1988, 10 (01) :71-78
[9]  
CAVAZOSCADENA R, 1989, KYBERNETIKA, V25, P145
[10]  
CAVAZOSCADENA R, 1991, KYBERNETIKA, V27, P23