EQUIVALENCE OF LYAPUNOV STABILITY-CRITERIA IN A CLASS OF MARKOV DECISION-PROCESSES

被引:12
作者
CAVAZOSCADENA, R [1 ]
HERNANDEZLERMA, O [1 ]
机构
[1] IST POLYTECH NACL,CINVESTAV,DEPT MATEMAT,MEXICO CITY 07000,DF,MEXICO
关键词
MARKOV DECISION PROCESSES; AVERAGE REWARD CRITERION; LYAPUNOV STABILITY CRITERIA; LYAPUNOV FUNCTION CONDITION; CONTINUITY OF THE INVARIANT DISTRIBUTION; RENEWAL THEORY;
D O I
10.1007/BF01189027
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
We are concerned with Markov decision processes with countable state space and discrete-time parameter. The main structural restriction on the model is the following: under the action of any stationary policy the state space is a communicating class. In this context, we prove the equivalence of ten stability/ergodicity conditions on the transition law of the model, which imply the existence of average optimal stationary policies for an arbitrary continuous and bounded reward function; these conditions include the Lyapunov function condition (LFC) introduced by A. Hordijk. As a consequence of our results, the LFC is proved to be equivalent to the following: under the action of any stationary policy the corresponding Markov chain has a unique invariant distribution which depends continuously on the stationary policy being used. A weak form of the latter condition was used by one of the authors to establish the existence of optimal stationary policies using an approach based on renewal theory.
引用
收藏
页码:113 / 137
页数:25
相关论文
共 13 条
[1]  
Ash R. B., 2014, REAL ANAL PROBABILIT
[2]   EXISTENCE OF OPTIMAL STATIONARY POLICIES IN AVERAGE REWARD MARKOV DECISION-PROCESSES WITH A RECURRENT STATE [J].
CAVAZOSCADENA, R .
APPLIED MATHEMATICS AND OPTIMIZATION, 1992, 26 (02) :171-194
[3]  
Dugundji J., 1977, TOPOLOGY
[4]   ON THE STOCHASTIC MATRICES ASSOCIATED WITH CERTAIN QUEUING PROCESSES [J].
FOSTER, FG .
ANNALS OF MATHEMATICAL STATISTICS, 1953, 24 (03) :355-360
[5]  
Hernandez-Lerma O., 1989, ADAPTIVE MARKOV CONT
[6]  
HINDERER K, 1970, F NONSTATIONARY DYNA
[7]  
HORDJIK A, 1977, MATH CTR TRACTS, V51
[8]  
KOLONKO M, 1982, MATH OPERATIONSFORSH, V18, P567
[9]  
Loeve M., 1977, PROBABILITY THEORY, V1
[10]  
MANDL P, 1979, PROBABILITY THEORY, V5, P159