Value iteration in a class of average controlled Markov chains with unbounded costs: Necessary and sufficient conditions for pointwise convergence

被引:8
作者
CavazosCadena, R
FernandezGaucherand, E
机构
[1] UNIV AUTONOMA AGRARIA ANTONIO NARRO, DEPT ESTADIST & CALCULO, SALTILLO 25315, COAHUILA, MEXICO
[2] UNIV ARIZONA, DEPT IND & SYST ENGN, TUCSON, AZ 85721 USA
关键词
controlled Markov chains; average cost criterion; Lyapunov function condition; value iteration scheme; pointwise convergence; necessary and sufficient conditions;
D O I
10.2307/3214980
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
This work concerns controlled Markov chains with denumerable state space, (possibly) unbounded cost function, and an expected average cost criterion. Under a Lyapunov function condition, together with mild continuity-compactness assumptions, a simple necessary and sufficient criterion is given so that the relative value functions and differential costs produced by the value iteration scheme converge pointwise to the solution of the optimality equation; this criterion is applied to obtain convergence results when the cost function is bounded below or bounded above.
引用
收藏
页码:986 / 1002
页数:17
相关论文
共 22 条
[1]  
[Anonymous], MATH CTR TRACT
[2]  
[Anonymous], 1991, PITMAN RES NOTES MAT
[3]   DISCRETE-TIME CONTROLLED MARKOV-PROCESSES WITH AVERAGE COST CRITERION - A SURVEY [J].
ARAPOSTATHIS, A ;
BORKAR, VS ;
FERNANDEZGAUCHERAND, E ;
GHOSH, MK ;
MARCUS, SI .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1993, 31 (02) :282-344
[4]  
Cavazos-Cadena R., 1995, ZOR-Mathematical Methods of Operations Research, V41, P89, DOI 10.1007/BF01415067
[5]   RECENT RESULTS ON CONDITIONS FOR THE EXISTENCE OF AVERAGE OPTIMAL STATIONARY POLICIES [J].
Cavazos-Cadena, Rolando .
ANNALS OF OPERATIONS RESEARCH, 1991, 28 (01) :3-27
[6]   EQUIVALENCE OF LYAPUNOV STABILITY-CRITERIA IN A CLASS OF MARKOV DECISION-PROCESSES [J].
CAVAZOSCADENA, R ;
HERNANDEZLERMA, O .
APPLIED MATHEMATICS AND OPTIMIZATION, 1992, 26 (02) :113-137
[7]  
CAVAZOSCADENA R, 1994, IN PRESS B SOCIEDAD
[8]  
CAVAZOSCADENA R, 1995, IN PRESS J MATH SYST
[9]  
Dugundji J., 1966, TOPOLOGY
[10]  
FEDERGRUEN A., 1978, DYNAMIC PROGRAMMING, P23, DOI [10.1016/B978-0-12-568150-6.50008-8, DOI 10.1016/B978-0-12-568150-6.50008-8]