THE EXISTENCE OF SENSITIVE OPTIMAL POLICIES IN TWO MULTI-DIMENSIONAL QUEUEING MODELS

被引:5
作者
Spieksma, Flos [1 ]
机构
[1] Leiden Univ, Dept Math & Comp Sci, NL-2333 CA Leiden, Netherlands
关键词
Sensitive optimal policies; mu-uniform geometric convergence and recurrence; bounding vector; K competing queues; open Jackson network;
D O I
10.1007/BF02055586
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Recently Dekker and Hordijk [3,4] introduced conditions for the existence of deterministic Blackwell optimal policies in denumerable Markov decision chains with unbounded rewards. These conditions include mu-uniform geometric recurrence. The mu-uniform geometric recurrence property also implies the existence of average optimal policies, a solution to the average optimality equation with explicit formula's and convergence of the value iteration algorithm for average rewards. For this reason, the verification of mu-uniform geometric convergence is also useful in cases where average and alpha-discounted rewards are considered. On the other hand, mu-uniform geometric recurrence is a heavy condition on the Markov decision chain structure for negative dynamic programming problems. The verification of mu-uniform geometric recurrence for the Markov chain induced by some deterministic policy together with results by Sennott [14] yields the existence of a deterministic policy that minimizes the expected average cost for non-negative immediate cost functions. In this paper mu-uniform geometric recurrence will be proved for two queueing models: the K competing queues and the two centre open Jackson network with control of the service rates.
引用
收藏
页码:273 / 295
页数:23
相关论文
共 20 条
[1]  
Baras J. S., 1985, SYSTEMS CONTROL LETT, V6, P186
[2]   THE C-MU RULE REVISITED [J].
BUYUKKOC, C ;
VARAIYA, P ;
WALRAND, J .
ADVANCES IN APPLIED PROBABILITY, 1985, 17 (01) :237-238
[3]   AVERAGE, SENSITIVE AND BLACKWELL OPTIMAL POLICIES IN DENUMERABLE MARKOV DECISION CHAINS WITH UNBOUNDED REWARDS [J].
DEKKER, R ;
HORDIJK, A .
MATHEMATICS OF OPERATIONS RESEARCH, 1988, 13 (03) :395-420
[4]  
Dekker R., 1990, RELATION RE IN PRESS
[5]  
Dekker R., 1989, MATH OPERAT IN PRESS
[6]  
Hordijk A., 1990, CONVERGENCE IN PRESS
[7]  
Hordijk A., 1989, ERGODICITY REC UNPUB
[8]  
KENDALL DG, 1960, MATH METHODS SOCIAL, P176
[9]  
Klimov G. P., 1974, Theory of Probability and Its Applications, V19, P532, DOI 10.1137/1119060