THE EXISTENCE OF SENSITIVE OPTIMAL POLICIES IN TWO MULTI-DIMENSIONAL QUEUEING MODELS

Cited by: 5

Authors:

Spieksma, Flos [1]

Affiliations:

[1] Leiden Univ, Dept Math & Comp Sci, NL-2333 CA Leiden, Netherlands

Source:

ANNALS OF OPERATIONS RESEARCH | 1991 / Vol. 28 / Issue 01

Keywords:

Sensitive optimal policies; mu-uniform geometric convergence and recurrence; bounding vector; K competing queues; open Jackson network;

DOI:

10.1007/BF02055586

Chinese Library Classification:

C93 [Management Science]; O22 [Operations Research];

Subject classification codes:

070105 ; 12 ; 1201 ; 1202 ; 120202 ;

Abstract:

Recently Dekker and Hordijk [3,4] introduced conditions for the existence of deterministic Blackwell optimal policies in denumerable Markov decision chains with unbounded rewards. These conditions include mu-uniform geometric recurrence. The mu-uniform geometric recurrence property also implies the existence of average optimal policies, a solution to the average optimality equation with explicit formulas, and convergence of the value iteration algorithm for average rewards. For this reason, the verification of mu-uniform geometric convergence is also useful in cases where average and alpha-discounted rewards are considered. On the other hand, mu-uniform geometric recurrence is a heavy condition on the Markov decision chain structure for negative dynamic programming problems. The verification of mu-uniform geometric recurrence for the Markov chain induced by some deterministic policy, together with results by Sennott [14], yields the existence of a deterministic policy that minimizes the expected average cost for non-negative immediate cost functions. In this paper mu-uniform geometric recurrence will be proved for two queueing models: the K competing queues and the two-centre open Jackson network with control of the service rates.


Pages: 273-295

Page count: 23

References (20 in total):

[1] Baras J. S., 1985, SYSTEMS CONTROL LETT, V6, P186

[2] THE C-MU RULE REVISITED [J]. BUYUKKOC, C; VARAIYA, P; WALRAND, J. ADVANCES IN APPLIED PROBABILITY, 1985, 17 (01): 237-238

[3] AVERAGE, SENSITIVE AND BLACKWELL OPTIMAL POLICIES IN DENUMERABLE MARKOV DECISION CHAINS WITH UNBOUNDED REWARDS [J]. DEKKER, R; HORDIJK, A. MATHEMATICS OF OPERATIONS RESEARCH, 1988, 13 (03): 395-420

[4] Dekker R., 1990, RELATION RE IN PRESS

[5] Dekker R., 1989, MATH OPERAT IN PRESS

[6] Hordijk A., 1990, CONVERGENCE IN PRESS

[7] Hordijk A., 1989, ERGODICITY REC UNPUB

[8] KENDALL DG, 1960, MATH METHODS SOCIAL, P176

[9] Klimov G. P., 1974, Theory of Probability and Its Applications, V19, P532, DOI 10.1137/1119060

[10] CONDITIONS FOR EXISTENCE OF AVERAGE AND BLACKWELL OPTIMAL STATIONARY POLICIES IN DENUMERABLE MARKOV DECISION-PROCESSES [J]. LASSERRE, JB. JOURNAL OF MATHEMATICAL ANALYSIS AND APPLICATIONS, 1988, 136 (02): 479-489
