Value iteration and optimization of multiclass queueing networks

被引:2
作者
Rong-Rong Chen
Sean Meyn
机构
[1] University of Illinois and the Coordinated Science Laboratory,
来源
Queueing Systems | 1999年 / 32卷
关键词
multiclass queueing networks; Markov decision processes; optimal control; dynamic programming;
D O I
暂无
中图分类号
学科分类号
摘要
This paper considers in parallel the scheduling problem for multiclass queueing networks, and optimization of Markov decision processes. It is shown that the value iteration algorithm may perform poorly when the algorithm is not initialized properly. The most typical case where the initial value function is taken to be zero may be a particularly bad choice. In contrast, if the value iteration algorithm is initialized with a stochastic Lyapunov function, then the following hold: (i) a stochastic Lyapunov function exists for each intermediate policy, and hence each policy is regular (a strong stability condition), (ii) intermediate costs converge to the optimal cost, and (iii) any limiting policy is average cost optimal. It is argued that a natural choice for the initial value function is the value function for the associated deterministic control problem based upon a fluid model, or the approximate solution to Poisson’s equation obtained from the LP of Kumar and Meyn. Numerical studies show that either choice may lead to fast convergence to an optimal policy.
引用
收藏
页码:65 / 97
页数:32
相关论文
共 28 条
[1]  
Arapostathis A.(1993)Discretetime controlled Markov processes with average cost criterion: A survey SIAM J. Control Optim. 31 282-344
[2]  
Borkar V.S.(1991)Discrete flow networks: Bottlenecks analysis and fluid approximations Math. Oper. Res. 16 408-446
[3]  
Fernandez-Gaucherand E.(1995)On the positive Harris recurrence for multiclass queueing networks: A unified approach via fluid limit models Ann. Appl. Probab. 5 49-77
[4]  
Ghosh M.K.(1995)Stability and convergence of moments for multiclass queueing networks via fluid limit models IEEE Trans. Automat. Control 40 1889-1904
[5]  
Marcus S.I.(1989)Scheduling networks of queues: Heavy traffic analysis of a simple open network Queueing Systems 5 265-280
[6]  
Chen H.(1994)Performance bounds for queueing networks and scheduling policies IEEE Trans. Automat. Control 39 1600-1611
[7]  
Mandelbaum A.(1995)Stability of queueing networks and scheduling policies IEEE Trans. Automat. Control 40 251-260
[8]  
Dai J.G.(1996)Duality and linear programs for stability and performance analysis queueing networks and scheduling policies IEEE Trans. Automat. Control 41 4-17
[9]  
Dai J.G.(1990)Dynamic instabilities and stabilization methods in distributed realtime scheduling of manufacturing systems IEEE Trans. Automat. Control 35 289-298
[10]  
Meyn S.P.(1996)Heavy traffice convergence of a controlled, multiclass queueing system SIAM J. Control Optim. 34 2133-2171