Scheduling a multi class queue with many exponential servers: Asymptotic optimality in heavy traffic

被引:86
作者
Atar, R [1 ]
Mandelbaum, A
Reiman, MI
机构
[1] Technion Israel Inst Technol, Dept Elect Engn, IL-32000 Haifa, Israel
[2] Technion Israel Inst Technol, Dept Ind Engn & Management, IL-32000 Haifa, Israel
[3] Bell Labs, Lucent Technol, Murray Hill, NJ 07974 USA
关键词
multiclass queues; multiserver queues; queues with abandonment; heavy traffic; Halfin-Whitt (QED) regime; call centers; dynamic control; diffusion approximation; optimal control of diffusion; HJB equation; asymptotic optimality;
D O I
10.1214/105051604000000233
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider the problem of scheduling a queueing system in which many statistically identical servers cater to several classes of impatient customers. Service times and impatience clocks are exponential while arrival processes are renewal. Our cost is an expected cumulative discounted function, linear or nonlinear, of appropriately normalized performance measures. As a special case, the cost per unit time can be a function of the number of customers waiting to be served in each class, the number actually being served, the abandonment rate, the delay experienced by customers, the number of idling servers, as well as certain combinations thereof. We study the system in an asymptotic heavy-traffic regime where the number of servers n and the offered load r are simultaneously scaled up and carefully balanced: n approximate to r + betarootr for some scalar beta. This yields an operation that enjoys the benefits of both heavy traffic (high server utilization) and light traffic (high service levels.) We first consider a formal weak limit, through which our queueing scheduling problem gives rise to a diffusion control problem. We show that the latter has an optimal Markov control policy, and that the corresponding Hamilton-Jacobi-Bellman (HJB) equation has a unique classical solution. The Markov control policy and the HJB equation are then used to define scheduling control policies which we prove are asymptotically optimal for our original queueing system. The analysis yields both qualitative and quantitative insights, in particular on staffing levels, the roles of non-preemption and work conservation, and the trade-off between service quality and servers' efficiency.
引用
收藏
页码:1084 / 1134
页数:51
相关论文
共 39 条
[1]  
[Anonymous], 2000, STOCHASTICS INT J PR
[2]  
ARMONY M, 2004, IN PRESS OPER RES
[3]   A Brownian control problem for a simple queueing system in the Halfin-Whitt regime [J].
Atar, R ;
Mandelbaum, A ;
Reiman, MI .
SYSTEMS & CONTROL LETTERS, 2004, 51 (3-4) :269-275
[4]  
Bell SL, 2001, ANN APPL PROBAB, V11, P608
[5]  
Billingsley P., 1999, CONVERGENCE PROBABIL
[6]  
Birkhoff G., 1962, ORDINARY DIFFERENTIA
[7]  
Borkar VS, 1989, PITMAN RES NOTES MAT, V203, P213
[8]  
BORST S, 2004, IN PRESS OPER RES
[9]  
BORST SC, 2000, ROBUST ALGORITHMS SH
[10]  
Durrett R, 1996, PROBABILITY THEORY E