BRANCHING BANDITS AND KLIMOVS PROBLEM - ACHIEVABLE REGION AND SIDE CONSTRAINTS

被引:13
作者
BERTSIMAS, D [1 ]
PASCHALIDIS, IC [1 ]
TSITSIKLIS, JN [1 ]
机构
[1] MIT,CTR OPERAT RES,CAMBRIDGE,MA 02139
关键词
D O I
10.1109/9.478231
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
We consider the average cost branching bandits problem and its special case known as Klimov's problem, We consider the vector n whose components are the mean number of bandits (or customers) of each type that are present. We characterize fully the achievable region, that is, the set of all possible vectors n that can be obtained by considering all possible policies. While the original description of the achievable region involves exponentially many constraints, we also develop an alternative description that involves only O(R(2)) variables and constraints, where a is the number of bandit types (or customer classes). We then consider the problem of minimizing a linear function of n subject to L additional linear constraints on n. We show that optimal policies can be obtained by randomizing between L + 1 strict priority policies that can be found efficiently (in polynomial time) using linear programming techniques.
引用
收藏
页码:2063 / 2075
页数:13
相关论文
共 18 条
[1]  
BERTSIMAS D, 1992, CONSERVATION LAWS EX
[2]  
BERTSIMAS D, 1992, OCT P WORKSH HIER CO
[3]   OPTIMIZATION OF MULTICLASS QUEUEING NETWORKS: POLYHEDRAL AND NONLINEAR CHARACTERIZATIONS OF ACHIEVABLE PERFORMANCE [J].
Bertsimas, Dimitris ;
Paschalidis, Ioannis Ch. ;
Tsitsiklis, John N. .
ANNALS OF APPLIED PROBABILITY, 1994, 4 (01) :43-75
[4]  
BHATTACHARYA PP, 1992, EXTENDED POLYMATROID
[5]   CHARACTERIZATION AND OPTIMIZATION OF ACHIEVABLE PERFORMANCE IN GENERAL QUEUING-SYSTEMS [J].
FEDERGRUEN, A ;
GROENEVELT, H .
OPERATIONS RESEARCH, 1988, 36 (05) :733-741
[6]  
GELENBE E, 1980, ANAL SYNTHESIS COMPU
[7]  
GITTINS JC, 1989, MULTIARMED BANDIT AL
[9]  
KLIMOV GP, 1974, THEOR PROB APPLICAT, V19
[10]   PERFORMANCE BOUNDS FOR QUEUING-NETWORKS AND SCHEDULING POLICIES [J].
KUMAR, S ;
KUMAR, PR .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1994, 39 (08) :1600-1611