A LEMMA ON THE MULTIARMED BANDIT PROBLEM

被引:15
作者
TSITSIKLIS, JN
机构
[1] MIT, Cambridge, MA, USA, MIT, Cambridge, MA, USA
关键词
D O I
10.1109/TAC.1986.1104332
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
4
引用
收藏
页码:576 / 577
页数:2
相关论文
共 4 条
[1]  
GITTINS JC, 1979, J ROY STAT SOC B MET, V41, P148
[2]  
Shreve S., 1976, DYNAMIC PROGRAMMING, P105
[3]   EXTENSIONS OF THE MULTIARMED BANDIT PROBLEM - THE DISCOUNTED CASE [J].
VARAIYA, PP ;
WALRAND, JC ;
BUYUKKOC, C .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1985, 30 (05) :426-439
[4]  
WHITTLE P, 1982, OPTIMIZATION OVER TI