ASYMPTOTICALLY EFFICIENT ALLOCATION RULES FOR THE MULTIARMED BANDIT PROBLEM WITH MULTIPLE PLAYS, PART I: I.I.D. REWARDS

Cited by: 182
Authors
ANANTHARAM, V
VARAIYA, P
WALRAND, J
Affiliations
[1] UNIV CALIF BERKELEY,DEPT ELECT ENGN & COMP SCI,BERKELEY,CA 94720
[2] UNIV CALIF BERKELEY,ELECTR RES LAB,BERKELEY,CA 94720
DOI
10.1109/TAC.1987.1104491
Chinese Library Classification
TP [Automation Technology, Computer Technology]
Discipline classification code
0812
Pages: 968-976
Page count: 9
Related papers
7 in total
[1] Chow Y.S., 1971, GREAT EXPECTATIONS T
[2] HOGAN M, 1983, 21 STANF U DEP STAT
[3] LAI, TL; ROBBINS, H. ASYMPTOTICALLY EFFICIENT ADAPTIVE ALLOCATION RULES [J]. ADVANCES IN APPLIED MATHEMATICS, 1985, 6 (01): 4-22
[4] LAI TL, DESIGN EXPT, P127
[5] LAI TL, 1984, 23RD P IEEE C DEC CO, P51
[6] Neveu J., 1975, DISCRETE PARAMETER M
[7] POLLAK, M; SIEGMUND, D. APPROXIMATIONS TO EXPECTED SAMPLE SIZE OF CERTAIN SEQUENTIAL TESTS [J]. ANNALS OF STATISTICS, 1975, 3 (06): 1267-1282