共 33 条
[1]
Acosta-Abreu R. S., 1985, Control and Cybernetics, V14, P313
[3]
[Anonymous], 2010, Dynamic programming
[4]
Berry D. A., 1985, BANDIT PROBLEMS SEQU
[5]
Beutler F. J., 1989, Stochastics, V26, P81, DOI 10.1080/17442508908833551
[6]
Burnetas A. N., 1993, PROBAB ENG INFORM SC, V7, P85, DOI DOI 10.1017/S0269964800002801
[7]
BURNETAS AN, 1994, OPTIMAL ADAPTIVE POL
[8]
BURNETAS AN, 1989, OPTIMAL SEQUENTIAL A
[9]
Dembo A., 1993, Large deviations techniques and applications
[10]
Dynkin E.B., 1979, Grundlehren der Mathematischen Wissenschaften, V235