共 13 条
[2]
[Anonymous], 2006, RR6062 INRIA
[3]
AUDIBERT JY, 2004, THESIS U PARIS 6 PAR
[5]
AUER P, 2006, 2 PASCAL CHALL WORKS
[7]
Gittins JC., 1989, Wiley-Interscience Series in Systems and Optimization
[9]
Bandit based Monte-Carlo planning
[J].
MACHINE LEARNING: ECML 2006, PROCEEDINGS,
2006, 4212
:282-293