共 66 条
[1]
Reinforcement learning based algorithms for average cost Markov Decision Processes
[J].
DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS,
2007, 17 (01)
:23-52
[3]
ALEKSANDROV VM, 1968, ENG CYBERN, P11
[4]
[5]
[Anonymous], 2009, Advances in Neural Information Processing Systems
[6]
[Anonymous], 1999, Nonlinear Programming
[7]
[Anonymous], 2007, Control Techniques for Complex Networks
[8]
[Anonymous], 2008, Proc. Advances in Neural Information Processing Systems (NIPS)
[9]
Bagnell J. A., 2003, INT JOINT C ART INT
[10]
Baird L. C., 1993, Tech. Rep. WL-TR-93-1146

