共 23 条
[11]
Recent Advances in Hierarchical Reinforcement Learning[J] . Andrew G. Barto,Sridhar Mahadevan.Discrete Event Dynamic Systems . 2003 (4)
[12]
Kernel-Based Reinforcement Learning[J] . Machine Learning . 2002 (2)
[13]
Technical Update: Least-Squares Temporal Difference Learning[J] . Justin A. Boyan.Machine Learning . 2002 (2)
[16]
Linear Least-Squares algorithms for temporal difference learning[J] . Steven J. Bradtke,Andrew G. Barto.Machine Learning . 1996 (1)
[19]
TD-Gammon, a Self-Teaching Backgammon Program, Achieves Master-Level Play[J] . Gerald Tesauro.Neural Computation . 1994 (2)
[20]
ASYNCHRONOUS STOCHASTIC-APPROXIMATION AND Q-LEARNING
[J].
MACHINE LEARNING,
1994, 16 (03)
:185-202