共 5 条
[3]
Explanation-Based Learning and Reinforcement Learning: A Unified View[J] . Thomas G. Dietterich,Nicholas S. Flann.Machine Learning . 1997 (2)
[4]
Technical Note: Q-Learning[J] . Christopher J.C.H. Watkins,Peter Dayan.Machine Learning . 1992 (3)
[5]
Learning to Predict by the Methods of Temporal Differences[J] . Richard S. Sutton.Machine Learning . 1988 (1)