共 2 条
[1]
Simple Statistical Gradient-Following Algorithms for Connectionist Reinforcement Learning.[J].Ronald J. Williams.Machine Learning.1992, 3
[2]
Technical Note: Q-Learning.[J].Christopher J.C.H. Watkins;Peter Dayan.Machine Learning.1992, 3

