5 entries in total
[1] Carlos H. C. Ribeiro. Embedding a Priori Knowledge in Reinforcement Learning [J]. Journal of Intelligent and Robotic Systems, 1998, 1.
[2] John W. Sheppard. Colearning in Differential Games [J]. Machine Learning, 1998, 2.
[3] Christopher J.C.H. Watkins; Peter Dayan. Technical Note: Q-Learning [J]. Machine Learning, 1992, 3.
[4] Richard S. Sutton. Learning to predict by the methods of temporal differences [J]. Machine Learning, 1988, 1.
[5]

