共 3 条
[1]
Incremental multi-step Q-learning[J] . Jing Peng,Ronald J. Williams.Machine Learning . 1996 (1)
[2]
Technical Note: Q-Learning[J] . Christopher J.C.H. Watkins,Peter Dayan.Machine Learning . 1992 (3)
[3]
Swinging up Control of Inverted Pendulum Using Pseudo-State Feedback. Furuta Yarnakita M, Kobayashi S. Systems and Control Letters . 1992