共 7 条
[1]
Brooks R.A.(1986)A robust layered control system for a mobile robot IEEE Journal of Robotics and Automation RA-2 14-23
[2]
Moore A.W.(1992)Fast, robust adaptive control by learning only forward models Advances in Neural Information Processing 4 571-579
[3]
Schaal S.(1994)Robot juggling: An implementation of memory-bassed learning Control Systems Magazine 14 57-71
[4]
Atkeson C.C.(1988)Learning to predict by method of temporal differences Machine Learning 3 9-44
[5]
Sutton R.(1992)Q-learning Machine Learning 8 279-292
[6]
Watkins C.J.C.H.(undefined)undefined undefined undefined undefined-undefined
[7]
Dayan P.(undefined)undefined undefined undefined undefined-undefined