7 entries in total
[5]
Technical Note: Q-Learning. Christopher J.C.H. Watkins, Peter Dayan. Machine Learning. 1992 (3)
[6]
Adaptation technique for integrating genetic programming and reinforcement learning for real robots. KAMIO, IBA H. IEEE Transactions on Evolutionary Computation. 2005
[7]
Reinforcement learning for long-run average cost. Gosavi Abhijit. European Journal of Operational Research. 2004