共 5 条
[2]
Elevator Group Control Using Multiple Reinforcement Learning Agents[J] . Robert H. Crites,Andrew G. Barto.Machine Learning . 1998 (2)
[3]
The loss from imperfect value functions in expectation-based and minimax-based tasks[J] . Matthias Heger.Machine Learning . 1996 (1)
[4]
Technical Note: Q-Learning[J] . Christopher J.C.H. Watkins,Peter Dayan.Machine Learning . 1992 (3)
[5]
Approximation by superpositions of a sigmoidal function[J] . G. Cybenko.Mathematics of Control, Signals and Systems . 1989 (4)