14 entries in total
- [4] Planning and acting in partially observable stochastic domains[J] . Leslie Pack Kaelbling,Michael L. Littman,Anthony R. Cassandra.Artificial Intelligence . 1998 (1)
- [5] Elevator Group Control Using Multiple Reinforcement Learning Agents[J] . Robert H. Crites,Andrew G. Barto.Machine Learning . 1998 (2)
- [6] Asynchronous Stochastic Approximation and Q-Learning[J] . John N. Tsitsiklis.Machine Learning . 1994, 16 (3) : 185-202
- [7] Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching[J] . Long-Ji Lin.Machine Learning . 1992 (3)
- [8] Q-learning[J] . Christopher J. C. H. Watkins,Peter Dayan.Machine Learning . 1992 (3)
- [9] A situated-automata approach to the design of embedded agents[J] . Leslie Pack Kaelbling.ACM SIGART Bulletin . 1991 (4)