4 entries in total
[2] Recent Advances in Hierarchical Reinforcement Learning[J]. Andrew G. Barto, Sridhar Mahadevan. Discrete Event Dynamic Systems. 2003 (1)
[3] Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning[J]. Richard S. Sutton, Doina Precup, Satinder Singh. Artificial Intelligence. 1999 (1)
[4] Self-Improving Reactive Agents Based on Reinforcement Learning, Planning and Teaching[J]. Long-Ji Lin. Machine Learning. 1992 (3)