[6] Richard S. Sutton, Doina Precup, Satinder Singh. Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning [J]. Artificial Intelligence, 1999, 1
[7] Shen Jing (ed.). Theory and Methods of Hierarchical Reinforcement Learning (分层强化学习理论与方法) [M]. Harbin Engineering University Press, 2007