[6]
Reinforcement distribution in fuzzy Q-learning[J]. Andrea Bonarini, Alessandro Lazaric, Francesco Montrone, Marcello Restelli. Fuzzy Sets and Systems. 2008 (10)
[7]
Restricted gradient-descent algorithm for value-function approximation in reinforcement learning[J]. André da Motta Salles Barreto, Charles W. Anderson. Artificial Intelligence. 2007 (4)
[8]
Kernel-Based Reinforcement Learning[J]. Machine Learning. 2002 (2)
[10]
Parametric value function approximation: A unified view. Geist M, Pietquin O. Proceedings of the 2011 IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning. 2011