共 34 条
[3]
[Anonymous], IEEE WORLD C COMP IN
[4]
[Anonymous], 2004, IEEE T AUTOMAT CONTR, DOI DOI 10.1109/TAC.1972.1100008
[5]
[Anonymous], IEEE P CDC 89
[6]
[Anonymous], 1989, LEARNING DELAYED REW
[8]
Bertsekas Dimitri, 1996, Neuro dynamic programming
[10]
Reinforcement learning in continuous time and space
[J].
NEURAL COMPUTATION,
2000, 12 (01)
:219-245