共 35 条
[1]
ABOUNADI J, 1996, ODE ANAL Q LEARNING
[2]
[Anonymous], PROC ICML
[4]
Bertsekas D. P., 1996, Neuro Dynamic Programming, V1st
[5]
Bertsekas DP, 2012, DYNAMIC PROGRAMMING, V2
[8]
An analog scheme for fixed point computation .1. Theory
[J].
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS I-FUNDAMENTAL THEORY AND APPLICATIONS,
1997, 44 (04)
:351-355
[9]
BORKAR VS, ODE METHOD CONVERGEN
[10]
BUTTON R, 1998, REINFORCEMENT LEARNI