共 40 条
[31]
Schwartz ES, 1997, J FINANC, V52, P923, DOI 10.2307/2329512
[37]
ASYNCHRONOUS STOCHASTIC-APPROXIMATION AND Q-LEARNING
[J].
MACHINE LEARNING,
1994, 16 (03)
:185-202
[38]
WATKINS CJCH, 1992, MACH LEARN, V8, P279, DOI 10.1007/BF00992698