16 entries in total
[11]
LU Y, 2002, MACH LEARN, V46, P361
[14]
Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447
[15]
TOUSSAINT M, 2002, P INT JOINT C NEUR N
[16]
WATKINS CJCH, 1992, MACH LEARN, V8, P279, DOI 10.1007/BF00992698