49 entries in total
[41]
Sondik E. J., 1978, OPERATIONS RES, V26
[42]
Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447
[43]
SUTTON RS, 2000, NEURAL INFORMATION P
[44]
TAO N, 2001, MULTIAGENT POLICY GR
[46]
TESAURO G, 1992, MACH LEARN, V8, P257, DOI 10.1007/BF00992697
[48]
WILLIAMS RJ, 1992, MACH LEARN, V8, P229, DOI 10.1007/BF00992696
[49]
ZHANG W, 1995, P 14 INT JOINT C ART, P1114