131 entries in total
[61]
KONONEN V, P 4 INT C INT DAT EN, P68
[62]
Least-squares policy iteration [J]. JOURNAL OF MACHINE LEARNING RESEARCH, 2004, 4(06): 1107-1149
[63]
Lauer Martin, P 17 INT C MACH LEAR, P535
[64]
LEE JW, P 13 INT C DAT EXP S, V2453, P153
[65]
LITTMAN ML, P 11 INT C MACH LEAR, P157
[66]
LITTMAN ML, P 8 INT WORKSH AG TH, P96
[67]
LITTMAN ML, 2001, J COGN SYST RES, V2, P55, DOI 10.1016/S1389-0417(01)00015-8
[69]
Mataric M. J., 1996, Adaption and Learning in Multi-Agent Systems. IJCAI '95 Workshop. Proceedings, P152