共 12 条
[1]
ABOUNADI J, 1997, UNPUB Q LEARNING ALG
[2]
[Anonymous], 1963, J MATH ANAL APPL
[3]
Bertsekas D. P., 2005, Dynamic programming and optimal control, V1
[4]
Bertsekas Dimitri P., 1989, PARALLEL DISTRIBUTED
[5]
Bertsekas DP, 1995, Dynamic Programming and Optimal Control, V2
[8]
POPYACK JL, 1969, IEEE T AUTOMAT CONTR, V24, P503
[9]
Puterman M L., 1994, MARKOVIAN DECISION P