共 19 条
[1]
Bertsekas D., 2019, REINFORCEMENT LEARNI
[2]
BORWEIN J, 1992, CORR9232 U WAT
[3]
CENSOR Y., 1990, NUMERICAL ANAL MATH, V24, P145
[4]
COMINETTI R, IN PRESS J OPTIMIZAT
[5]
DENHERTOG D, 1991, 9127 DELFT U TECHN F
[8]
FRISCH KR, 1955, COMMUNICATION 0513
[10]
GULER O, 1991, LIMITING BEHAVIOR WE