共 20 条
[3]
Bauschke H. H., 1997, J CONVEX ANAL, V4, P27
[5]
BENTAL A, 1992, 992 TECHN OPT LAB
[6]
Bertsekas D., 2019, Reinforcement Learning and Optimal Control
[7]
BREITFELD MG, 1993, 1793 RUTG U
[10]
GULER O, 1991, SIAM J CONTROL OPTIM, V29, P403, DOI 10.1137/0329022