[41]
Silver D., 2009, ICML, ACM International Conference Proceeding Series, V382, P119
[44]
Stern D., 2006, P 23 INT C MACH LEAR, P873, DOI 10.1145/1143844.1143954
[45]
Stoutamire D., 1991, THESIS CASE W RESERV
[46]
Sturtevant NR, 2008, LECT NOTES COMPUT SC, V5131, P37, DOI 10.1007/978-3-540-87608-3_4
[47]
Sutton R. S., 1990, Machine Learning: Proceedings of the Seventh International Conference (1990), P216
[48]
Sutton RS, 2018, ADAPT COMPUT MACH LE, P1
[49]
Sutton R. S., 1984, Temporal credit assignment in reinforcement learning
[50]
Sutton RS, 1996, ADV NEUR IN, V8, P1038