54 entries in total
[44]
SURI RE, UNPUB MODELING FUNCT
[45]
Sutton R., 1990, LEARNING COMPUTATION, P539
[46]
Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447
[47]
Relative reward preference in primate orbitofrontal cortex [J]. NATURE, 1999, 398 (6729): 704-708