Temporal difference model reproduces anticipatory neural activity

被引:109
作者
Suri, RE [1 ]
Schultz, W [1 ]
机构
[1] Univ Fribourg, Inst Physiol, CH-1700 Fribourg, Switzerland
关键词
D O I
10.1162/089976601300014376
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Anticipatory neural activity preceding behaviorally important events has been reported in cortex, striatum, and midbrain dopamine neurons. Whereas dopamine neurons are phasically activated by reward-predictive stimuli, anticipatory activity of cortical and striatal neurons is increased during delay periods before important events. Characteristics of dopamine neuron activity resemble those of the prediction error signal of the temporal difference (TD) model of Pavlovian learning (Sutton & Barto, 1990). This study demonstrates that the prediction signal of the TD model reproduces characteristics of cortical and striatal anticipatory neural activity. This finding suggests that tonic anticipatory activities may reflect prediction signals that are involved in the processing of dopamine neuron activity.
引用
收藏
页码:841 / 862
页数:22
相关论文
共 54 条
[41]   Reward processing in primate orbitofrontal cortex and basal ganglia [J].
Schultz, W ;
Tremblay, L ;
Hollerman, JR .
CEREBRAL CORTEX, 2000, 10 (03) :272-283
[42]   A neural network model with dopamine-like reinforcement signal that learns a spatial delayed response task [J].
Suri, RE ;
Schultz, W .
NEUROSCIENCE, 1999, 91 (03) :871-890
[43]   Learning of sequential movements by neural network model with dopamine-like reinforcement signal [J].
Suri, RE ;
Schultz, W .
EXPERIMENTAL BRAIN RESEARCH, 1998, 121 (03) :350-354
[44]  
SURI RE, UNPUB MODELING FUNCT
[45]  
Sutton R., 1990, LEARNING COMPUTATION, P539
[46]  
Sutton R. S., 1998, Reinforcement Learning: An Introduction, V22447
[47]   Relative reward preference in primate orbitofrontal cortex [J].
Tremblay, L ;
Schultz, W .
NATURE, 1999, 398 (6729) :704-708
[48]   Reward-related neuronal activity during go-nogo task performance in primate orbitofrontal cortex [J].
Tremblay, L ;
Schultz, W .
JOURNAL OF NEUROPHYSIOLOGY, 2000, 83 (04) :1864-1876
[49]   Modifications of reward expectation-related neuronal activity during learning in primate striatum [J].
Tremblay, L ;
Hollerman, JR ;
Schultz, W .
JOURNAL OF NEUROPHYSIOLOGY, 1998, 80 (02) :964-977
[50]   Spatial processing in the monkey frontal eye field .1. Predictive visual responses [J].
Umeno, MM ;
Goldberg, ME .
JOURNAL OF NEUROPHYSIOLOGY, 1997, 78 (03) :1373-1383