A predictive reinforcement model of dopamine neurons for learning approach behavior

Cited by: 64
Authors
Contreras-Vidal, JL [1]
Schultz, W [2]
Affiliations
[1] Arizona State Univ, Motor Control Lab, Tempe, AZ 85287 USA
[2] Univ Fribourg, Inst Physiol, CH-1700 Fribourg, Switzerland
Keywords
neural network; prefrontal; reinforcement learning; striatum; timing
DOI
10.1023/A:1008862904946
Chinese Library Classification number
Q [Biological sciences]
Discipline classification code
07; 0710; 09
Abstract
A neural network model of how dopamine and prefrontal cortex activity guides short- and long-term information processing within the cortico-striatal circuits during reward-related learning of approach behavior is proposed. The model predicts two types of reward-related neuronal responses generated during learning: (1) cell activity signaling errors in the prediction of the expected time of reward delivery and (2) neural activations coding for errors in the prediction of the amount and type of reward or stimulus expectancies. The former type of signal is consistent with the responses of dopaminergic neurons, while the latter signal is consistent with reward expectancy responses reported in the prefrontal cortex. It is shown that a neural network architecture that satisfies the design principles of the adaptive resonance theory of Carpenter and Grossberg (1987) can account for the dopamine responses to novelty, generalization, and discrimination of appetitive and aversive stimuli. These hypotheses are scrutinized via simulations of the model in relation to the delivery of free food outside a task, the timed contingent delivery of appetitive and aversive stimuli, and an asymmetric, instructed delay response task.
Pages: 191-214
Page count: 24
Related papers
87 entries in total
[11] Carpenter GA, Grossberg S. ART-3 - Hierarchical search using chemical transmitters in self-organizing pattern-recognition architectures [J]. Neural Networks, 1990, 3(02): 129-152
[12] Carpenter GA, Grossberg S. ART-2 - Self-organization of stable category recognition codes for analog input patterns [J]. Applied Optics, 1987, 26(23): 4919-4930
[13] Contreras-Vidal JL. Society for Neuroscience Abstracts, 1996, 22: 2029
[14] Contreras-Vidal JL, Stelmach GE. A neural model of basal ganglia-thalamocortical relations in normal and parkinsonian movement [J]. Biological Cybernetics, 1995, 73(05): 467-476
[15] Eblen F. J Neurosci, 1995, 15: 5999
[16] Fiala JC. J Neurosci, 1996, 16: 3760
[17] Gaffan D, Murray EA, Fabre-Thorpe M. Interaction of the amygdala with the frontal-lobe in reward memory [J]. European Journal of Neuroscience, 1993, 5(07): 968-975
[18] Gariano RF, Groves PM. Burst firing induced in midbrain dopamine neurons by stimulation of the medial prefrontal and anterior cingulate cortices [J]. Brain Research, 1988, 462(01): 194-198
[19] Gaspar P, Stepniewska I, Kaas JH. Topography and collateralization of the dopaminergic projections to motor and lateral prefrontal cortex in owl monkeys [J]. Journal of Comparative Neurology, 1992, 325(01): 1-21
[20] Gerfen CR. J Neurosci, 1987, 7: 3935