A predictive reinforcement model of dopamine neurons for learning approach behavior

Cited by: 64
Authors
Contreras-Vidal, JL [1]
Schultz, W [2]
Affiliations
[1] Arizona State Univ, Motor Control Lab, Tempe, AZ 85287 USA
[2] Univ Fribourg, Inst Physiol, CH-1700 Fribourg, Switzerland
Keywords
neural network; prefrontal; reinforcement learning; striatum; timing
DOI
10.1023/A:1008862904946
Chinese Library Classification number
Q [Biological sciences]
Discipline classification codes
07; 0710; 09
Abstract
A neural network model of how dopamine and prefrontal cortex activity guides short- and long-term information processing within the cortico-striatal circuits during reward-related learning of approach behavior is proposed. The model predicts two types of reward-related neuronal responses generated during learning: (1) cell activity signaling errors in the prediction of the expected time of reward delivery and (2) neural activations coding for errors in the prediction of the amount and type of reward or stimulus expectancies. The former type of signal is consistent with the responses of dopaminergic neurons, while the latter signal is consistent with reward expectancy responses reported in the prefrontal cortex. It is shown that a neural network architecture that satisfies the design principles of the adaptive resonance theory of Carpenter and Grossberg (1987) can account for the dopamine responses to novelty, generalization, and discrimination of appetitive and aversive stimuli. These hypotheses are scrutinized via simulations of the model in relation to the delivery of free food outside a task, the timed contingent delivery of appetitive and aversive stimuli, and an asymmetric, instructed delay response task.
Pages: 191-214
Page count: 24
Related papers
87 in total
[1]   PARALLEL ORGANIZATION OF FUNCTIONALLY SEGREGATED CIRCUITS LINKING BASAL GANGLIA AND CORTEX [J].
ALEXANDER, GE ;
DELONG, MR ;
STRICK, PL .
ANNUAL REVIEW OF NEUROSCIENCE, 1986, 9 :357-381
[2]   NEURONAL-ACTIVITY IN MONKEY STRIATUM RELATED TO THE EXPECTATION OF PREDICTABLE ENVIRONMENTAL EVENTS [J].
APICELLA, P ;
SCARNATI, E ;
LJUNGBERG, T ;
SCHULTZ, W .
JOURNAL OF NEUROPHYSIOLOGY, 1992, 68 (03) :945-960
[3]  
APICELLA P, 1991, EXP BRAIN RES, V85, P491
[4]   Neural signals in the monkey ventral striatum related to motivation for juice and cocaine rewards [J].
Bowman, EM ;
Aigner, TG ;
Richmond, BJ .
JOURNAL OF NEUROPHYSIOLOGY, 1996, 75 (03) :1061-1073
[5]   COGNITIVE FUNCTION IN PARKINSONS-DISEASE - FROM DESCRIPTION TO THEORY [J].
BROWN, RG ;
MARSDEN, CD .
TRENDS IN NEUROSCIENCES, 1990, 13 (01) :21-29
[6]   DOPAMINE DEPENDENT REACTION-TIME DEFICITS IN PATIENTS WITH PARKINSONS-DISEASE ARE TASK SPECIFIC [J].
BROWN, VJ ;
SCHWARZ, U ;
BOWMAN, EM ;
FUHR, P ;
ROBINSON, DL ;
HALLETT, M .
NEUROPSYCHOLOGIA, 1993, 31 (05) :459-469
[7]   NEURAL-NETWORK MODEL OF THE CEREBELLUM - TEMPORAL DISCRIMINATION AND THE TIMING OF MOTOR-RESPONSES [J].
BUONOMANO, DV ;
MAUK, MD .
NEURAL COMPUTATION, 1994, 6 (01) :38-55
[8]  
CALABRESI P, 1992, J NEUROSCI, V12, P4224
[9]   THE PERFORMANCE ON LEARNING-TASKS OF PATIENTS IN THE EARLY STAGES OF PARKINSONS-DISEASE [J].
CANAVAN, AGM ;
PASSINGHAM, RE ;
MARSDEN, CD ;
QUINN, N ;
WYKE, M ;
POLKEY, CE .
NEUROPSYCHOLOGIA, 1989, 27 (02) :141-156
[10]   Distributed learning, recognition, and prediction by ART and ARTMAP neural networks [J].
Carpenter, GA .
NEURAL NETWORKS, 1997, 10 (08) :1473-1494