Encoding of Both Positive and Negative Reward Prediction Errors by Neurons of the Primate Lateral Prefrontal Cortex and Caudate Nucleus

被引:78
作者
Asaad, Wael F.
Eskandar, Emad N.
机构
[1] Massachusetts Gen Hosp, Rhodan Ctr Nervous Syst Repair, Dept Neurosurg, Boston, MA 02114 USA
[2] Harvard Univ, Sch Med, Boston, MA 02114 USA
基金
美国国家科学基金会;
关键词
ANTERIOR CINGULATE CORTEX; STRIATAL NEURONS; DORSAL STRIATUM; DOPAMINERGIC-NEURONS; TEMPORAL PREDICTION; DELAYED-RESPONSE; DECISION-MAKING; SIGNALS; DISSOCIATION; VALUES;
D O I
10.1523/JNEUROSCI.3793-11.2011
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Learning can be motivated by unanticipated success or unexpected failure. The former encourages us to repeat an action or activity, whereas the latter leads us to find an alternative strategy. Understanding the neural representation of these unexpected events is therefore critical to elucidate learning-related circuits. We examined the activity of neurons in the lateral prefrontal cortex (PFC) and caudate nucleus of monkeys as they performed a trial-and-error learning task. Unexpected outcomes were widely represented in both structures, and neurons driven by unexpectedly negative outcomes were as frequent as those activated by unexpectedly positive outcomes. Moreover, both positive and negative reward prediction errors (RPEs) were represented primarily by increases in firing rate, unlike the manner in which dopamine neurons have been observed to reflect these values. Interestingly, positive RPEs tended to appear with shorter latency than negative RPEs, perhaps reflecting the mechanism of their generation. Last, in the PFC but not the caudate, trial-by-trial variations in outcome-related activity were linked to the animals' subsequent behavioral decisions. More broadly, the robustness of RPE signaling by these neurons suggests that actor-critic models of reinforcement learning in which the PFC and particularly the caudate are considered primarily to be "actors" rather than "critics," should be reconsidered to include a prominent evaluative role for these structures.
引用
收藏
页码:17772 / 17787
页数:16
相关论文
共 57 条
[1]   Prediction error as a linear function of reward probability is coded in human nucleus accumbens [J].
Abler, Birgit ;
Walter, Henrik ;
Erk, Susanne ;
Kammerer, Hannes ;
Spitzer, Manfred .
NEUROIMAGE, 2006, 31 (02) :790-795
[2]   TEMPORAL AND SPATIAL CHARACTERISTICS OF TONICALLY ACTIVE NEURONS OF THE PRIMATES STRIATUM [J].
AOSAKI, T ;
KIMURA, M ;
GRAYBIEL, AM .
JOURNAL OF NEUROPHYSIOLOGY, 1995, 73 (03) :1234-1252
[3]   A flexible software tool for temporally-precise behavioral control in Matlab [J].
Asaad, Wael F. ;
Eskandar, Emad N. .
JOURNAL OF NEUROSCIENCE METHODS, 2008, 174 (02) :245-258
[4]   Neural activity in the primate prefrontal cortex during associative learning [J].
Asaad, WF ;
Rainer, G ;
Miller, EK .
NEURON, 1998, 21 (06) :1399-1407
[5]   Activity of striatal neurons reflects dynamic encoding and recoding of procedural memories [J].
Barnes, TD ;
Kubota, Y ;
Hu, D ;
Jin, DZZ ;
Graybiel, AM .
NATURE, 2005, 437 (7062) :1158-1161
[6]   Prefrontal cortex and decision making in a mixed-strategy game [J].
Barraclough, DJ ;
Conroy, ML ;
Lee, D .
NATURE NEUROSCIENCE, 2004, 7 (04) :404-410
[7]   Midbrain dopamine neurons encode a quantitative reward prediction error signal [J].
Bayer, HM ;
Glimcher, PW .
NEURON, 2005, 47 (01) :129-141
[8]   CONTROLLING THE FALSE DISCOVERY RATE - A PRACTICAL AND POWERFUL APPROACH TO MULTIPLE TESTING [J].
BENJAMINI, Y ;
HOCHBERG, Y .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1995, 57 (01) :289-300
[9]   Comparison of learning-related neuronal activity in the dorsal premotor cortex and striatum [J].
Brasted, PJ ;
Wise, SP .
EUROPEAN JOURNAL OF NEUROSCIENCE, 2004, 19 (03) :721-740
[10]   Shifts in striatal responsivity evoked by chronic stimulation of dopamine and glutamate systems [J].
Canales, JJ ;
Capper-Loup, C ;
Hu, D ;
Choe, ES ;
Upadhyay, U ;
Graybiel, AM .
BRAIN, 2002, 125 :2353-2363