Encoding of Both Positive and Negative Reward Prediction Errors by Neurons of the Primate Lateral Prefrontal Cortex and Caudate Nucleus

Cited by: 78
Authors
Asaad, Wael F.
Eskandar, Emad N.
Affiliations
[1] Massachusetts Gen Hosp, Rhodan Ctr Nervous Syst Repair, Dept Neurosurg, Boston, MA 02114 USA
[2] Harvard Univ, Sch Med, Boston, MA 02114 USA
Funding
National Science Foundation (USA);
Keywords
ANTERIOR CINGULATE CORTEX; STRIATAL NEURONS; DORSAL STRIATUM; DOPAMINERGIC-NEURONS; TEMPORAL PREDICTION; DELAYED-RESPONSE; DECISION-MAKING; SIGNALS; DISSOCIATION; VALUES;
DOI
10.1523/JNEUROSCI.3793-11.2011
Chinese Library Classification number
Q189 [Neuroscience];
Discipline classification code
071006 ;
Abstract
Learning can be motivated by unanticipated success or unexpected failure. The former encourages us to repeat an action or activity, whereas the latter leads us to find an alternative strategy. Understanding the neural representation of these unexpected events is therefore critical to elucidate learning-related circuits. We examined the activity of neurons in the lateral prefrontal cortex (PFC) and caudate nucleus of monkeys as they performed a trial-and-error learning task. Unexpected outcomes were widely represented in both structures, and neurons driven by unexpectedly negative outcomes were as frequent as those activated by unexpectedly positive outcomes. Moreover, both positive and negative reward prediction errors (RPEs) were represented primarily by increases in firing rate, unlike the manner in which dopamine neurons have been observed to reflect these values. Interestingly, positive RPEs tended to appear with shorter latency than negative RPEs, perhaps reflecting the mechanism of their generation. Last, in the PFC but not the caudate, trial-by-trial variations in outcome-related activity were linked to the animals' subsequent behavioral decisions. More broadly, the robustness of RPE signaling by these neurons suggests that actor-critic models of reinforcement learning in which the PFC and particularly the caudate are considered primarily to be "actors" rather than "critics," should be reconsidered to include a prominent evaluative role for these structures.
Pages: 17772-17787
Number of pages: 16
Related papers
57 in total
[41]   Memory fields of neurons in the primate prefrontal cortex [J].
Rainer, G ;
Asaad, WF ;
Miller, EK .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :15008-15013
[42]   Prediction error for free monetary reward in the human prefrontal cortex [J].
Ramnani, N ;
Elliott, R ;
Athwal, BS ;
Passingham, RE .
NEUROIMAGE, 2004, 23 (03) :777-786
[43]  
Rescorla R.A., 1972, Class. Cond. II: Curr. Res. Theory, V2, P64, DOI 10.1101/GR.110528.110
[44]   A cellular mechanism of reward-related learning [J].
Reynolds, JNJ ;
Hyland, BI ;
Wickens, JR .
NATURE, 2001, 413 (6851) :67-70
[45]   Ventral Striatal Neurons Encode the Value of the Chosen Action in Rats Deciding between Differently Delayed or Sized Rewards [J].
Roesch, Matthew R. ;
Singh, Teghpal ;
Brown, P. Leon ;
Mullins, Sylvina E. ;
Schoenbaum, Geoffrey .
JOURNAL OF NEUROSCIENCE, 2009, 29 (42) :13365-13376
[46]   Neuronal activity in the rodent dorsal striatum in sequential navigation: Separation of spatial and reward responses on the multiple T task [J].
Schmitzer-Torbert, N ;
Redish, AD .
JOURNAL OF NEUROPHYSIOLOGY, 2004, 91 (05) :2259-2272
[47]  
SCHULTZ W, 1993, PROG BRAIN RES, V99, P227
[48]   Reward prediction in primate basal ganglia and frontal cortex [J].
Schultz, W ;
Tremblay, L ;
Hollerman, JR .
NEUROPHARMACOLOGY, 1998, 37 (4-5) :421-429
[49]   Dynamic signals related to choices and outcomes in the dorsolateral prefrontal cortex [J].
Seo, Hyojung ;
Barraclough, Dominic J. ;
Lee, Daeyeol .
CEREBRAL CORTEX, 2007, 17 :I110-I117
[50]   Differential encoding of losses and gains in the human striatum [J].
Seymour, Ben ;
Daw, Nathaniel ;
Dayan, Peter ;
Singer, Tania ;
Dolan, Ray .
JOURNAL OF NEUROSCIENCE, 2007, 27 (18) :4826-4831