How instructed knowledge modulates the neural systems of reward learning

被引：102

作者：

Li, Jian ^{[1
,2
]}

Delgado, Mauricio R. ^{[3
]}

Phelps, Elizabeth A. ^{[1
,2
]}

机构：

[1] NYU, Dept Psychol, New York, NY 10003 USA

[2] NYU, Ctr Neural Sci, New York, NY 10003 USA

[3] Rutgers State Univ, Dept Psychol, Newark, NJ 07102 USA

来源：

PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA | 2011年 / 108卷 / 01期

关键词：

functional MRI; striatum; instruction; computational modeling; prediction error; ORBITOFRONTAL CORTEX; PREFRONTAL CORTEX; DECISION-MAKING; RESPONSES; PREDICTION; VALUATION; STRIATUM; MODELS; REPRESENTATIONS; DISSOCIATION;

D O I：

10.1073/pnas.1014938108

中图分类号：

O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];

学科分类号：

07 ; 0710 ; 09 ;

摘要：

Recent research in neuroeconomics has demonstrated that the reinforcement learning model of reward learning captures the patterns of both behavioral performance and neural responses during a range of economic decision-making tasks. However, this powerful theoretical model has its limits. Trial-and-error is only one of the means by which individuals can learn the value associated with different decision options. Humans have also developed efficient, symbolic means of communication for learning without the necessity for committing multiple errors across trials. In the present study, we observed that instructed knowledge of cue-reward probabilities improves behavioral performance and diminishes reinforcement learning-related blood-oxygen level-dependent (BOLD) responses to feedback in the nucleus accumbens, ventromedial prefrontal cortex, and hippocampal complex. The decrease in BOLD responses in these brain regions to reward-feedback signals was functionally correlated with activation of the dorsolateral prefrontal cortex (DLPFC). These results suggest that when learning action values, participants use the DLPFC to dynamically adjust outcome responses in valuation regions depending on the usefulness of action-outcome information.

引用

页码：55 / 60

页数：6

共 55 条

[1] Beautiful faces have variable reward value: fMRI and behavioral evidence
Aharon, I
Etcoff, N
Ariely, D
Chabris, CF
O'Connor, E
Breiter, HC
[J]. NEURON, 2001, 32 (03) : 537 - 551
[2] Dissociated neural representations of intensity and valence in human olfaction
Anderson, AK
Christoff, K
Stappen, I
Panitz, D
Ghahremani, DG
Glover, G
Gabrieli, JDE
Sobel, N
[J]. NATURE NEUROSCIENCE, 2003, 6 (02) : 196 - 202
[3] Computational Models for the Combination of Advice and Individual Learning
Biele, Guido
Rieskamp, Joerg
Gonzalez, Richard
[J]. COGNITIVE SCIENCE, 2009, 33 (02) : 206 - 242
[4] Short-term memory traces for action bias in human reinforcement learning
Bogacz, Rafal
McClure, Samuel M.
Li, Jian
Cohen, Jonathan D.
Montague, P. Read
[J]. BRAIN RESEARCH, 2007, 1153 : 111 - 121
[5] Functional imaging of neural responses to expectancy and experience of monetary gains and losses
Breiter, HC
Aharon, I
Kahneman, D
Dale, A
Shizgal, P
[J]. NEURON, 2001, 30 (02) : 619 - 639
[6] Neural mechanisms of observational learning
Burke, Christopher J.
Tobler, Philippe N.
Baddeley, Michelle
Schultz, Wolfram
[J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (32) : 14431 - 14436
[7] Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
Daw, ND
Niv, Y
Dayan, P
[J]. NATURE NEUROSCIENCE, 2005, 8 (12) : 1704 - 1711
[8] Decision theory, reinforcement learning, and the brain
Dayan, Peter
Daw, Nathaniel D.
[J]. COGNITIVE AFFECTIVE & BEHAVIORAL NEUROSCIENCE, 2008, 8 (04) : 429 - 453
[9] The neural basis of altruistic punishment
de Quervain, DJF
Fischbacher, U
Treyer, V
Schelthammer, M
Schnyder, U
Buck, A
Fehr, E
[J]. SCIENCE, 2004, 305 (5688) : 1254 - 1258
[10] Regulating the expectation of reward via cognitive strategies
Delgado, Mauricio R.
Gillis, M. Meredith
Phelps, Elizabeth A.
[J]. NATURE NEUROSCIENCE, 2008, 11 (08) : 880 - 881

← 1 2 3 4 5 6 →