How instructed knowledge modulates the neural systems of reward learning

被引:102
作者
Li, Jian [1 ,2 ]
Delgado, Mauricio R. [3 ]
Phelps, Elizabeth A. [1 ,2 ]
机构
[1] NYU, Dept Psychol, New York, NY 10003 USA
[2] NYU, Ctr Neural Sci, New York, NY 10003 USA
[3] Rutgers State Univ, Dept Psychol, Newark, NJ 07102 USA
关键词
functional MRI; striatum; instruction; computational modeling; prediction error; ORBITOFRONTAL CORTEX; PREFRONTAL CORTEX; DECISION-MAKING; RESPONSES; PREDICTION; VALUATION; STRIATUM; MODELS; REPRESENTATIONS; DISSOCIATION;
D O I
10.1073/pnas.1014938108
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Recent research in neuroeconomics has demonstrated that the reinforcement learning model of reward learning captures the patterns of both behavioral performance and neural responses during a range of economic decision-making tasks. However, this powerful theoretical model has its limits. Trial-and-error is only one of the means by which individuals can learn the value associated with different decision options. Humans have also developed efficient, symbolic means of communication for learning without the necessity for committing multiple errors across trials. In the present study, we observed that instructed knowledge of cue-reward probabilities improves behavioral performance and diminishes reinforcement learning-related blood-oxygen level-dependent (BOLD) responses to feedback in the nucleus accumbens, ventromedial prefrontal cortex, and hippocampal complex. The decrease in BOLD responses in these brain regions to reward-feedback signals was functionally correlated with activation of the dorsolateral prefrontal cortex (DLPFC). These results suggest that when learning action values, participants use the DLPFC to dynamically adjust outcome responses in valuation regions depending on the usefulness of action-outcome information.
引用
收藏
页码:55 / 60
页数:6
相关论文
共 55 条
  • [1] Beautiful faces have variable reward value: fMRI and behavioral evidence
    Aharon, I
    Etcoff, N
    Ariely, D
    Chabris, CF
    O'Connor, E
    Breiter, HC
    [J]. NEURON, 2001, 32 (03) : 537 - 551
  • [2] Dissociated neural representations of intensity and valence in human olfaction
    Anderson, AK
    Christoff, K
    Stappen, I
    Panitz, D
    Ghahremani, DG
    Glover, G
    Gabrieli, JDE
    Sobel, N
    [J]. NATURE NEUROSCIENCE, 2003, 6 (02) : 196 - 202
  • [3] Computational Models for the Combination of Advice and Individual Learning
    Biele, Guido
    Rieskamp, Joerg
    Gonzalez, Richard
    [J]. COGNITIVE SCIENCE, 2009, 33 (02) : 206 - 242
  • [4] Short-term memory traces for action bias in human reinforcement learning
    Bogacz, Rafal
    McClure, Samuel M.
    Li, Jian
    Cohen, Jonathan D.
    Montague, P. Read
    [J]. BRAIN RESEARCH, 2007, 1153 : 111 - 121
  • [5] Functional imaging of neural responses to expectancy and experience of monetary gains and losses
    Breiter, HC
    Aharon, I
    Kahneman, D
    Dale, A
    Shizgal, P
    [J]. NEURON, 2001, 30 (02) : 619 - 639
  • [6] Neural mechanisms of observational learning
    Burke, Christopher J.
    Tobler, Philippe N.
    Baddeley, Michelle
    Schultz, Wolfram
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2010, 107 (32) : 14431 - 14436
  • [7] Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control
    Daw, ND
    Niv, Y
    Dayan, P
    [J]. NATURE NEUROSCIENCE, 2005, 8 (12) : 1704 - 1711
  • [8] Decision theory, reinforcement learning, and the brain
    Dayan, Peter
    Daw, Nathaniel D.
    [J]. COGNITIVE AFFECTIVE & BEHAVIORAL NEUROSCIENCE, 2008, 8 (04) : 429 - 453
  • [9] The neural basis of altruistic punishment
    de Quervain, DJF
    Fischbacher, U
    Treyer, V
    Schelthammer, M
    Schnyder, U
    Buck, A
    Fehr, E
    [J]. SCIENCE, 2004, 305 (5688) : 1254 - 1258
  • [10] Regulating the expectation of reward via cognitive strategies
    Delgado, Mauricio R.
    Gillis, M. Meredith
    Phelps, Elizabeth A.
    [J]. NATURE NEUROSCIENCE, 2008, 11 (08) : 880 - 881