Neural signature of fictive learning signals in a sequential investment task

被引:185
作者
Lohrenz, Terry
McCabe, Kevin
Camerer, Colin F.
Montague, P. Read
机构
[1] Baylor Coll Med, Dept Neurosci, Houston, TX 77030 USA
[2] Baylor Coll Med, Menninger Dept Psychait & Behav Sci, Houston, TX 77030 USA
[3] George Mason Univ, Dept Econ, Fairfax, VA 22030 USA
[4] CALTECH, Pasadena, CA 91125 USA
关键词
counterfactual signals; decision-making; neuroeconomics; reinforcement learning;
D O I
10.1073/pnas.0608842104
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Reinforcement learning models now provide principled guides for a wide range of reward learning experiments in animals and humans. One key learning (error) signal in these models is experiential and reports ongoing temporal differences between expected and experienced reward. However, these same abstract learning models also accommodate the existence of another class of learning signal that takes the form of a fictive error encoding ongoing differences between experienced returns and returns that "could-have-been-experienced" if decisions had been different. These observations suggest the hypothesis that, for all real-world learning tasks, one should expect the presence of both experiential and fictive learning signals. Motivated by this possibility, we used a sequential investment game and fMRI to probe ongoing brain responses to both experiential and fictive learning signals generated throughout the game. Using a large cohort of subjects (n = 54), we report that fictive learning signals strongly predict changes in subjects' investment behavior and correlate with fMRI signals measured in dopaminoceptive structures known to be involved in valuation and choice.
引用
收藏
页码:9493 / 9498
页数:6
相关论文
共 48 条
[1]  
Ashburner J, 1999, HUM BRAIN MAPP, V7, P254, DOI 10.1002/(SICI)1097-0193(1999)7:4<254::AID-HBM4>3.0.CO
[2]  
2-G
[3]   Midbrain dopamine neurons encode a quantitative reward prediction error signal [J].
Bayer, HM ;
Glimcher, PW .
NEURON, 2005, 47 (01) :129-141
[4]   Deciding advantageously before knowing the advantageous strategy [J].
Bechara, A ;
Damasio, H ;
Tranel, D ;
Damasio, AR .
SCIENCE, 1997, 275 (5304) :1293-1295
[5]   REGRET IN DECISION-MAKING UNDER UNCERTAINTY [J].
BELL, DE .
OPERATIONS RESEARCH, 1982, 30 (05) :961-981
[6]   Functional imaging of neural responses to expectancy and experience of monetary gains and losses [J].
Breiter, HC ;
Aharon, I ;
Kahneman, D ;
Dale, A ;
Shizgal, P .
NEURON, 2001, 30 (02) :619-639
[7]   Experience-weighted attraction learning in normal form games [J].
Camerer, C ;
Ho, TH .
ECONOMETRICA, 1999, 67 (04) :827-874
[8]  
Camerer C. F, 2002, BEHAV GAME THEORY EX
[9]   The involvement of the orbitofrontal cortex in the experience of regret [J].
Camille, N ;
Coricelli, G ;
Sallet, J ;
Pradat-Diehl, P ;
Duhamel, JR ;
Sirigu, A .
SCIENCE, 2004, 304 (5674) :1167-1170
[10]   Regret and its avoidance: a neuroimaging study of choice behavior [J].
Coricelli, G ;
Critchley, HD ;
Joffily, M ;
O'Doherty, JP ;
Sirigu, A ;
Dolan, RJ .
NATURE NEUROSCIENCE, 2005, 8 (09) :1255-1262