Model-Based Influences on Humans' Choices and Striatal Prediction Errors

被引:1086
作者
Daw, Nathaniel D. [1 ,2 ]
Gershman, Samuel J. [3 ,4 ]
Seymour, Ben [5 ]
Dayan, Peter [6 ]
Dolan, Raymond J. [5 ]
机构
[1] NYU, Ctr Neural Sci, New York, NY 10012 USA
[2] NYU, Dept Psychol, New York, NY 10012 USA
[3] Princeton Univ, Dept Psychol, Princeton, NJ 08540 USA
[4] Princeton Univ, Inst Neurosci, Princeton, NJ 08540 USA
[5] UCL, Inst Neurol, Wellcome Trust Ctr Neuroimaging, London WC1N 3BG, England
[6] UCL, Gatsby Computat Neurosci Unit, London WC1N 3AR, England
基金
英国惠康基金;
关键词
TEMPORAL DIFFERENCE MODELS; DOPAMINE NEURONS ENCODE; RISKY DECISION-MAKING; HUMAN BRAIN; BASAL GANGLIA; ORBITOFRONTAL CORTEX; PREFRONTAL CORTEX; NUCLEUS-ACCUMBENS; LEARNING SIGNALS; SUBJECTIVE VALUE;
D O I
10.1016/j.neuron.2011.02.027
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
The mesostriatal dopamine system is prominently implicated in model-free reinforcement learning, with fMRI BOLD signals in ventral striatum notably covarying with model-free prediction errors. However, latent learning and devaluation studies show that behavior also shows hallmarks of model-based planning, and the interaction between model-based and model-free values, prediction errors, and preferences is underexplored. We designed a multistep decision task in which model-based and model-free influences on human choice behavior could be distinguished. By showing that choices reflected both influences we could then test the purity of the ventral striatal BOLD signal as a model-free report. Contrary to expectations, the signal reflected both model-free and model-based predictions in proportions matching those that best explained choice behavior. These results challenge the notion of a separate model-free learner and suggest a more integrated computational architecture for high-level human decision-making.
引用
收藏
页码:1204 / 1215
页数:12
相关论文
共 91 条
[1]   VARIATIONS IN THE SENSITIVITY OF INSTRUMENTAL RESPONDING TO REINFORCER DEVALUATION [J].
ADAMS, CD .
QUARTERLY JOURNAL OF EXPERIMENTAL PSYCHOLOGY SECTION B-COMPARATIVE AND PHYSIOLOGICAL PSYCHOLOGY, 1982, 34 (MAY) :77-98
[2]  
[Anonymous], 2010, R LANG ENV STAT COMP
[3]  
[Anonymous], 1994, Cites
[4]  
[Anonymous], 1990, INT C MACHINE LEARNI
[5]   Human and Rodent Homologies in Action Control: Corticostriatal Determinants of Goal-Directed and Habitual Action [J].
Balleine, Bernard W. ;
O'Doherty, John P. .
NEUROPSYCHOPHARMACOLOGY, 2010, 35 (01) :48-69
[6]  
Balleine BW, 2009, NEUROECONOMICS: DECISION MAKING AND THE BRAIN, P367, DOI 10.1016/B978-0-12-374176-9.00024-5
[7]   Thought disorder and nucleus accumbens in childhood: a structural MRI study [J].
Ballmaier, M ;
Toga, AW ;
Siddarth, P ;
Blanton, RE ;
Levitt, JG ;
Lee, M ;
Caplan, R .
PSYCHIATRY RESEARCH-NEUROIMAGING, 2004, 130 (01) :43-55
[8]  
Barto A. G., 1995, Models of Information Processing in the Basal Ganglia, P215
[9]   NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[10]  
Bates D., 2010, IME4 LINEAR MIXED EF