Learning the value of information in an uncertain world

被引:1370
作者
Behrens, Timothy E. J. [1 ]
Woolrich, Mark W.
Walton, Mark E.
Rushworth, Matthew F. S.
机构
[1] Univ Oxford, John Radcliffe Hosp, FMRIB Ctr, Oxford OX3 9DU, England
[2] Univ Oxford, Dept Expt Psychol, Oxford OX1 3UD, England
基金
英国医学研究理事会; 英国惠康基金;
关键词
D O I
10.1038/nn1954
中图分类号
Q189 [神经科学];
学科分类号
071006 [神经生物学];
摘要
Our decisions are guided by outcomes that are associated with decisions made in the past. However, the amount of influence each past outcome has on our next decision remains unclear. To ensure optimal decision-making, the weight given to decision outcomes should reflect their salience in predicting future outcomes, and this salience should be modulated by the volatility of the reward environment. We show that human subjects assess volatility in an optimal manner and adjust decision-making accordingly. This optimal estimate of volatility is reflected in the fMRI signal in the anterior cingulate cortex ( ACC) when each trial outcome is observed. When a new piece of information is witnessed, activity levels reflect its salience for predicting future outcomes. Furthermore, variations in this ACC signal across the population predict variations in subject learning rates. Our results provide a formal account of how we weigh our different experiences in guiding our future actions.
引用
收藏
页码:1214 / 1221
页数:8
相关论文
共 51 条
[1]
Reward encoding in the monkey anterior cingulate cortex [J].
Amiez, C. ;
Joseph, J. P. ;
Procyk, E. .
CEREBRAL CORTEX, 2006, 16 (07) :1040-1055
[2]
An integrative theory of locus coeruleus-norepinephrine function: Adaptive gain and optimal performance [J].
Aston-Jones, G ;
Cohen, JD .
ANNUAL REVIEW OF NEUROSCIENCE, 2005, 28 :403-450
[3]
Midbrain dopamine neurons encode a quantitative reward prediction error signal [J].
Bayer, HM ;
Glimcher, PW .
NEURON, 2005, 47 (01) :129-141
[4]
Conflict monitoring and anterior cingulate cortex: an update [J].
Botvinick, Matthew M. ;
Cohen, Jonathan D. ;
Carter, Cameron S. .
TRENDS IN COGNITIVE SCIENCES, 2004, 8 (12) :539-546
[5]
Learned predictions of error likelihood in the anterior cingulate cortex [J].
Brown, JW ;
Braver, TS .
SCIENCE, 2005, 307 (5712) :1118-1121
[6]
Bayesian theories of conditioning in a changing world [J].
Courville, Aaron C. ;
Daw, Nathaniel D. ;
Touretzky, David S. .
TRENDS IN COGNITIVE SCIENCES, 2006, 10 (07) :294-300
[7]
PROBABILITY, FREQUENCY AND REASONABLE EXPECTATION [J].
COX, RT .
AMERICAN JOURNAL OF PHYSICS, 1946, 14 (01) :1-13
[8]
Neural activity in the human brain relating to uncertainty and arousal during anticipation [J].
Critchley, HD ;
Mathias, CJ ;
Dolan, RJ .
NEURON, 2001, 29 (02) :537-545
[9]
Cortical substrates for exploratory decisions in humans [J].
Daw, Nathaniel D. ;
O'Doherty, John P. ;
Dayan, Peter ;
Seymour, Ben ;
Dolan, Raymond J. .
NATURE, 2006, 441 (7095) :876-879
[10]
Learning and selective attention [J].
Dayan, Peter ;
Kakade, Sham ;
Montague, P. Read .
NATURE NEUROSCIENCE, 2000, 3 (11) :1218-1223