Temporal difference models describe higher-order learning in humans

被引:453
作者
Seymour, B
O'Doherty, JP
Dayan, P
Koltzenburg, M
Jones, AK
Dolan, RJ
Friston, KJ
Frackowiak, RS
机构
[1] Wellcome Dept Imaging Neurosci, London WC1N 3BG, England
[2] Gatsby Computat Neurosci Unit, London WC1N 3AR, England
[3] UCL, Inst Child Hlth, London WC1N 1EH, England
[4] Univ Manchester, Ctr Rheumat Dis, Hope Hosp, Manchester M6 8HD, Lancs, England
[5] Fdn Santa Lucia, I-00179 Rome, Italy
基金
英国惠康基金; 匈牙利科学研究基金会;
关键词
D O I
10.1038/nature02581
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
The ability to use environmental stimuli to predict impending harm is critical for survival. Such predictions should be available as early as they are reliable. In pavlovian conditioning, chains of successively earlier predictors are studied in terms of higher-order relationships, and have inspired computational theories such as temporal difference learning(1). However, there is at present no adequate neurobiological account of how this learning occurs. Here, in a functional magnetic resonance imaging ( fMRI) study of higher-order aversive conditioning, we describe a key computational strategy that humans use to learn predictions about pain. We show that neural activity in the ventral striatum and the anterior insula displays a marked correspondence to the signals for sequential learning predicted by temporal difference models. This result reveals a flexible aversive learning process ideally suited to the changing and uncertain nature of real-world environments. Taken with existing data on reward learning(2), our results suggest a critical role for the ventral striatum in integrating complex appetitive and aversive predictions to coordinate behaviour.
引用
收藏
页码:664 / 667
页数:4
相关论文
共 30 条
[1]   AUTORADIOGRAPHIC ANALYSIS OF DIFFERENTIAL ASCENDING PROJECTIONS OF DORSAL AND MEDIAN RAPHE NUCLEI IN RAT [J].
AZMITIA, EC ;
SEGAL, M .
JOURNAL OF COMPARATIVE NEUROLOGY, 1978, 179 (03) :641-667
[2]  
Barto A. G., 1990, LEARNING COMPUTATION, P539
[3]   NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[4]  
Barto AG., 1995, Models of information processing in the basal ganglia, P215
[5]   Stimulus representation in SOP: I - Theoretical rationalization and some implications [J].
Brandon, SE ;
Vogel, EH ;
Wagner, AR .
BEHAVIOURAL PROCESSES, 2003, 62 (1-3) :5-25
[6]  
Büchel C, 1999, J NEUROSCI, V19, P10869
[7]   Classical fear conditioning in functional neuroimaging [J].
Büchel, C ;
Dolan, RJ .
CURRENT OPINION IN NEUROBIOLOGY, 2000, 10 (02) :219-223
[8]   THE ROLE OF THE BASAL GANGLIA IN NOCICEPTION AND PAIN [J].
CHUDLER, EH ;
DONG, WK .
PAIN, 1995, 60 (01) :3-38
[9]   Opponent interactions between serotonin and dopamine [J].
Daw, ND ;
Kakade, S ;
Dayan, P .
NEURAL NETWORKS, 2002, 15 (4-6) :603-616
[10]   Optimized EPI for fMRI studies of the orbitofrontal cortex [J].
Deichmann, R ;
Gottfried, JA ;
Hutton, C ;
Turner, R .
NEUROIMAGE, 2003, 19 (02) :430-441