Dopamine neurons can represent context-dependent prediction error

被引:189
作者
Nakahara, H
Itoh, H
Kawagoe, R
Takikawa, Y
Hikosaka, O
机构
[1] RIKEN, Brain Sci Inst, Lab Math Neurosci, Wako, Saitama 3510198, Japan
[2] Tokyo Inst Technol, Interdisciplinary Grad Sch Sci & Engn, Dept Computat Intelligence & Syst Sci, Yokohama, Kanagawa 2268502, Japan
[3] Juntendo Univ, Sch Med, Dept Physiol, Tokyo 1138421, Japan
[4] NEI, Sensorimotor Res Lab, NIH, Bethesda, MD 20892 USA
关键词
D O I
10.1016/S0896-6273(03)00869-9
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Midbrain dopamine (DA) neurons are thought to encode reward prediction error. Reward prediction can be improved if any relevant context is taken into account. We found that monkey DA neurons can encode a context-dependent prediction error. In the first noncontextual task, a light stimulus was randomly followed by reward, with a fixed equal probability. The response of DA neurons was positively correlated with the number of preceding unrewarded trials and could be simulated by a conventional temporal difference (TD) model. In the second contextual task, a reward-indicating light stimulus was presented with the probability that, while fixed overall, was incremented as a function of the number of preceding unrewarded trials. The DA neuronal response then was negatively correlated with this number. This history effect corresponded to the prediction error based on the conditional probability of reward and could be simulated only by implementing the relevant context into the TD model.
引用
收藏
页码:269 / 280
页数:12
相关论文
共 57 条
[1]  
ARBIB MA, 1995, MODELS INFORMATION P, P149
[2]  
Barto AG., 1995, Models of information processing in the basal ganglia, P215
[3]   A computational model of how the basal ganglia produce sequences [J].
Berns, GS ;
Sejnowski, TJ .
JOURNAL OF COGNITIVE NEUROSCIENCE, 1998, 10 (01) :108-121
[4]   Visual and anticipatory bias in three cortical eye fields of the monkey during an adaptive decision-making task [J].
Coe, B ;
Tomihara, K ;
Matsuzawa, M ;
Hikosaka, O .
JOURNAL OF NEUROSCIENCE, 2002, 22 (12) :5081-5090
[5]  
DAYAN P, 2002, NIPS, V14, P11
[6]   Metalearning and neuromodulation [J].
Doya, K .
NEURAL NETWORKS, 2002, 15 (4-6) :495-506
[7]   Discrete coding of reward probability and uncertainty by dopamine neurons [J].
Fiorillo, CD ;
Tobler, PN ;
Schultz, W .
SCIENCE, 2003, 299 (5614) :1898-1902
[8]   THE BASAL GANGLIA AND ADAPTIVE MOTOR CONTROL [J].
GRAYBIEL, AM ;
AOSAKI, T ;
FLAHERTY, AW ;
KIMURA, M .
SCIENCE, 1994, 265 (5180) :1826-1831
[9]   Dopamine neurons report an error in the temporal prediction of reward during learning [J].
Hollerman, JR ;
Schultz, W .
NATURE NEUROSCIENCE, 1998, 1 (04) :304-309
[10]  
Houk J., 1995, Models ofInformation Processing in the Basal Ganglia, P249