Dopamine neurons report an error in the temporal prediction of reward during learning

被引:791
作者
Hollerman, JR [1 ]
Schultz, W [1 ]
机构
[1] Univ Fribourg, Inst Physiol, CH-1700 Fribourg, Switzerland
关键词
D O I
10.1038/1124
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
Many behaviors are affected by rewards, undergoing long-term changes when rewards are different than predicted but remaining unchanged when rewards occur exactly as predicted. The discrepancy between reward occurrence and reward prediction is termed an 'error in reward prediction'. Dopamine neurons in the substantia nigra and the ventral tegmental area are believed to be involved in reward-dependent behaviors. Consistent with this role, they are activated by rewards, and because they are activated more strongly by unpredicted than by predicted rewards they may play a role in learning. The present study investigated whether monkey dopamine neurons code an error in reward prediction during the course of learning. Dopamine neuron responses reflected the changes in reward prediction during individual learning episodes; dopamine neurons were activated by rewards during early trials, when errors were frequent and rewards unpredictable, but activation was progressively reduced as performance was consolidated and rewards became more predictable. These neurons were also activated when rewards occurred at unpredicted times and were depressed when rewards were omitted at the predicted times. Thus, dopamine neurons code errors in the prediction of both the occurrence and the time of rewards. In this respect, their responses resemble the teaching signals that have been employed in particularly efficient computational learning models.
引用
收藏
页码:304 / 309
页数:6
相关论文
共 47 条
[11]   SURPRISE AND ATTENUATION OF BLOCKING [J].
DICKINSON, A ;
HALL, G ;
MACKINTOSH, NJ .
JOURNAL OF EXPERIMENTAL PSYCHOLOGY-ANIMAL BEHAVIOR PROCESSES, 1976, 2 (04) :313-322
[12]  
Dickinson A., 1980, CONT ANIMAL LEARNING
[13]  
GRAFFAN EA, 1988, J NEUROSCI, V8, P3144
[14]   FUNCTIONAL-PROPERTIES OF MONKEY CAUDATE NEURONS .3. ACTIVITIES RELATED TO EXPECTATION OF TARGET AND REWARD [J].
HIKOSAKA, O ;
SAKAMOTO, M ;
USUI, S .
JOURNAL OF NEUROPHYSIOLOGY, 1989, 61 (04) :814-832
[15]   Burst activity of ventral tegmental dopamine neurons is elicited by sensory stimuli in the awake cat [J].
Horvitz, JC ;
Stewart, T ;
Jacobs, BL .
BRAIN RESEARCH, 1997, 759 (02) :251-258
[16]  
Houk J., 1995, Models ofInformation Processing in the Basal Ganglia, P249
[17]   RESPONSES OF MONKEY DOPAMINE NEURONS DURING LEARNING OF BEHAVIORAL REACTIONS [J].
LJUNGBERG, T ;
APICELLA, P ;
SCHULTZ, W .
JOURNAL OF NEUROPHYSIOLOGY, 1992, 67 (01) :145-163
[18]   THEORY OF ATTENTION - VARIATIONS IN ASSOCIABILITY OF STIMULI WITH REINFORCEMENT [J].
MACKINTOSH, NJ .
PSYCHOLOGICAL REVIEW, 1975, 82 (04) :276-298
[19]   VISUAL AND OCULOMOTOR FUNCTIONS OF MONKEY SUBTHALAMIC NUCLEUS [J].
MATSUMURA, M ;
KOJIMA, J ;
GARDINER, TW ;
HIKOSAKA, O .
JOURNAL OF NEUROPHYSIOLOGY, 1992, 67 (06) :1615-1632
[20]   Preferential activation of midbrain dopamine neurons by appetitive rather than aversive stimuli [J].
Mirenowicz, J ;
Schultz, W .
NATURE, 1996, 379 (6564) :449-451