Temporal difference model reproduces anticipatory neural activity

被引：109

作者：

Suri, RE ^{[1
]}

Schultz, W ^{[1
]}

机构：

[1] Univ Fribourg, Inst Physiol, CH-1700 Fribourg, Switzerland

来源：

NEURAL COMPUTATION | 2001年 / 13卷 / 04期

关键词：

D O I：

10.1162/089976601300014376

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Anticipatory neural activity preceding behaviorally important events has been reported in cortex, striatum, and midbrain dopamine neurons. Whereas dopamine neurons are phasically activated by reward-predictive stimuli, anticipatory activity of cortical and striatal neurons is increased during delay periods before important events. Characteristics of dopamine neuron activity resemble those of the prediction error signal of the temporal difference (TD) model of Pavlovian learning (Sutton & Barto, 1990). This study demonstrates that the prediction signal of the TD model reproduces characteristics of cortical and striatal anticipatory neural activity. This finding suggests that tonic anticipatory activities may reflect prediction signals that are involved in the processing of dopamine neuron activity.

引用

页码：841 / 862

页数：22

共 54 条

[41] Reward processing in primate orbitofrontal cortex and basal ganglia [J].