Solving the distal reward problem through linkage of STDP and dopamine signaling

Cited by: 488
Authors
Izhikevich, Eugene M. [1]
Affiliation
[1] Inst Neurosci, San Diego, CA 92121 USA
Funding
National Science Foundation (USA);
Keywords
classical conditioning; dopamine; instrumental conditioning; reward; simulation; spike-timing-dependent plasticity (STDP);
DOI
10.1093/cercor/bhl152
Chinese Library Classification (CLC) number
Q189 [Neuroscience];
Subject classification code
071006;
Abstract
In Pavlovian and instrumental conditioning, reward typically comes seconds after reward-triggering actions, creating an explanatory conundrum known as the "distal reward problem": How does the brain know what firing patterns of what neurons are responsible for the reward if 1) the patterns are no longer there when the reward arrives and 2) all neurons and synapses are active during the waiting period to the reward? Here, we show how the conundrum is resolved by a model network of cortical spiking neurons with spike-timing-dependent plasticity (STDP) modulated by dopamine (DA). Although STDP is triggered by nearly coincident firing patterns on a millisecond timescale, the slow kinetics of subsequent synaptic plasticity is sensitive to changes in the extracellular DA concentration during the critical period of a few seconds. Random firings during the waiting period to the reward do not affect STDP and hence make the network insensitive to the ongoing activity. This is the key feature that distinguishes our approach from previous theoretical studies, which implicitly assume that the network be quiet during the waiting period or that the patterns be preserved until the reward arrives. This study emphasizes the importance of precise firing patterns in brain dynamics and suggests how a global diffusive reinforcement signal in the form of extracellular DA can selectively influence the right synapses at the right time.
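The mechanism described in the abstract can be illustrated with a minimal sketch of a single synapse. This is not the author's implementation: the function name `simulate_synapse`, the exponentially decaying eligibility trace, the time constants `tau_c` and `tau_d`, and the pulse size `da_pulse` are all assumptions chosen for illustration. The sketch assumes that each STDP event leaves a slowly decaying trace on the synapse, and that the synaptic weight changes only in proportion to that trace multiplied by the extracellular DA level.

```python
def simulate_synapse(stdp_events, reward_times, t_end=10.0, dt=0.001,
                     tau_c=1.0, tau_d=0.2, da_pulse=0.5):
    """Euler integration of one DA-modulated synapse (illustrative only).

    stdp_events  : list of (time_s, amount); amount > 0 for pre-before-post
                   pairings, amount < 0 for post-before-pre pairings.
    reward_times : times (s) at which a DA pulse is delivered.
    tau_c, tau_d : assumed decay constants (s) of the eligibility trace
                   and of the extracellular DA level.
    Returns the accumulated weight change at t_end.
    """
    # Bin events and rewards onto the integration grid.
    events = {}
    for t, amount in stdp_events:
        k = int(round(t / dt))
        events[k] = events.get(k, 0.0) + amount
    rewards = {int(round(t / dt)) for t in reward_times}

    c = 0.0  # eligibility trace left by STDP events
    d = 0.0  # extracellular DA above baseline
    s = 0.0  # accumulated weight change
    for k in range(int(round(t_end / dt))):
        c += events.get(k, 0.0)   # millisecond-scale STDP tags the synapse
        if k in rewards:
            d += da_pulse         # delayed reward raises DA
        s += c * d * dt           # weight moves only when trace and DA overlap
        c -= c / tau_c * dt       # trace decays over about a second
        d -= d / tau_d * dt       # DA is cleared faster than the trace
    return s


if __name__ == "__main__":
    soon = simulate_synapse([(1.0, 1.0)], [2.0])   # reward 1 s after pairing
    late = simulate_synapse([(1.0, 1.0)], [8.0])   # reward 7 s after pairing
    print(soon, late)
```

Under these assumed parameters, a reward arriving one second after the pairing changes the weight far more than a reward arriving seven seconds later, and a synapse with no STDP events is left unchanged by the same DA pulse even though the DA signal is global.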
Pages: 2443-2452
Page count: 10
Related papers
61 in total
[1]   DEPENDENCE OF CORTICAL PLASTICITY ON CORRELATED ACTIVITY OF SINGLE NEURONS AND ON BEHAVIORAL CONTEXT [J].
AHISSAR, E ;
VAADIA, E ;
AHISSAR, M ;
BERGMAN, H ;
ARIELI, A ;
ABELES, M .
SCIENCE, 1992, 257 (5075) :1412-1415
[2]  
Au-Young SMW, 1999, SYNAPSE, V34, P245, DOI 10.1002/(SICI)1098-2396(19991215)34:4<245::AID-SYN1>3.0.CO;2-D
[4]   Rolipram, a type IV-specific phosphodiesterase inhibitor, facilitates the establishment of long-lasting long-term potentiation and improves memory [J].
Barad, M ;
Bourtchouladze, R ;
Winder, DG ;
Golan, H ;
Kandel, E .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1998, 95 (25) :15020-15025
[5]   NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[6]   Synaptic modifications in cultured hippocampal neurons: Dependence on spike timing, synaptic strength, and postsynaptic cell type [J].
Bi, GQ ;
Poo, MM .
JOURNAL OF NEUROSCIENCE, 1998, 18 (24) :10464-10472
[7]   Dopamine and cAMP-regulated phosphoprotein 32 kDa controls both striatal long-term depression and long-term potentiation, opposing forms of synaptic plasticity [J].
Calabresi, P ;
Gubellini, P ;
Centonze, D ;
Picconi, B ;
Bernardi, G ;
Chergui, K ;
Svenningsson, P ;
Fienberg, AA ;
Greengard, P .
JOURNAL OF NEUROSCIENCE, 2000, 20 (22) :8443-8451
[8]  
CASS WA, 1995, J NEUROCHEM, V65, P201
[9]   Unilateral dopamine denervation blocks corticostriatal LTP [J].
Centonze, D ;
Gubellini, P ;
Picconi, B ;
Calabresi, P ;
Giacomini, P ;
Bernardi, G .
JOURNAL OF NEUROPHYSIOLOGY, 1999, 82 (06) :3575-3579
[10]   Decreased probability of neurotransmitter release underlies striatal long-term depression and postnatal development of corticostriatal synapses [J].
Choi, S ;
Lovinger, DM .
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1997, 94 (06) :2665-2670