Eligibility Traces and Plasticity on Behavioral Time Scales: Experimental Support of NeoHebbian Three-Factor Learning Rules

被引:189
作者
Gerstner, Wulfram [1 ]
Lehmann, Marco
Liakoni, Vasiliki
Corneil, Dane
Brea, Johanni
机构
[1] Ecole Polytech Fed Lausanne, Sch Comp Sci, Lausanne, Switzerland
基金
欧洲研究理事会; 瑞士国家科学基金会;
关键词
eligibility trace; hebb rule; reinforcement learning; neuromodulators; surprise; synaptic tagging; synaptic plasticity; behavioral learning; TIMING-DEPENDENT PLASTICITY; LONG-TERM POTENTIATION; NEURAL-NETWORK MODEL; SYNAPTIC PLASTICITY; BASAL GANGLIA; STRUCTURAL PLASTICITY; DOPAMINE; REINFORCEMENT; MEMORY; SYSTEMS;
D O I
10.3389/fncir.2018.00053
中图分类号
Q189 [神经科学];
学科分类号
071006 [神经生物学];
摘要
Most elementary behaviors such as moving the arm to grasp an object or walking into the next room to explore a museum evolve on the time scale of seconds; in contrast, neuronal action potentials occur on the time scale of a few milliseconds. Learning rules of the brain must therefore bridge the gap between these two different time scales. Modern theories of synaptic plasticity have postulated that the co-activation of pre- and postsynaptic neurons sets a flag at the synapse, called an eligibility trace, that leads to a weight change only if an additional factor is present while the flag is set. This third factor, signaling reward, punishment, surprise, or novelty, could be implemented by the phasic activity of neuromodulators or specific neuronal inputs signaling special events. While the theoretical framework has been developed over the last decades, experimental evidence in support of eligibility traces on the time scale of seconds has been collected only during the last few years. Here we review, in the context of three-factor rules of synaptic plasticity, four key experiments that support the role of synaptic eligibility traces in combination with a third factor as a biological implementation of neoHebbian three-factor learning rules.
引用
收藏
页数:16
相关论文
共 160 条
[1]
[Anonymous], 2015, Reinforcement Learning: An Introduction
[2]
[Anonymous], 1949, THE ORGANIZATION OF
[3]
[Anonymous], 1941, Conditioned reflexes and psychiatry
[4]
Spatial cognition and neuro-mimetic navigation: a model of hippocampal place cell activity [J].
Arleo, A ;
Gerstner, W .
BIOLOGICAL CYBERNETICS, 2000, 83 (03) :287-299
[5]
LONG-TERM DEPRESSION OF EXCITATORY SYNAPTIC TRANSMISSION AND ITS RELATIONSHIP TO LONG-TERM POTENTIATION [J].
ARTOLA, A ;
SINGER, W .
TRENDS IN NEUROSCIENCES, 1993, 16 (11) :480-487
[6]
Is heterosynaptic modulation essential for stabilizing Hebbian plasticity and memory? [J].
Bailey, CH ;
Giustetto, M ;
Huang, YY ;
Hawkins, RD ;
Kandel, ER .
NATURE REVIEWS NEUROSCIENCE, 2000, 1 (01) :11-20
[7]
State Based Model of Long-Term Potentiation and Synaptic Tagging and Capture [J].
Barrett, Adam B. ;
Billings, Guy O. ;
Morris, Richard G. M. ;
van Rossum, Mark C. W. .
PLOS COMPUTATIONAL BIOLOGY, 2009, 5 (01)
[8]
Bartlett P. L., 1999, TECHNICAL REPORT
[9]
NEURONLIKE ADAPTIVE ELEMENTS THAT CAN SOLVE DIFFICULT LEARNING CONTROL-PROBLEMS [J].
BARTO, AG ;
SUTTON, RS ;
ANDERSON, CW .
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS, 1983, 13 (05) :834-846
[10]
BARTO AG, 1985, HUM NEUROBIOL, V4, P229