Evolving spike-timing-dependent plasticity for single-trial learning in robots

被引:30
作者
Di Paolo, EA [1 ]
机构
[1] Univ Sussex, Sch Cognit & Comp Sci, Brighton BN1 9QH, E Sussex, England
来源
PHILOSOPHICAL TRANSACTIONS OF THE ROYAL SOCIETY A-MATHEMATICAL PHYSICAL AND ENGINEERING SCIENCES | 2003年 / 361卷 / 1811期
关键词
spiking neural networks; spike-timing-dependent plasticity; activity-dependent synaptic scaling; single-trial learning; evolutionary robotics;
D O I
10.1098/rsta.2003.1256
中图分类号
O [数理科学和化学]; P [天文学、地球科学]; Q [生物科学]; N [自然科学总论];
学科分类号
07 ; 0710 ; 09 ;
摘要
Single-trial learning is studied in an evolved robot model of synaptic spike-timing-dependent plasticity (STDP). Robots must perform positive phototaxis but must learn to perform negative phototaxis in the presence of a short-lived aversive sound stimulus. STDP acts at the millisecond range and depends asymmetrically on the relative timing of pre- and post-synaptic spikes. Although it has been involved in learning models of input prediction, these models require the iterated presentation of the input pattern, and it is hard to see how this mechanism could sustain single-trial learning over a time-scale of tens of seconds. An incremental evolutionary approach is used to answer this question. The evolved robots succeed in learning the appropriate behaviour, but learning does not depend on achieving the right synaptic configuration but rather the right pattern of neural activity. Robot performance during positive phototaxis is quite robust to loss of spike-timing information, but in contrast, this loss is catastrophic for learning negative phototaxis where entrained firing is common. Tests show that the final weight configuration carries no information about whether a robot is performing one behaviour or the other. Fixing weights, however, has the effect of degrading performance, thus demonstrating that plasticity is used to sustain the neural activity corresponding both to the normal phototaxis condition and to the learned behaviour. The implications and limitations of this result are discussed.
引用
收藏
页码:2299 / 2319
页数:21
相关论文
共 40 条
[1]   Synaptic plasticity: taming the beast [J].
Abbott, L. F. ;
Nelson, Sacha B. .
NATURE NEUROSCIENCE, 2000, 3 (11) :1178-1183
[2]   Functional significance of long-term potentiation for sequence learning and prediction [J].
Abbott, LF ;
Blum, KI .
CEREBRAL CORTEX, 1996, 6 (03) :406-416
[3]  
[Anonymous], 1990, Intelligence as Adaptive Behavior: An Experiment in Computational Neuroethology
[4]  
[Anonymous], 1953, LIVING BRAIN
[5]  
Beer R. D., 1996, From Animals to Animats 4. Proceedings of the Fourth International Conference on Simulation of Adaptive Behavior, P421
[6]   Synaptic modification by correlated activity: Hebb's postulate revisited [J].
Bi, GQ ;
Poo, MM .
ANNUAL REVIEW OF NEUROSCIENCE, 2001, 24 :139-166
[7]   Synaptic modifications in cultured hippocampal neurons: Dependence on spike timing, synaptic strength, and postsynaptic cell type [J].
Bi, GQ ;
Poo, MM .
JOURNAL OF NEUROSCIENCE, 1998, 18 (24) :10464-10472
[8]   Spike-timing-dependent plasticity and relevant mutual information maximization [J].
Chechik, G .
NEURAL COMPUTATION, 2003, 15 (07) :1481-1510
[9]   Spike-timing dependent plasticity for evolved robots [J].
Di Paolo, EA .
ADAPTIVE BEHAVIOR, 2002, 10 (3-4) :243-263
[10]   Evolutionary robots with on-line self-organization and behavioral fitness [J].
Floreano, D ;
Urzelai, J .
NEURAL NETWORKS, 2000, 13 (4-5) :431-443