Learning precise timing with LSTM recurrent networks

被引:1249
作者
Gers, FA
Schraudolph, NN
Schmidhuber, J
机构
[1] IDSIA, CH-6928 Manno, Switzerland
[2] ETH Zentrum, Inst Computat Sci, CH-8092 Zurich, Switzerland
关键词
recurrent neural networks; long short-term memory; timing;
D O I
10.1162/153244303768966139
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 [计算机科学与技术];
摘要
The temporal distance between events conveys information essential for numerous sequential tasks such as motor control and rhythm detection. While Hidden Markov Models tend to ignore this information, recurrent neural networks (RNNs) can in principle learn to make use of it. We focus on Long Short-Term Memory (LSTM) because it has been shown to outperform other RNNs on tasks involving long time lags. We find that LSTM augmented by "peephole connections" from its internal cells to its multiplicative gates can learn the fine distinction between sequences of spikes spaced either 50 or 49 time steps apart without the help of any short training exemplars. Without external resets or teacher forcing, our LSTM variant also learns to generate stable streams of precisely timed spikes and other highly nonlinear periodic patterns. This makes LSTM a promising approach for tasks that require the accurate measurement or generation of time intervals.
引用
收藏
页码:115 / 143
页数:29
相关论文
共 24 条
[1]
LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[2]
BENGIO Y, 1995, ADV NEURAL INFORMATI, V7
[3]
A RECURRENT NETWORK FOR MODELING NOISY TEMPORAL SEQUENCES [J].
BULSARI, AB ;
SAXEN, H .
NEUROCOMPUTING, 1995, 7 (01) :29-40
[4]
Cummins F, 1999, P EUROSPEECH 99, P371
[5]
ADAPTIVE NEURAL OSCILLATOR USING CONTINUOUS-TIME BACK-PROPAGATION LEARNING [J].
DOYA, K ;
YOSHIZAWA, S .
NEURAL NETWORKS, 1989, 2 (05) :375-385
[6]
ECK D, 2002, LECT NOTES COMPUTER
[7]
LEARNING THE INITIAL-STATE OF A 2ND-ORDER RECURRENT NEURAL-NETWORK DURING REGULAR-LANGUAGE INFERENCE [J].
FORCADA, ML ;
CARRASCO, RC .
NEURAL COMPUTATION, 1995, 7 (05) :923-930
[8]
Fu-Sheng Tsung, 1995, Advances in Neural Information Processing Systems 7, P481
[9]
Learning to forget: Continual prediction with LSTM [J].
Gers, FA ;
Schmidhuber, J ;
Cummins, F .
NEURAL COMPUTATION, 2000, 12 (10) :2451-2471
[10]
GERS FA, 2001, IEEE T NEURAL NETWOR