Sufficient conditions for error back flow convergence in dynamical recurrent neural networks

Cited by: 1
Authors
Aussem, A [1]
Affiliations
[1] Univ Clermont Ferrand 2, ISIMA, LIMOS, F-63173 Aubiere, France
Source
IJCNN 2000: PROCEEDINGS OF THE IEEE-INNS-ENNS INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS, VOL IV | 2000
Keywords
recurrent neural networks; gradient descent; forgetting behavior;
DOI
10.1109/IJCNN.2000.860833
Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
This paper extends previous analysis of the gradient decay to a class of discrete-time fully recurrent networks, called Dynamical Recurrent Neural Networks (DRNN), obtained by modelling synapses as Finite Impulse Response (FIR) filters instead of multiplicative scalars. Using elementary matrix manipulations, we provide an upper bound on the norm of the weight matrix ensuring that the gradient vector, when propagated in a reverse manner in time through the error-propagation network, decays exponentially to zero. This bound applies to all FIR architecture proposals as well as fixed point recurrent networks, regardless of delay and connectivity. In addition, we show that the computational overhead of the learning algorithm can be reduced drastically by taking advantage of the exponential decay of the gradient.
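The kind of condition the abstract describes can be illustrated numerically. The sketch below is not the paper's DRNN bound: it assumes the simpler textbook case of a plain fully recurrent network with tanh units and scalar weights (no FIR synapses), where the back-propagated error obeys delta_{t-1} = diag(tanh'(a_{t-1})) W^T delta_t. Because tanh' never exceeds 1, an induced infinity-norm of W^T below 1 is sufficient for the error to shrink geometrically. The function names, the weight matrix, and the randomly drawn pre-activations are all hypothetical choices made for illustration.

```python
import math
import random


def max_abs_column_sum(W):
    """Induced infinity-norm of W^T, i.e. the largest absolute column sum of W."""
    n = len(W)
    return max(sum(abs(W[j][i]) for j in range(n)) for i in range(n))


def backflow_norms(W, delta, steps, seed=0):
    """Propagate an error vector backward in time and record its infinity-norm.

    Assumption: plain tanh recurrent network, one backward step being
    delta <- diag(tanh'(a)) W^T delta, with illustrative random pre-activations a.
    """
    rng = random.Random(seed)
    n = len(W)
    norms = [max(abs(d) for d in delta)]
    for _ in range(steps):
        # Hypothetical pre-activations; tanh'(a) = 1 - tanh(a)^2 lies in (0, 1].
        a = [rng.uniform(-2.0, 2.0) for _ in range(n)]
        # Matrix-vector product W^T delta.
        back = [sum(W[j][i] * delta[j] for j in range(n)) for i in range(n)]
        delta = [(1.0 - math.tanh(a[i]) ** 2) * back[i] for i in range(n)]
        norms.append(max(abs(d) for d in delta))
    return norms


# Illustrative weight matrix whose absolute column sums are all below 1.
W = [[0.3, -0.2, 0.1],
     [0.1, 0.4, -0.2],
     [-0.2, 0.1, 0.3]]
rho = max_abs_column_sum(W)
norms = backflow_norms(W, [1.0, -1.0, 0.5], steps=20)
# Each recorded norm is bounded by rho**k times the initial norm.
print(rho, norms[-1])
```

Under the same assumption, the geometric envelope rho**k suggests why truncating the backward pass after a fixed number of steps loses little: contributions from earlier time steps are already below any chosen tolerance once k is large enough.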
Pages: 577-582
Page count: 6
Related papers
17 in total
[1]  
[Anonymous], 1997, Neural Comput
[2]   DYNAMICAL RECURRENT NEURAL NETWORKS - TOWARDS ENVIRONMENTAL TIME-SERIES PREDICTION [J].
AUSSEM, A ;
MURTAGH, F ;
SARAZIN, M .
INTERNATIONAL JOURNAL OF NEURAL SYSTEMS, 1995, 6 (02) :145-170
[3]   Dynamical recurrent neural networks towards prediction and modeling of dynamical systems [J].
Aussem, A .
NEUROCOMPUTING, 1999, 28 :207-232
[4]   GRADIENT DESCENT LEARNING ALGORITHM OVERVIEW - A GENERAL DYNAMICAL-SYSTEMS PERSPECTIVE [J].
BALDI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1995, 6 (01) :182-195
[5]   LEARNING LONG-TERM DEPENDENCIES WITH GRADIENT DESCENT IS DIFFICULT [J].
BENGIO, Y ;
SIMARD, P ;
FRASCONI, P .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1994, 5 (02) :157-166
[6]   On-line learning algorithms for locally recurrent neural networks [J].
Campolucci, P ;
Uncini, A ;
Piazza, F ;
Rao, BD .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1999, 10 (02) :253-271
[7]   LOCAL FEEDBACK MULTILAYERED NETWORKS [J].
FRASCONI, P ;
GORI, M ;
SODA, G .
NEURAL COMPUTATION, 1992, 4 (01) :120-130
[8]  
GOUTTE C, 1997, THESIS U PARIS 6
[9]   Learning long-term dependencies in NARX recurrent neural networks [J].
Lin, TN ;
Horne, BG ;
Tino, P ;
Giles, CL .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1996, 7 (06) :1329-1338
[10]   On the improvement of the real time recurrent learning algorithm for recurrent neural networks [J].
Mak, MW ;
Ku, KW ;
Lu, YL .
NEUROCOMPUTING, 1999, 24 (1-3) :13-36