Dynamics of multilayer networks in the vicinity of temporary minima

被引:24
作者
Ampazis, N
Perantonis, SJ
Taylor, JG [1 ]
机构
[1] Univ London Kings Coll, Dept Math, London WC2R 2LS, England
[2] Natl Res Ctr Demokritos, Inst Informat & Telecommun, Athens, Greece
关键词
feed-forward neural networks; supervised learning; back-propagation; temporary minima; dynamical systems; Jacobian matrix; eigenvalues; constrained optimization;
D O I
10.1016/S0893-6080(98)00103-8
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
A dynamical system model is derived for a single-output, two-layer neural network, which learns according to the back-propagation algorithm. Particular emphasis is placed on the analysis of the occurrence of temporary minima. The Jacobian matrix of the system is derived, whose eigenvalues characterize the evolution of learning. Temporary minima correspond to critical points of the phase plane trajectories, and the bifurcation of the Jacobian matrix eigenvalues signifies their abandonment. Following this analysis, we show that the employment of constrained optimization methods can decrease the time spent in the vicinity of this type of minima. A number of numerical results illustrates the analytical conclusions. (C) 1999 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:43 / 58
页数:16
相关论文
共 25 条
[1]
LEARNING BY ONLINE GRADIENT DESCENT [J].
BIEHL, M ;
SCHWARZE, H .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1995, 28 (03) :643-656
[2]
Boyce W. E., 2012, Elementary Differential Equations and Boundary Problems
[3]
EARL A., 1955, THEORY ORDINARY DIFF
[4]
Fahlman S.E., 1988, Proceedings of the 1988 Connectionist Models Summer School, P38
[5]
Gelb A, 1968, Multiple-input describing functions and nonlinear system design
[6]
The new ERA in supervised learning [J].
Gorse, D ;
Shepherd, AJ ;
Taylor, JG .
NEURAL NETWORKS, 1997, 10 (02) :343-352
[7]
GORSE D, 1993, NEURAL NETWORK WORLD, V5, P503
[8]
Graham D., 1961, ANAL NONLINEAR CONTR
[9]
ANALYSIS OF GRADIENT DESCENT LEARNING ALGORITHMS FOR MULTILAYER FEEDFORWARD NEURAL NETWORKS [J].
GUO, H ;
GELFAND, SB .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS, 1991, 38 (08) :883-894
[10]
INCREASED RATES OF CONVERGENCE THROUGH LEARNING RATE ADAPTATION [J].
JACOBS, RA .
NEURAL NETWORKS, 1988, 1 (04) :295-307