Advanced neural-network training algorithm with reduced complexity based on Jacobian deficiency

被引:45
作者
Zhou, G [1 ]
Si, J [1 ]
机构
[1] Arizona State Univ, Dept Elect Engn, Tempe, AZ 85287 USA
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 1998年 / 9卷 / 03期
基金
美国国家科学基金会;
关键词
Gauss-Newton method; Jacobian rank deficiency; neural-network training; subset updating; trust region algorithms;
D O I
10.1109/72.668886
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper we introduce an advanced supervised training method for neural networks. It is based on Jacobian rank deficiency and it is formulated, in some sense, in the spirit of the Gauss-Newton algorithm. The Levenberg-Marquardt algorithm, as a modified Gauss-Newton, has been used successfully in solving nonlinear least squares problems including neural-network training. It outperforms (in terms of training accuracy, convergence properties, overall training time, etc.) the basic backpropagation and its variations with variable learning rate significantly, however, with higher computation and memory complexities within each iteration. The new method developed in this paper is aiming at improving convergence properties, while reducing the memory and computation complexities in supervised training of neural networks. Extensive simulation results are provided to demonstrate the superior performance of the new algorithm over the Levenberg-Marquardt algorithm.
引用
收藏
页码:448 / 453
页数:6
相关论文
共 11 条
[1]   1ST-ORDER AND 2ND-ORDER METHODS FOR LEARNING - BETWEEN STEEPEST DESCENT AND NEWTON METHOD [J].
BATTITI, R .
NEURAL COMPUTATION, 1992, 4 (02) :141-166
[2]  
DENNIS JE, 1983, NUMERICAL METHODS UN
[3]  
DONGARRA JJ, 1979, LINPACK USERS GUIDE
[4]  
HAGEN MT, 1994, IEEE T NEURAL NETWOR, V5, P989
[5]  
Hertz J., 1991, Introduction to the Theory of Neural Computation
[6]   AN ADAPTIVE LEAST-SQUARES ALGORITHM FOR THE EFFICIENT TRAINING OF ARTIFICIAL NEURAL NETWORKS [J].
KOLLIAS, S ;
ANASTASSIOU, D .
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS, 1989, 36 (08) :1092-1101
[7]  
Luyben W.L., 1990, PROCESS MODELING SIM, V2nd
[8]  
Narendra K S, 1990, IEEE Trans Neural Netw, V1, P4, DOI 10.1109/72.80202
[9]  
SAARINEN S, 1991, 1089 CRSD U ILL CTR
[10]   NONSTANDARD SCALING MATRICES FOR TRUST REGION GAUSS-NEWTON METHODS [J].
SCHWETLICK, H ;
TILLER, V .
SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING, 1989, 10 (04) :654-670