Two highly efficient second-order algorithms for training feedforward networks

被引:118
作者
Ampazis, N [1 ]
Perantonis, SJ [1 ]
机构
[1] Natl Ctr Sci Res Demokritos, Inst Informat & Telecommun, GR-15310 Athens, Greece
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2002年 / 13卷 / 05期
关键词
algorithms; multilayer feedforward neural networks; nonlinear least-squares; optimization; supervised learning;
D O I
10.1109/TNN.2002.1031939
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper, we present two highly efficient second-order algorithms for the training of multilayer feedforward neural networks. The algorithms are based on iterations of the form employed in the Levenberg-Marquardt (LM) method for nonlinear least squares problems with the inclusion of an additional adaptive momentum term arising from the formulation of the training task as a constrained optimization problem. Their implementation requires minimal additional computations compared to a standard LM iteration which are compensated, however, from their excellent convergence properties. Simulations to large scale classical neural-network benchmarks are presented which reveal the power of the two methods to obtain solutions in difficult problems, whereas other standard second-order techniques (including LM) fail to converge.
引用
收藏
页码:1064 / 1074
页数:11
相关论文
共 29 条
[1]   Dynamics of multilayer networks in the vicinity of temporary minima [J].
Ampazis, N ;
Perantonis, SJ ;
Taylor, JG .
NEURAL NETWORKS, 1999, 12 (01) :43-58
[2]  
[Anonymous], 1989, Complex Syst
[3]  
[Anonymous], 2005, NEURAL NETWORKS PATT
[4]   1ST-ORDER AND 2ND-ORDER METHODS FOR LEARNING - BETWEEN STEEPEST DESCENT AND NEWTON METHOD [J].
BATTITI, R .
NEURAL COMPUTATION, 1992, 4 (02) :141-166
[5]  
Dennis J.E., 1996, NUMERICAL METHODS UN
[6]   QUASI-NEWTON METHODS, MOTIVATION AND THEORY [J].
DENNIS, JE ;
MORE, JJ .
SIAM REVIEW, 1977, 19 (01) :46-89
[7]  
FAHLMAN S, P 1988 CONN MOD SUMM, P38
[8]  
Fahlman S. E., 1990, ADV NEURAL INFORMATI, P524, DOI DOI 10.1190/1.1821929
[9]   COMPUTING OPTIMAL LOCALLY CONSTRAINED STEPS [J].
GAY, DM .
SIAM JOURNAL ON SCIENTIFIC AND STATISTICAL COMPUTING, 1981, 2 (02) :186-197
[10]  
GILBRET JC, 1992, SIAM J OPTIMIZ, V2