A new class of incremental gradient methods for least squares problems

被引:207
作者
Bertsekas, DP [1 ]
机构
[1] MIT, Dept Elect Engn & Comp Sci, Cambridge, MA 02139 USA
关键词
gradient methods; nonlinear programming; least squares; neural networks;
D O I
10.1137/S1052623495287022
中图分类号
O29 [应用数学];
学科分类号
070104 ;
摘要
The least mean squares (LMS) method for linear least squares problems differs from the steepest descent method in that it processes data blocks one-by-one, with intermediate adjustment of the parameter vector under optimization. This mode of operation often leads to faster convergence when far from the eventual limit and to slower (sublinear) convergence when close to the optimal solution. We embed both LMS and steepest descent, as well as other intermediate methods, within a one-parameter class of algorithms, and we propose a hybrid class of methods that combine the faster early convergence rate of LMS with the faster ultimate linear convergence rate of steepest descent. These methods are well suited for neural network training problems with large data sets. Furthermore, these methods allow the effective use of scaling based, for example, on diagonal or other approximations of the Hessian matrix.
引用
收藏
页码:913 / 926
页数:14
相关论文
共 21 条
[1]  
[Anonymous], NONLINEAR PROGRAMMIN
[2]  
[Anonymous], AUTOM REMOTE CONTROL
[3]   THE ITERATED KALMAN SMOOTHER AS A GAUSS-NEWTON METHOD [J].
BELL, BM .
SIAM JOURNAL ON OPTIMIZATION, 1994, 4 (03) :626-636
[4]   Incremental least squares methods and the extended Kalman filter [J].
Bertsekas, DP .
SIAM JOURNAL ON OPTIMIZATION, 1996, 6 (03) :807-822
[5]  
Bertsekas P. Dimitri, 1989, PARALLEL DISTRIBUTED
[6]  
BERTSEKAS PD, 1996, NEURODYNAMIC PROGRAM
[7]   NEW LEAST-SQUARE ALGORITHMS [J].
DAVIDON, WC .
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1976, 18 (02) :187-197
[8]  
GAIVORONSKI AA, 1994, OPTIMIZATION METHODS, V4, P117, DOI DOI 10.1080/10556789408805582
[9]  
GRIPPO L, 1994, OPTIM METHOD SOFTW, V4, P135, DOI 10.1080/10556789408805583.3
[10]  
KUSHNER HJ, 1978, STOCHASTIC APPROXIMA