KICK-OUT LEARNING ALGORITHM TO REDUCE THE OSCILLATION OF WEIGHTS

Cited by: 19
Authors
OCHIAI, K
TODA, N
USUI, S
Affiliations
[1] TOYOHASHI UNIV TECHNOL,DEPT INFORMAT & COMPUT SCI,HIBARIGAOKA TEMPA KU,TOYOHASHI 441,JAPAN
[2] MAIZURU COLL TECHNOL,MAIZURU,JAPAN
Keywords
ACCELERATED LEARNING ALGORITHM; WEIGHTS OSCILLATION; RAVINE; MOMENTUM TERM; CORRECTING TERM; DIFFERENCE OF GRADIENTS;
DOI
10.1016/0893-6080(94)90101-5
Chinese Library Classification number
TP18 [Artificial intelligence theory];
Discipline classification code
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
The back-propagation algorithm, when used with an unmodified gradient descent term, converges very slowly, because the weights oscillate in regions where the error surface forms a ravine. To improve the convergence, the momentum term was introduced. However, the effect of that term on the reduction of oscillations has been insufficiently considered. In this paper, we point out that this term has not been effective in reducing the oscillation. To overcome the oscillations, we focus on the very bottom of a ravine where the direction of steepest descent is the same as the downward direction along the ravine bottom. We describe a method to correct the value of the weights near the bottom of a ravine and propose a new acceleration algorithm based on that correction. The distinctive feature is the correction term that uses the difference of gradients that is invoked during the oscillation. We show that, using the proposed algorithm, the convergence speed is substantially improved in ravine regions.
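The oscillation described in the abstract can be reproduced on a toy problem. The sketch below is an illustration only, not the paper's algorithm: the abstract does not state the correction formula, so the code substitutes a plain secant step that uses the difference of two consecutive gradients to jump to the point along the last update where the gradient component in that direction vanishes. The quadratic error surface, the function names (`grad`, `loss`, `descend`), the learning rate, and the oscillation test (a negative inner product of consecutive gradients) are all assumptions made for the example.

```python
def grad(w):
    # Gradient of a toy ravine f(x, y) = 0.5 * (x**2 + 100 * y**2):
    # shallow along x (the ravine bottom), steep along y (the walls).
    x, y = w
    return (x, 100.0 * y)


def loss(w):
    x, y = w
    return 0.5 * (x * x + 100.0 * y * y)


def dot(a, b):
    return a[0] * b[0] + a[1] * b[1]


def descend(w, lr, steps, correct):
    """Plain gradient descent; if `correct` is True, apply an assumed
    secant correction whenever consecutive gradients point in opposing
    directions (taken here as the sign of an oscillation)."""
    w_prev, g_prev = None, None
    for _ in range(steps):
        g = grad(w)
        if correct and g_prev is not None and dot(g, g_prev) < 0.0:
            d = (w[0] - w_prev[0], w[1] - w_prev[1])    # last weight update
            dg = (g[0] - g_prev[0], g[1] - g_prev[1])   # difference of gradients
            denom = dot(dg, d)
            if denom > 0.0:
                # Secant estimate of where the gradient along d becomes zero.
                t = -dot(g_prev, d) / denom
                w = (w_prev[0] + t * d[0], w_prev[1] + t * d[1])
                g = grad(w)
        w_prev, g_prev = w, g
        w = (w[0] - lr * g[0], w[1] - lr * g[1])
    return w


# lr is chosen just below the stability limit 2/100 of the steep direction,
# so uncorrected descent bounces between the ravine walls.
plain = descend((10.0, 1.0), 0.0199, 50, correct=False)
kicked = descend((10.0, 1.0), 0.0199, 50, correct=True)
print(loss(plain), loss(kicked))
```

With these assumed settings the uncorrected run keeps a large component across the ravine after 50 steps, while the corrected run lands near the ravine bottom after a single correction and then proceeds along it. On a real network the error surface is not quadratic, so the secant step would only be approximate.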
Pages: 797-807
Page count: 11