LEARNING COEFFICIENT DEPENDENCE ON TRAINING SET SIZE

Cited by: 31
Authors
EATON, HAC
OLIVIER, TL
Affiliations
Keywords
ALPHA; BATCH TRAINING; COEFFICIENT; ETA; MOMENTUM; PRIORITY ENCODER; TRAINING RATE; Z-TRANSFORM;
DOI
10.1016/S0893-6080(05)80026-7
Chinese Library Classification (CLC) number
TP18 [Artificial intelligence theory];
Subject classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
A rule for the selection of the learning coefficient, eta, for use in back propagation with batch training of neural networks is presented. The length of the error gradient is shown to increase as more training set examples are presented. This results in slow training or nonconvergence if eta is not decreased as the number of input examples increases. The effect of a momentum term is shown to allow a range of eta values to produce similar training rates. Two networks having identical topology are trained at different tasks, one with few training patterns (16) and one with many (192). Distinctly different values of eta are shown to produce good training for the two networks. We propose selecting eta equal to 1.5 divided by the square root of the sum of the squares of the number of each input pattern type. Any group of similar inputs that map to identical outputs constitutes a pattern type. This rule produces a fixed value of eta that yields rapid training when coupled with a momentum coefficient of 0.9 for a wide variety of networks.
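The selection rule stated in the abstract can be sketched in a few lines of Python. This is only an illustration of the formula as worded there (eta = 1.5 / sqrt(sum of N_i^2), where N_i is the number of training examples of pattern type i), not the authors' code. The function name `select_eta`, its parameters, and the example groupings of patterns into types are assumptions made for illustration; the abstract does not say how the 16 or 192 training patterns were grouped.

```python
import math


def select_eta(pattern_type_counts, numerator=1.5):
    """Return eta = numerator / sqrt(sum of squared pattern-type counts).

    pattern_type_counts: number of training examples in each pattern type
    (a group of similar inputs that map to identical outputs).
    numerator: 1.5, the constant given in the abstract.
    """
    counts = list(pattern_type_counts)
    if not counts:
        raise ValueError("at least one pattern type is required")
    if any(n < 0 for n in counts):
        raise ValueError("pattern-type counts must be non-negative")
    norm = math.sqrt(sum(n * n for n in counts))
    if norm == 0:
        raise ValueError("at least one pattern type must have examples")
    return numerator / norm


# Hypothetical grouping: 16 pattern types with one example each.
eta_small = select_eta([1] * 16)

# Hypothetical grouping: 16 pattern types with 12 examples each (192 total).
eta_large = select_eta([12] * 16)

# Momentum coefficient recommended in the abstract for use with this eta.
MOMENTUM = 0.9

print(eta_small, eta_large)
```

Under these assumed groupings the larger training set receives a much smaller eta, which matches the abstract's observation that eta must decrease as the number of input examples grows.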
Pages: 283-288
Page count: 6