Finite-size effects in on-line learning of multilayer neural networks

被引:11
作者
Barber, D [1 ]
Saad, D [1 ]
Sollich, P [1 ]
机构
[1] UNIV EDINBURGH,DEPT PHYS,EDINBURGH EH9 3JZ,MIDLOTHIAN,SCOTLAND
来源
EUROPHYSICS LETTERS | 1996年 / 34卷 / 02期
关键词
D O I
10.1209/epl/i1996-00431-5
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
We complement recent advances in thermodynamic limit analyses of mean on-line gradient descent learning dynamics in multilayer networks by calculating fluctuations possessed by finite-dimensional systems. Fluctuations from the mean dynamics are largest at the onset of specialisation as student hidden unit weight vectors begin to imitate specific teacher vectors, increasing with the degree of symmetry of the initial conditions. In light of this, we include a term to stimulate asymmetry in the learning process, which typically also leads to a significant decrease in training time.
引用
收藏
页码:151 / 156
页数:6
相关论文
共 6 条
[1]   LEARNING BY ONLINE GRADIENT DESCENT [J].
BIEHL, M ;
SCHWARZE, H .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1995, 28 (03) :643-656
[2]   ONLINE LEARNING IN THE COMMITTEE MACHINE [J].
COPELLI, M ;
CATICHA, N .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1995, 28 (06) :1615-1625
[3]   ON FOKKER-PLANCK APPROXIMATIONS OF ONLINE LEARNING-PROCESSES [J].
HESKES, T .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1994, 27 (15) :5145-5160
[4]   ONLINE LEARNING IN SOFT COMMITTEE MACHINES [J].
SAAD, D ;
SOLLA, SA .
PHYSICAL REVIEW E, 1995, 52 (04) :4225-4243
[5]   FINITE-SIZE EFFECTS IN LEARNING AND GENERALIZATION IN LINEAR PERCEPTRONS [J].
SOLLICH, P .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1994, 27 (23) :7771-7784
[6]  
Van Kampen N. G., 1992, STOCHASTIC PROCESSES