On-line learning with adaptive back-propagation in two-layer networks

被引:16
作者
West, AHL [1 ]
Saad, D [1 ]
机构
[1] UNIV EDINBURGH,DEPT PHYS,EDINBURGH EH9 3JZ,MIDLOTHIAN,SCOTLAND
来源
PHYSICAL REVIEW E | 1997年 / 56卷 / 03期
关键词
D O I
10.1103/PhysRevE.56.3426
中图分类号
O35 [流体力学]; O53 [等离子体物理学];
学科分类号
070204 ; 080103 ; 080704 ;
摘要
An adaptive back-propagation algorithm parametrized by an inverse temperature beta is studied and compared with gradient descent (standard back-propagation) for on-line learning in two-layer neural networks with an arbitrary number of hidden units. Within a statistical mechanics framework, we analyze these learning algorithms in both the symmetric and the convergence phase for finite learning rates in the case of uncorrelated teachers of similar but arbitrary length T. These analyses show that adaptive back-propagation results generally in faster training by breaking the symmetry between hidden units more efficiently and by providing faster convergence to optimal generalization than gradient descent.
引用
收藏
页码:3426 / 3445
页数:20
相关论文
共 17 条
[1]  
Amari S, 1997, ADV NEUR IN, V9, P127
[2]   Finite-size effects in on-line learning of multilayer neural networks [J].
Barber, D ;
Saad, D ;
Sollich, P .
EUROPHYSICS LETTERS, 1996, 34 (02) :151-156
[3]   ONLINE LEARNING WITH A PERCEPTRON [J].
BIEHL, M ;
RIEGLER, P .
EUROPHYSICS LETTERS, 1994, 28 (07) :525-530
[4]   LEARNING BY ONLINE GRADIENT DESCENT [J].
BIEHL, M ;
SCHWARZE, H .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1995, 28 (03) :643-656
[5]   ONLINE LEARNING IN THE COMMITTEE MACHINE [J].
COPELLI, M ;
CATICHA, N .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1995, 28 (06) :1615-1625
[6]   Equivalence between learning in noisy perceptrons and tree committee machines [J].
Copelli, M ;
Kinouchi, O ;
Caticha, N .
PHYSICAL REVIEW E, 1996, 53 (06) :6341-6352
[7]  
Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274
[8]   On-line Gibbs learning [J].
Kim, JW ;
Sompolinsky, H .
PHYSICAL REVIEW LETTERS, 1996, 76 (16) :3021-3024
[9]   OPTIMAL GENERALIZATION IN PERCEPTRONS [J].
KINOUCHI, O ;
CATICHA, N .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1992, 25 (23) :6243-6250
[10]  
LEEN T, 1994, ADV NEURAL INFORMATI, V6, P477