Globally optimal parameters for on-line learning in multilayer neural networks

被引:31
作者
Saad, D
Rattray, M
机构
[1] Department of Computer Science and Applied Mathematics, Aston University, Birmingham
关键词
D O I
10.1103/PhysRevLett.79.2578
中图分类号
O4 [物理学];
学科分类号
0702 ;
摘要
We present a framework for calculating globally optimal parameters, within a given time frame, for on-line learning in multilayer neural networks. We demonstrate the capability of this method by computing optimal learning rates in typical learning scenarios. A similar treatment allows one to determine the relevance of related training algorithms based on modifications to the basic gradient descent rule as well as to compare different training methods.
引用
收藏
页码:2578 / 2581
页数:4
相关论文
共 10 条
[1]   LEARNING BY ONLINE GRADIENT DESCENT [J].
BIEHL, M ;
SCHWARZE, H .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1995, 28 (03) :643-656
[2]  
Bishop C. M., 1995, Neural networks for pattern recognition
[3]  
Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274
[4]   OPTIMAL GENERALIZATION IN PERCEPTRONS [J].
KINOUCHI, O ;
CATICHA, N .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1992, 25 (23) :6243-6250
[5]  
RATTRAY M, UNPUB
[6]  
RIGLER P, 1995, J PHYS A, V28, pL507
[7]   EXACT SOLUTION FOR ONLINE LEARNING IN MULTILAYER NEURAL NETWORKS [J].
SAAD, D ;
SOLLA, SA .
PHYSICAL REVIEW LETTERS, 1995, 74 (21) :4337-4340
[8]   ONLINE LEARNING IN SOFT COMMITTEE MACHINES [J].
SAAD, D ;
SOLLA, SA .
PHYSICAL REVIEW E, 1995, 52 (04) :4225-4243
[9]   On-line learning with adaptive back-propagation in two-layer networks [J].
West, AHL ;
Saad, D .
PHYSICAL REVIEW E, 1997, 56 (03) :3426-3445
[10]   Learning in Artificial Neural Networks: A Statistical Perspective [J].
White, Halbert .
NEURAL COMPUTATION, 1989, 1 (04) :425-464