LEARNING WITH 1ST, 2ND, AND NO DERIVATIVES - A CASE-STUDY IN HIGH-ENERGY PHYSICS

被引:22
作者
BATTITI, R
TECCHIOLLI, G
机构
[1] UNIV TRENT,DIPARTIMENTO MATEMAT,I-38050 TRENT,ITALY
[2] INFN,COLLEGATO GRP,TRENT,ITALY
[3] IST RIC SCI & TECNOL,I-38050 TRENT,ITALY
关键词
STOCHASTIC OPTIMIZATION; BACKPROPAGATION; MULTILAYER PERCEPTRON; NEURAL NETWORK CLASSIFIERS; EVENT DISCRIMINATION;
D O I
10.1016/0925-2312(94)90054-X
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In this paper different algorithms for training multilayer perceptron architectures are applied to a significant discrimination task in high energy physics. The One-Step Secant technique is compared with on-line backpropagation, the 'Bold Driver' batch version and conjugate gradient methods. In addition, a new algorithm (affine shaker) is proposed that uses stochastic search based on function values and affine transformations of the local search region. Although the affine shaker requires more CPU time to reach the maximum generalization, the technique can be interesting for special-purpose VLSI implementations and for non-differentiable functions.
引用
收藏
页码:181 / 206
页数:26
相关论文
共 33 条
[1]   NEURAL THEORY OF ASSOCIATION AND CONCEPT-FORMATION [J].
AMARI, SI .
BIOLOGICAL CYBERNETICS, 1977, 26 (03) :175-185
[2]  
[Anonymous], P INT NEUR NETW C PA
[3]  
BARNARD E, 1989, CSE89014 OR GRAD I S
[4]   1ST-ORDER AND 2ND-ORDER METHODS FOR LEARNING - BETWEEN STEEPEST DESCENT AND NEWTON METHOD [J].
BATTITI, R .
NEURAL COMPUTATION, 1992, 4 (02) :141-166
[5]  
Battiti R., 1989, COMPLEX SYSTEMS, V3, P331
[6]  
BATTITI R, 1993, UTM421 U TRENT DIP M
[7]  
BISHOP C, 1992, NEURAL COMPUTAT, V4, P949
[8]  
BRUNELLI R, IN PRESS J COMPUTAT
[9]  
BRUNELLI R, 1992, IRST921206 TECH REP
[10]  
BRUNELLI R, 1991, BIOL CYBERN, V65, P501