Advances in feedforward neural networks: Demystifying knowledge acquiring black boxes

被引:160
作者
Looney, CG
机构
[1] Computer Science Department, University of Nevada, Reno
关键词
feedforward neural networks; multilayered perceptrons; architecture; training; backpropagation; adaptive learning rate; pattern recognition;
D O I
10.1109/69.494162
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We survey research of recent years on the supervised training of feedforward neural networks. The goal is to expose how the networks work, how to engineer them so they can learn data with less extraneous noise, how to train them efficiently, and how to assure that the training is valid. The scope covers gradient descent and polynomial line search, from backpropagation through conjugate gradients and quasi-Newton methods. There is a consensus among researchers that adaptive step gains (learning rates) can stabilize and accelerate convergence and that a good starling weight set improves both the training speed and the learning quality. The training problem includes both the design of a network function and the fitting of the function to a set of input and output data points by computing a set of coefficient weights. The form of the function can be adjusted by adjoining new neurons and pruning existing ones and setting other parameters such as biases and exponential rates. Our exposition reveals several useful results that are readily implementable.
引用
收藏
页码:211 / 226
页数:16
相关论文
共 77 条
[1]  
[Anonymous], THESIS STANFORD U
[2]  
[Anonymous], NEURAL NETWORKS CONT
[3]  
[Anonymous], 1989, P INT JOINT C NEUR N, P443
[4]  
[Anonymous], P INT JOINT C NEUR N
[5]  
[Anonymous], P INT JOINT C NEUR N
[6]  
BARNARD E, 1989, P IEEE INNS INT JOIN, V1, P111
[7]   What Size Net Gives Valid Generalization? [J].
Baum, Eric B. ;
Haussler, David .
NEURAL COMPUTATION, 1989, 1 (01) :151-160
[8]   NEURAL-NETWORK DESIGN USING VORONOI DIAGRAMS [J].
BOSE, NK ;
GARGA, AK .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1993, 4 (05) :778-787
[9]  
BOSE NK, 1992, INT JOINT IEEE INNS, V3, P127
[10]  
Burkitt A. N., 1991, Complex Systems, V5, P371