The layer-wise method and the backpropagation hybrid approach to learning a feedforward neural network

被引:26
作者
Rubanov, NS [1 ]
机构
[1] Belarusian State Univ, Radiophys Dept, Minsk 220050, BELARUS
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 2000年 / 11卷 / 02期
关键词
feedforward neural network; generalization error; layer-wise learning method; learning time; second-order methods;
D O I
10.1109/72.839001
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Feedforward neural networks (FNN's) have been proposed to solve complex problems in pattern recognition and classification and function approximation. Despite the general success of learning methods for FNN's, such as the backpropagation (BP) algorithm, second-order optimization algorithms and layer-wise learning algorithms, several drawbacks remain to be overcome. In particular, two major drawbacks are convergence to a local minima and long learning time. In this paper we propose an efficient learning method for a FNN that combines the BP strategy and optimization layer by layer. More precisely, we construct the layer-wise optimization method using the Taylor series expansion of nonlinear operators describing a FNN and propose to update weights of each layer by the BP-based Kaczmarz iterative procedure. The experimental results show that the new learning algorithm is stable, it reduces the learning time and demonstrates improvement of generalization results in comparison with other well-known methods.
引用
收藏
页码:295 / 305
页数:11
相关论文
共 26 条
[1]   FAST LEARNING-PROCESS OF MULTILAYER NEURAL NETWORKS USING RECURSIVE LEAST-SQUARES METHOD [J].
AZIMISADJADI, MR ;
LIOU, RJ .
IEEE TRANSACTIONS ON SIGNAL PROCESSING, 1992, 40 (02) :446-450
[2]  
BAERMANN F, 1992, NEURAL NETWORKS, V5
[3]  
BAUSCHKE H, SIAM REV, V38, P367
[4]   NONLINEAR-SYSTEM IDENTIFICATION USING NEURAL NETWORKS [J].
CHEN, S ;
BILLINGS, SA ;
GRANT, PM .
INTERNATIONAL JOURNAL OF CONTROL, 1990, 51 (06) :1191-1214
[5]  
ELLACOT SW, 1994, ACTA NUMERICA
[6]   AN ACCELERATED LEARNING ALGORITHM FOR MULTILAYER PERCEPTRONS - OPTIMIZATION LAYER-BY-LAYER [J].
ERGEZINGER, S ;
THOMSEN, E .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1995, 6 (01) :31-42
[7]  
Fahlman S.E., 1988, An Empirical Study of Learning Speed in Back-Propagation Networks
[8]  
Goodwin G C., 1984, ADAPTIVE FILTERING P
[9]   ANALYSIS OF HIDDEN UNITS IN A LAYERED NETWORK TRAINED TO CLASSIFY SONAR TARGETS [J].
GORMAN, RP ;
SEJNOWSKI, TJ .
NEURAL NETWORKS, 1988, 1 (01) :75-89
[10]   Neural networks expand SP's horizons [J].
Haykin, S .
IEEE SIGNAL PROCESSING MAGAZINE, 1996, 13 (02) :24-49