A new weight initialization method for the MLP with the BP in multiclass classification problems

被引:13
作者
Kim, MC [1 ]
Choi, CH [1 ]
机构
[1] SEOUL NATL UNIV,ASRI,SCH ELECT ENGN,ERC ACI,SEOUL 151742,SOUTH KOREA
关键词
BP; initial learning process; MLP; multiclass classification problems; random weight initialization; weight initialization;
D O I
10.1023/A:1009680422241
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Initial learning process of the BP, which can influence the performance of learning in multiclass classification problems, is analyzed. Also, the weights decreasing phenomena in the initial stage of learning are investigated. On the basis of this analysis, a new initialization method is proposed. The proposed method minimizes the initial objective function. It eliminates the phenomenon that weights decrease in the beginning of learning. Several simulation results show that the proposed initialization method performs much better than the conventional random initialization method in the batch mode and slightly better in the pattern mode. Since it requires only a little additional computation, it is a strong alternative to the conventional random initialization. It is expected that the proposed initialization method can be used with any accelerated learning algorithm to enhance the learning speed.
引用
收藏
页码:11 / 23
页数:13
相关论文
共 20 条
[1]   EFFICIENT CLASSIFICATION FOR MULTICLASS PROBLEMS USING MODULAR NEURAL NETWORKS [J].
ANAND, R ;
MEHROTRA, K ;
MOHAN, CK ;
RANKA, S .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1995, 6 (01) :117-124
[2]  
[Anonymous], IEEE
[3]  
[Anonymous], 1988, P 1988 CONN MOD SUMM, DOI DOI 10.13140/2.1.3459.2329
[4]  
[Anonymous], INTRO THEORY NEURAL
[5]  
[Anonymous], P INT JOINT C NEUR N
[6]  
ATIYA AF, 1992, P INT JOINT C NEUR N, V3, P925
[7]  
BECKER S, 1988, 1988 P CONN MOD SUMM, P29
[8]   AN ACCELERATED LEARNING ALGORITHM FOR MULTILAYER PERCEPTRONS - OPTIMIZATION LAYER-BY-LAYER [J].
ERGEZINGER, S ;
THOMSEN, E .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1995, 6 (01) :31-42
[9]  
JACOBS RA, 1988, NEURAL NETWORKS, V1, P325
[10]  
KIM MC, 1994, P ICONIP 94, V2, P761