Theoretical and numerical analysis of learning dynamics near singularity in multilayer perceptrons

被引：13

作者：

Guo, Weili ^{[1
]}

Wei, Haikun ^{[1
]}

Zhao, Junsheng ^{[1
,2
]}

Zhang, Kanjian ^{[1
]}

机构：

[1] Southeast Univ, Sch Automat, Key Lab Measurement & Control CSE, Minist Educ, Nanjing 210096, Jiangsu, Peoples R China

[2] Liaocheng Univ, Sch Math Sci, Liaocheng 252059, Peoples R China

来源：

NEUROCOMPUTING | 2015年 / 151卷

基金：

国家自然科学基金重大项目; 中国国家自然科学基金; 高等学校博士学科点专项科研基金;

关键词：

Multilayer perceptrons; Learning dynamics; Theoretical analysis; Numerical analysis; Singularity;

D O I：

10.1016/j.neucom.2014.09.026

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

The multilayer perceptron is one of the most widely used neural networks in applications, however, its learning behavior often becomes very slow, which is due to the singularities in the parameter space. In this paper, we analyze the learning dynamics near singularities in multilayer perceptrons by using traditional methods. We obtain the explicit expressions of the averaged learning equations which play a significant role in theoretical and numerical analysis. After obtaining the best approximation on overlap singularity, the stability of overlap singularity is analyzed. Then we take the numerical analysis on singular regions. Real averaged dynamics near the singularities are obtained in comparison with the theoretical learning trajectories near singularity. In the simulation we analyze the averaged learning dynamics, batch mode learning dynamics and on-line learning dynamics, respectively. (C) 2014 Elsevier B.V. All rights reserved.

引用

页码：390 / 400

页数：11

共 18 条

[1] Amari S, 2001, IEICE T FUND ELECTR, VE84A, P31
[2] AMARI S, 2000, INFORM GEOMETRY
[3] Singularities affect dynamics of learning in neuromanifolds
Amari, Shun-ichi
Park, Hyeyoung
Ozeki, Tomoko
[J]. NEURAL COMPUTATION, 2006, 18 (05) : 1007 - 1065
[4] Dynamics of Learning In Hierarchical Models - Singularity and Milnor Attractor
Amari, Shun-ichi
Ozeki, Tomoko
Cousseau, Florent
Wei, Haikun
[J]. ADVANCES IN COGNITIVE NEURODYNAMICS (II), 2011, : 3 - 9
[5] LEARNING BY ONLINE GRADIENT DESCENT
BIEHL, M
SCHWARZE, H
[J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1995, 28 (03): : 643 - 656
[6] Dynamics of learning in multilayer perceptrons near singularities
Cousseau, Florent
Ozeki, Tomoko
Amari, Shun-ichi
[J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (08): : 1313 - 1328
[7] Local minima and plateaus in hierarchical structures of multilayer perceptrons
Fukumizu, K
Amari, S
[J]. NEURAL NETWORKS, 2000, 13 (03) : 317 - 327
[8] A regularity condition of the information matrix of a multilayer perceptron network
Fukumizu, K
[J]. NEURAL NETWORKS, 1996, 9 (05) : 871 - 879
[9] Online learning dynamics of multilayer perceptrons with unidentifiable parameters
Park, H
Inoue, M
Okada, M
[J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 2003, 36 (47): : 11753 - 11764
[10] Singularity and Slow Convergence of the EM algorithm for Gaussian Mixtures
Park, Hyeyoung
Ozeki, Tomoko
[J]. NEURAL PROCESSING LETTERS, 2009, 29 (01) : 45 - 59

← 1 2 →