Theoretical and numerical analysis of learning dynamics near singularity in multilayer perceptrons

被引:13
作者
Guo, Weili [1 ]
Wei, Haikun [1 ]
Zhao, Junsheng [1 ,2 ]
Zhang, Kanjian [1 ]
机构
[1] Southeast Univ, Sch Automat, Key Lab Measurement & Control CSE, Minist Educ, Nanjing 210096, Jiangsu, Peoples R China
[2] Liaocheng Univ, Sch Math Sci, Liaocheng 252059, Peoples R China
基金
国家自然科学基金重大项目; 中国国家自然科学基金; 高等学校博士学科点专项科研基金;
关键词
Multilayer perceptrons; Learning dynamics; Theoretical analysis; Numerical analysis; Singularity;
D O I
10.1016/j.neucom.2014.09.026
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The multilayer perceptron is one of the most widely used neural networks in applications, however, its learning behavior often becomes very slow, which is due to the singularities in the parameter space. In this paper, we analyze the learning dynamics near singularities in multilayer perceptrons by using traditional methods. We obtain the explicit expressions of the averaged learning equations which play a significant role in theoretical and numerical analysis. After obtaining the best approximation on overlap singularity, the stability of overlap singularity is analyzed. Then we take the numerical analysis on singular regions. Real averaged dynamics near the singularities are obtained in comparison with the theoretical learning trajectories near singularity. In the simulation we analyze the averaged learning dynamics, batch mode learning dynamics and on-line learning dynamics, respectively. (C) 2014 Elsevier B.V. All rights reserved.
引用
收藏
页码:390 / 400
页数:11
相关论文
共 18 条
  • [1] Amari S, 2001, IEICE T FUND ELECTR, VE84A, P31
  • [2] AMARI S, 2000, INFORM GEOMETRY
  • [3] Singularities affect dynamics of learning in neuromanifolds
    Amari, Shun-ichi
    Park, Hyeyoung
    Ozeki, Tomoko
    [J]. NEURAL COMPUTATION, 2006, 18 (05) : 1007 - 1065
  • [4] Dynamics of Learning In Hierarchical Models - Singularity and Milnor Attractor
    Amari, Shun-ichi
    Ozeki, Tomoko
    Cousseau, Florent
    Wei, Haikun
    [J]. ADVANCES IN COGNITIVE NEURODYNAMICS (II), 2011, : 3 - 9
  • [5] LEARNING BY ONLINE GRADIENT DESCENT
    BIEHL, M
    SCHWARZE, H
    [J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1995, 28 (03): : 643 - 656
  • [6] Dynamics of learning in multilayer perceptrons near singularities
    Cousseau, Florent
    Ozeki, Tomoko
    Amari, Shun-ichi
    [J]. IEEE TRANSACTIONS ON NEURAL NETWORKS, 2008, 19 (08): : 1313 - 1328
  • [7] Local minima and plateaus in hierarchical structures of multilayer perceptrons
    Fukumizu, K
    Amari, S
    [J]. NEURAL NETWORKS, 2000, 13 (03) : 317 - 327
  • [8] A regularity condition of the information matrix of a multilayer perceptron network
    Fukumizu, K
    [J]. NEURAL NETWORKS, 1996, 9 (05) : 871 - 879
  • [9] Online learning dynamics of multilayer perceptrons with unidentifiable parameters
    Park, H
    Inoue, M
    Okada, M
    [J]. JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 2003, 36 (47): : 11753 - 11764
  • [10] Singularity and Slow Convergence of the EM algorithm for Gaussian Mixtures
    Park, Hyeyoung
    Ozeki, Tomoko
    [J]. NEURAL PROCESSING LETTERS, 2009, 29 (01) : 45 - 59