XOR has no local minima: A case study in neural network error surface analysis

被引:45
作者
Hamey, LGC [1 ]
机构
[1] Macquarie Univ, Dept Comp, Sydney, NSW 2109, Australia
关键词
feedforward nets; error surface; local minimum; XOR; exclusive-or;
D O I
10.1016/S0893-6080(97)00134-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a case study of the analysis of local minima in feedforward neural networks, Firstly, a new methodology for analysis is presented, based upon consideration of trajectories through weight space by which a training algorithm might escape a hypothesized local minimum. This analysis method is then applied to the well known XOR (exclusive-or) problem, which has previously been considered to exhibit local minima, The analysis proves the absence of local minima, eliciting significant aspects of the structure of the error surface. The present work is important for the study of the existence of local minima in feedforward neural networks, and also for the development of training algorithms which avoid or escape entrapment in local minima. (C) 1998 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:669 / 681
页数:13
相关论文
共 38 条
  • [1] [Anonymous], 1991, NEURAL COMPUTATION B
  • [2] AUER P, 1996, NCTR96030 NEUR U LON
  • [3] NEURAL NETWORKS AND PRINCIPAL COMPONENT ANALYSIS - LEARNING FROM EXAMPLES WITHOUT LOCAL MINIMA
    BALDI, P
    HORNIK, K
    [J]. NEURAL NETWORKS, 1989, 2 (01) : 53 - 58
  • [4] Approximation of Boolean Functions by Sigmoidal Networks: Part I: XOR and Other Two-Variable Functions
    Blum, E. K.
    [J]. NEURAL COMPUTATION, 1989, 1 (04) : 532 - 540
  • [5] BACK PROPAGATION FAILS TO SEPARATE WHERE PERCEPTRONS SUCCEED
    BRADY, ML
    RAGHAVAN, R
    SLAWNY, J
    [J]. IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS, 1989, 36 (05): : 665 - 674
  • [6] TERMINAL REPELLER UNCONSTRAINED SUBENERGY TUNNELING (TRUST) FOR FASTGLOBAL OPTIMIZATION
    CETIN, BC
    BARHEN, J
    BURDICK, JW
    [J]. JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1993, 77 (01) : 97 - 126
  • [7] CETIN BC, 1993, P IEEE INT C NEUR NE, V2, P836
  • [8] CHAUVIN Y, 1990, ADV NEURAL INFORMATI, V2, P642
  • [9] Chauvin Y., 1989, ADV NEURAL INFORMATI, P519
  • [10] DARKEN C, 1992, ADV NEUR IN, V4, P1009