XOR has no local minima: A case study in neural network error surface analysis

被引:45
作者
Hamey, LGC [1 ]
机构
[1] Macquarie Univ, Dept Comp, Sydney, NSW 2109, Australia
关键词
feedforward nets; error surface; local minimum; XOR; exclusive-or;
D O I
10.1016/S0893-6080(97)00134-2
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper presents a case study of the analysis of local minima in feedforward neural networks, Firstly, a new methodology for analysis is presented, based upon consideration of trajectories through weight space by which a training algorithm might escape a hypothesized local minimum. This analysis method is then applied to the well known XOR (exclusive-or) problem, which has previously been considered to exhibit local minima, The analysis proves the absence of local minima, eliciting significant aspects of the structure of the error surface. The present work is important for the study of the existence of local minima in feedforward neural networks, and also for the development of training algorithms which avoid or escape entrapment in local minima. (C) 1998 Elsevier Science Ltd. All rights reserved.
引用
收藏
页码:669 / 681
页数:13
相关论文
共 38 条
[21]  
Jose Stephen., 1989, ADV NEURAL INFORM PR, P177, DOI 10.5555/2987061.2987082
[23]   OPTIMIZATION BY SIMULATED ANNEALING [J].
KIRKPATRICK, S ;
GELATT, CD ;
VECCHI, MP .
SCIENCE, 1983, 220 (4598) :671-680
[24]  
Kolen J. F., 1990, Complex Systems, V4, P269
[25]   COMPLETE SOLUTION OF THE LOCAL MINIMA IN THE XOR PROBLEM [J].
LISBOA, PJG ;
PERANTONIS, SJ .
NETWORK-COMPUTATION IN NEURAL SYSTEMS, 1991, 2 (01) :119-124
[26]  
Luenberger D.G., 1984, LINEAR NONLINEAR PRO
[27]   A SCALED CONJUGATE-GRADIENT ALGORITHM FOR FAST SUPERVISED LEARNING [J].
MOLLER, MF .
NEURAL NETWORKS, 1993, 6 (04) :525-533
[28]  
POSTON T, 1991, JUL P IEEE IJCNN91 S, V2, P173
[29]  
Rumelhart D.E., 1987, Parallel Distributed Processing: Explorations in the Microstructure of Cognition, P318
[30]  
Sangiovanni-Vincentelli 1 988]., 1988, Proceedings of The Conference on Neural Information Processing Systems, P40