ON THE GEOMETRY OF FEEDFORWARD NEURAL-NETWORK ERROR SURFACES

被引:63
作者
CHEN, AM [1 ]
LU, HM [1 ]
HECHTNIELSEN, R [1 ]
机构
[1] HNC INC,SAN DIEGO,CA
关键词
D O I
10.1162/neco.1993.5.6.910
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Many feedforward neural network architectures have the property that their overall input-output function is unchanged by certain weight permutations and sign flips. In this paper, the geometric structure of these equioutput weight space transformations is explored for the case of multilayer perceptron networks with tanh activation functions (similar results hold for many other types of neural networks). It is shown that these transformations form an algebraic group isomorphic to a direct product of Weyl groups. Results concerning the root spaces of the Lie algebras associated with these Weyl groups are then used to derive sets of simple equations for minimal sufficient search sets in weight space. These sets, which take the geometric forms of a wedge and a cone, occupy only a minute fraction of the volume of weight space. A separate analysis shows that large numbers of copies of a network performance function optimum weight vector are created by the action of the equioutput transformation group and that these copies all lie on the same sphere. Some implications of these results for learning are discussed.
引用
收藏
页码:910 / 927
页数:18
相关论文
共 22 条
  • [1] BACHMAN G, 1966, FUNCTIONAL ANAL
  • [2] SELF-ORGANIZING NEURAL NETWORK THAT DISCOVERS SURFACES IN RANDOM-DOT STEREOGRAMS
    BECKER, S
    HINTON, GE
    [J]. NATURE, 1992, 355 (6356) : 161 - 163
  • [3] Improving the Generalization Properties of Radial Basis Function Neural Networks
    Bishop, Chris
    [J]. NEURAL COMPUTATION, 1991, 3 (04) : 579 - 588
  • [4] BISHOP CM, 1990, P INT NEURAL NETWORK, V2, P749
  • [5] Broomhead D. S., 1988, Complex Systems, V2, P321
  • [6] ARTMAP - SUPERVISED REAL-TIME LEARNING AND CLASSIFICATION OF NONSTATIONARY DATA BY A SELF-ORGANIZING NEURAL NETWORK
    CARPENTER, GA
    GROSSBERG, S
    REYNOLDS, JH
    [J]. NEURAL NETWORKS, 1991, 4 (05) : 565 - 588
  • [7] CHEN AM, 1991, 2ND IEE INT C ART NE, P1
  • [8] Layered Neural Networks with Gaussian Hidden Units as Universal Approximations
    Hartman, Eric J.
    Keeler, James D.
    Kowalski, Jacek M.
    [J]. NEURAL COMPUTATION, 1990, 2 (02) : 210 - 215
  • [9] Hecht-Nielsen R., 1992, NEURAL NETWORKS PERC, P65
  • [10] Hecht-Nielsen R., 1991, NEUROCOMPUTING