ON THE GEOMETRY OF FEEDFORWARD NEURAL-NETWORK ERROR SURFACES

被引：63

作者：

CHEN, AM ^{[1
]}

LU, HM ^{[1
]}

HECHTNIELSEN, R ^{[1
]}

机构：

[1] HNC INC,SAN DIEGO,CA

来源：

NEURAL COMPUTATION | 1993年 / 5卷 / 06期

关键词：

D O I：

10.1162/neco.1993.5.6.910

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 ; 0812 ; 0835 ; 1405 ;

摘要：

Many feedforward neural network architectures have the property that their overall input-output function is unchanged by certain weight permutations and sign flips. In this paper, the geometric structure of these equioutput weight space transformations is explored for the case of multilayer perceptron networks with tanh activation functions (similar results hold for many other types of neural networks). It is shown that these transformations form an algebraic group isomorphic to a direct product of Weyl groups. Results concerning the root spaces of the Lie algebras associated with these Weyl groups are then used to derive sets of simple equations for minimal sufficient search sets in weight space. These sets, which take the geometric forms of a wedge and a cone, occupy only a minute fraction of the volume of weight space. A separate analysis shows that large numbers of copies of a network performance function optimum weight vector are created by the action of the equioutput transformation group and that these copies all lie on the same sphere. Some implications of these results for learning are discussed.

引用

页码：910 / 927

页数：18

共 22 条

[1] BACHMAN G, 1966, FUNCTIONAL ANAL
[2] SELF-ORGANIZING NEURAL NETWORK THAT DISCOVERS SURFACES IN RANDOM-DOT STEREOGRAMS
BECKER, S
HINTON, GE
[J]. NATURE, 1992, 355 (6356) : 161 - 163
[3] Improving the Generalization Properties of Radial Basis Function Neural Networks
Bishop, Chris
[J]. NEURAL COMPUTATION, 1991, 3 (04) : 579 - 588
[4] BISHOP CM, 1990, P INT NEURAL NETWORK, V2, P749
[5] Broomhead D. S., 1988, Complex Systems, V2, P321
[6] ARTMAP - SUPERVISED REAL-TIME LEARNING AND CLASSIFICATION OF NONSTATIONARY DATA BY A SELF-ORGANIZING NEURAL NETWORK
CARPENTER, GA
GROSSBERG, S
REYNOLDS, JH
[J]. NEURAL NETWORKS, 1991, 4 (05) : 565 - 588
[7] CHEN AM, 1991, 2ND IEE INT C ART NE, P1
[8] Layered Neural Networks with Gaussian Hidden Units as Universal Approximations
Hartman, Eric J.
Keeler, James D.
Kowalski, Jacek M.
[J]. NEURAL COMPUTATION, 1990, 2 (02) : 210 - 215
[9] Hecht-Nielsen R., 1992, NEURAL NETWORKS PERC, P65
[10] Hecht-Nielsen R., 1991, NEUROCOMPUTING

← 1 2 3 →