A gradient descent algorithm suitable for training multilayer feedforward networks of processing units with hard-limiting output functions is presented. The conventional back-propagation algorithm cannot be applied to networks whose processing units have hard-limiting input-output characteristics because the required derivatives are not available. However, if the network weights are random variables with smooth distribution functions, the probability that a hard-limiting unit takes one of its two possible output values is a continuously differentiable function. In this paper, we use this property to develop an algorithm similar to back-propagation, but for the hard-limiting case. It is shown that the computational framework of this algorithm is similar to that of standard back-propagation, but there is an additional computational expense involved in estimating the gradients. We give upper bounds on this estimation penalty. Two examples are given which indicate that, when this algorithm is used to train networks of hard-limiting units, its performance is similar to that of conventional back-propagation applied to networks of units with sigmoidal characteristics.
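
To make the key observation concrete, the sketch below (not taken from the paper; the Gaussian perturbation model and the names `prob_fire`, `grad_prob_fire`, and `sigma` are illustrative assumptions) shows why randomizing the weights restores differentiability for a single hard-limiting unit. If the weight vector is w = mu + eps with eps drawn from an isotropic Gaussian, then the pre-activation w^T x is Gaussian, so the probability that the unit outputs 1 is a Gaussian CDF of the scaled mean activation, which is a smooth function of mu with a closed-form gradient, even though the step nonlinearity itself has no useful derivative.

```python
import numpy as np
from scipy.stats import norm

def prob_fire(mu, x, sigma=1.0):
    """P(step(w^T x) = 1) when w = mu + eps, eps ~ N(0, sigma^2 I).

    w^T x is Gaussian with mean mu^T x and standard deviation sigma*||x||,
    so the firing probability is the Gaussian CDF of the scaled activation:
    a smooth function of mu even though step() is not differentiable.
    """
    scale = sigma * np.linalg.norm(x)
    return norm.cdf(mu @ x / scale)

def grad_prob_fire(mu, x, sigma=1.0):
    """Gradient of the firing probability with respect to the mean weights mu."""
    scale = sigma * np.linalg.norm(x)
    return norm.pdf(mu @ x / scale) * x / scale

# Usage: the hard-limited output has zero gradient almost everywhere,
# but the firing probability supplies a usable descent direction for mu.
mu = np.array([0.3, -0.2])
x = np.array([1.0, 2.0])
print(prob_fire(mu, x), grad_prob_fire(mu, x))
```

This single-unit case is only meant to illustrate the differentiability argument; propagating such probabilities through multiple layers is what introduces the additional gradient-estimation cost bounded in the paper.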