DETERMINING MODEL STRUCTURE FOR NEURAL MODELS BY NETWORK STRIPPING

Cited by: 48
Authors
BHAT, NV
MCAVOY, TJ
Affiliation
[1] Department of Chemical Engineering, University of Maryland, College Park
Funding
U.S. National Science Foundation
DOI
10.1016/0098-1354(92)80047-D
Chinese Library Classification (CLC) number
TP39 [Computer applications]
Subject classification codes
081203; 0835
Abstract
Currently, the backpropagation neural network (BPN) is the most widely used network paradigm for solving chemical engineering problems. Despite its wide usage, there is no definite methodology for determining the network structure for a particular mapping application. The lack of such a procedure has resulted in a tendency to use networks much larger than needed. Such neural models have excessive parameters, or weights, and often memorize the training data, which causes difficulty in extrapolating to unseen data. Hence it is important to use networks that have the simplest possible structure, i.e., the minimum number of weights and nodes. In this paper, a detailed method to strip a BPN to its essential weights and nodes is proposed. The algorithm, called the StripNet algorithm, results in a network of lower complexity in terms of its interconnections. Such networks reduce the risk of overfitting the data and have better generalization properties. The paper also explains how one can probe such a stripped network and gain deeper insight into the knowledge that has been captured. The StripNet algorithm provides a systematic procedure for determining the topology of the network for any application.
Pages: 271-281
Page count: 11
Related papers
22 in total
  • [1] AKAIKE H, 1980, 2ND P INT S INF THEO, P267
  • [2] [Anonymous], 1968, SPECTRAL ANAL ITS AP
  • [3] Barron A.R., 1984, SELF ORG METHODS MOD, P87
  • [4] Barron A.R., 1988, 20TH INTERFACE 88 S, P192
  • [5] USE OF NEURAL NETS FOR DYNAMIC MODELING AND CONTROL OF CHEMICAL PROCESS SYSTEMS
    BHAT, N
    MCAVOY, TJ
    [J]. COMPUTERS & CHEMICAL ENGINEERING, 1990, 14 (4-5) : 573 - 583
  • [6] Box G.E.P., 1976, TIME SERIES ANAL
  • [7] CHATFIELD C, 1975, ANAL TIME SERIES
  • [8] Cybenko G., 1989, Mathematics of Control, Signals, and Systems, V2, P303, DOI 10.1007/BF02551274
  • [9] Goodwin G., 1977, DYNAMIC SYSTEM IDENT
  • [10] HAGIWARA M, 1990, P IJCNN 1990, P1625