An iterative pruning algorithm for feedforward neural networks

被引:204
作者
Castellano, G
Fanelli, AM
Pelillo, M
机构
[1] UNIV BARI, DIPARTIMENTO INFORMAT, I-70126 BARI, ITALY
[2] UNIV CA FOSCARI VENEZIA, DIPARTIMENTO MATEMAT APPLICATA & INFORMAT, I-30173 VENICE, ITALY
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 1997年 / 8卷 / 03期
关键词
feedforward neural networks; generalization; hidden neurons; iterative methods; least-squares methods; network pruning; pattern recognition; structure simplification;
D O I
10.1109/72.572092
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The problem of determining the proper size of an artificial neural network is recognized to be crucial, especially for its practical implications in such important issues as learning and generalization, One popular approach tackling this problem is commonly known as pruning and consists of training a larger than necessary network and then removing unnecessary weights/nodes, In this paper, a new pruning method is developed, based on the idea of iteratively eliminating units and adjusting the remaining weights in such a way that the network performance does not worsen over the entire training set, The pruning problem is formulated in terms of solving a system of linear equations, and a very efficient conjugate gradient algorithm is used for solving it, in the least-squares sense, The algorithm also provides a simple criterion for choosing the units to be removed, which has proved to work well in practice. The results obtained over various test problems demonstrate the effectiveness of the proposed approach.
引用
收藏
页码:519 / 531
页数:13
相关论文
共 55 条
[11]  
Chauvin Y., 1990, Neural Networks. EURASIP Workshop 1990 Proceedings, P46
[12]  
Chauvin Y., 1989, ADV NEURAL INFORMATI, P519
[13]  
Chung F. L., 1992, International Journal of Neural Systems, V3, P301, DOI 10.1142/S0129065792000231
[14]  
Denker J., 1987, Complex Systems, V1, P877
[15]   SURVEY OF SPARSE-MATRIX RESEARCH [J].
DUFF, IS .
PROCEEDINGS OF THE IEEE, 1977, 65 (04) :500-535
[16]   DETERMINING AND IMPROVING THE FAULT-TOLERANCE OF MULTILAYER PERCEPTRONS IN A PATTERN-RECOGNITION APPLICATION [J].
EMMERSON, MD ;
DAMPER, RI .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1993, 4 (05) :788-793
[17]  
Fahlman S., 1990, ADV NEURAL INFORMATI, V2, P524
[18]  
Gallant S. I., 1986, Eighth International Conference on Pattern Recognition. Proceedings (Cat. No.86CH2342-4), P849
[19]  
Golub GH, 1989, MATRIX COMPUTATIONS
[20]  
Hassibi Babak, 1992, P ADV NEUR INF PROC, V5