Connection pruning with static and adaptive pruning schedules

Cited by: 23
Author
Prechelt, L
Affiliation
[1] Fakultät für Informatik, Universität Karlsruhe
Keywords
empirical study; pruning; early stopping; generalization
DOI
10.1016/S0925-2312(96)00054-9
Chinese Library Classification
TP18 [Artificial intelligence theory]
Discipline classification codes
081104 ; 0812 ; 0835 ; 1405 ;
Abstract
Neural network pruning methods on the level of individual network parameters (e.g. connection weights) can improve generalization, as is shown in this empirical study. However, an open problem in the pruning methods known today (e.g. OBD, OBS, autoprune, epsiprune) is the selection of the number of parameters to be removed in each pruning step (pruning strength). This work presents a pruning method Iprune that automatically adapts the pruning strength to the evolution of weights and loss of generalization during training. The method requires no algorithm parameter adjustment by the user. Results of statistical significance tests comparing autoprune, Iprune, and static networks with early stopping are given, based on extensive experimentation with 14 different problems. The results indicate that training with pruning is often significantly better and rarely significantly worse than training with early stopping without pruning. Furthermore, Iprune is often superior to autoprune (which is superior to OBD) on diagnosis tasks unless severe pruning early in the training process is required.
Pages: 49-61
Number of pages: 13