USING ADDITIVE NOISE IN BACK-PROPAGATION TRAINING

被引:240
作者
HOLMSTROM, L
KOISTINEN, P
机构
[1] Rolf Nevanlinna Institute, University of Helsinki, 00510 Helsinki
来源
IEEE TRANSACTIONS ON NEURAL NETWORKS | 1992年 / 3卷 / 01期
关键词
D O I
10.1109/72.105415
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We discuss the possibility of improving the generalization capability of a neural network by introducing additive noise to the training samples. The network considered is a feedforward layered neural network trained with the back-propagation algorithm. Back-propagation training is viewed as nonlinear least-squares regression and the additive noise is interpreted as generating a kernel estimate of the probability density that describes the training vector distribution. Two specific application types are considered: pattern classifier networks and estimation of a nonstochastic mapping from data that are corrupted by measurement errors. We do not prove that the introduction of additive noise to the training vectors always improves network generalization. However, our analysis suggests mathematically justified rules for choosing the characteristics of noise if additive noise is used in training. Further, using results of mathematical statistics we establish various asymptotic consistency results for the proposed method. We also report numerical simulations that give support to the applicability of the suggested training method.
引用
收藏
页码:24 / 38
页数:15
相关论文
共 36 条
[1]  
BILLINGLSEY P, 1979, PROBABILITY MEASURE
[2]   ESTIMATION OF A MULTIVARIATE DENSITY [J].
CACOULLOS, T .
ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 1966, 18 (02) :179-+
[3]   CONSISTENT CROSS-VALIDATED DENSITY-ESTIMATION [J].
CHOW, YS ;
GEMAN, S ;
WU, LD .
ANNALS OF STATISTICS, 1983, 11 (01) :25-38
[4]  
Devijver P., 1982, PATTERN RECOGNITION
[5]  
Devroye L., 1987, COURSE DENSITY ESTIM
[6]  
DEVROYE L, 1985, NONPARAMETRIC DENSIT
[7]  
DUIN RPW, 1976, IEEE T COMPUT, V25, P1175, DOI 10.1109/TC.1976.1674577
[8]   LEARNING THE HIDDEN STRUCTURE OF SPEECH [J].
ELMAN, JL ;
ZIPSER, D .
JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, 1988, 83 (04) :1615-1626
[9]   TRAINING WITH NOISE AND THE STORAGE OF CORRELATED PATTERNS IN A NEURAL NETWORK MODEL [J].
GARDNER, EJ ;
STROUD, N ;
WALLACE, DJ .
JOURNAL OF PHYSICS A-MATHEMATICAL AND GENERAL, 1989, 22 (12) :2019-2030
[10]   INFERENCE OF A RULE BY A NEURAL NETWORK WITH THERMAL NOISE [J].
GYORGYI, G .
PHYSICAL REVIEW LETTERS, 1990, 64 (24) :2957-2960