TRAINING WITH NOISE IS EQUIVALENT TO TIKHONOV REGULARIZATION

Cited by: 945
Author
BISHOP, CM
DOI
10.1162/neco.1995.7.1.108
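Chinese Library Classification
TP18 [Artificial intelligence theory];
Discipline classification codes
081104 [Pattern recognition and intelligent systems]; 0812 [Computer science and technology]; 0835 [Software engineering]; 1405 [Intelligent science and technology];
Abstract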
It is well known that the addition of noise to the input data of a neural network during training can, in some circumstances, lead to significant improvements in generalization performance. Previous work has shown that such training with noise is equivalent to a form of regularization in which an extra term is added to the error function. However, the regularization term, which involves second derivatives of the error function, is not bounded below, and so can lead to difficulties if used directly in a learning algorithm based on error minimization. In this paper we show that for the purposes of network training, the regularization term can be reduced to a positive semi-definite form that involves only first derivatives of the network mapping. For a sum-of-squares error function, the regularization term belongs to the class of generalized Tikhonov regularizers. Direct minimization of the regularized error function provides a practical alternative to training with noise.
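As an illustration of the equivalence described in the abstract, the sketch below uses the simplest possible case, a linear model y(x) = w · x with a sum-of-squares error. This is an assumed toy setup, not the paper's experiments: the data, the weight vector, the noise level `sigma`, and the helper names `noisy_mse` and `tikhonov_mse` are all hypothetical. For this linear case the expected error under zero-mean Gaussian input noise equals the noise-free error plus sigma^2 times the squared norm of the input gradient of the mapping, which here is just sigma^2 * ||w||^2. For nonlinear networks the correspondence is generally expected to hold only approximately, for small noise.

```python
import numpy as np

def noisy_mse(w, X, t, sigma, n_draws, rng):
    """Monte Carlo estimate of the mean squared error when Gaussian
    noise with standard deviation sigma is added to the inputs."""
    total = 0.0
    for _ in range(n_draws):
        noise = rng.normal(0.0, sigma, size=X.shape)
        residual = (X + noise) @ w - t
        total += np.mean(residual ** 2)
    return total / n_draws

def tikhonov_mse(w, X, t, sigma):
    """Noise-free mean squared error plus a first-derivative penalty.
    For y(x) = w . x the gradient of y with respect to x is w itself,
    so the penalty reduces to sigma^2 * ||w||^2."""
    residual = X @ w - t
    return np.mean(residual ** 2) + sigma ** 2 * np.dot(w, w)

# Hypothetical toy data: 200 points in 3 dimensions.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
t = X @ np.array([1.0, 0.0, 1.5]) + 0.1 * rng.normal(size=200)

w = np.array([0.5, -1.0, 2.0])   # arbitrary weights at which to compare
sigma = 0.3                      # assumed input-noise standard deviation

mc = noisy_mse(w, X, t, sigma, n_draws=2000, rng=rng)
exact = tikhonov_mse(w, X, t, sigma)
print("error with input noise (Monte Carlo):", mc)
print("regularized noise-free error:        ", exact)
```

The two printed values should agree up to Monte Carlo sampling error. Note that the penalty is non-negative and uses only first derivatives of the mapping, which is the property the abstract highlights as making direct minimization practical.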
Pages: 108-116
Page count: 9