A PRACTICAL BAYESIAN FRAMEWORK FOR BACKPROPAGATION NETWORKS

被引:2755
作者
MACKAY, DJC [1 ]
机构
[1] CALTECH 139-74, COMPUTAT & NEURAL SYST, PASADENA, CA 91125 USA
关键词
D O I
10.1162/neco.1992.4.3.448
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
A quantitative and practical Bayesian framework is described for learning of mappings in feedforward networks. The framework makes possible (1) objective comparisons between solutions using alternative network architectures, (2) objective stopping rules for network pruning or growing procedures, (3) objective choice of magnitude and type of weight decay terms or additive regularizers (for penalizing large weights, etc.), (4) a measure of the effective number of well-determined parameters in a model, (5) quantified estimates of the error bars on network parameters and on network output, and (6) objective comparisons with alternative learning and interpolation models such as splines and radial basis functions. The Bayesian "evidence" automatically embodies "Occam's razor," penalizing overflexible and overcomplex models. The Bayesian approach helps detect poor underlying assumptions in learning models. For learning models well matched to a problem, a good correlation between generalization ability and the Bayesian evidence is obtained.
引用
收藏
页码:448 / 472
页数:25
相关论文
共 23 条
  • [1] The Vapnik-Chervonenkis Dimension: Information versus Complexity in Learning
    Abu-Mostafa, Yaser S.
    [J]. NEURAL COMPUTATION, 1989, 1 (03) : 312 - 317
  • [2] BISHOP CM, 1992, IN PRESS NEURAL COMP
  • [3] Denker JS., 1991, ADV NEURAL INFORM PR, P853, DOI DOI 10.5555/2986766.2986882
  • [4] GULL SF, 1989, FUND THEOR, V36, P53
  • [5] HANSON R, 1991, NASA TR FIA9012701 A
  • [6] HAUSSLER D, 1991, 4TH P COLT WORKSH
  • [7] Hinton G., 1986, PARALLEL DISTRIBUTED, V1, P282, DOI DOI 10.1234/12345678
  • [8] Generalizing Smoothness Constraints from Discrete Samples
    Ji, Chuanyi
    Snapp, Robert R.
    Psaltis, Demetri
    [J]. NEURAL COMPUTATION, 1990, 2 (02) : 188 - 197
  • [9] LeCun Y, 1990, ADV NEURAL INFORM PR, P598
  • [10] LEE WT, 1991, TREE915 PURD U