Robust full Bayesian learning for radial basis networks

被引:59
作者
Andrieu, C [1 ]
de Freitas, N
Doucet, A
机构
[1] Univ Cambridge, Dept Engn, Cambridge CB2 2PZ, England
[2] Univ Calif Berkeley, Dept Comp Sci, Berkeley, CA 94720 USA
关键词
D O I
10.1162/089976601750541831
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
We propose a hierarchical full Bayesian model for radial basis networks. This model treats the model dimension (number of neurons), model parameters, regularization parameters, and noise parameters as unknown random variables. We develop a reversible-jump Markov chain Monte Carlo (MCMC) method to perform the Bayesian computation. We find that the results obtained using this method are not only better than the ones reported previously, but also appear to be robust with respect to the prior specification. In addition, we propose a novel and computationally efficient reversible-jump MCMC simulated annealing algorithm to optimize neural networks. This algorithm enables us to maximize the joint posterior distribution of the network parameters and the number of basis function. It performs a global search in the joint space of the parameters and number of parameters, thereby surmounting the problem of local minima to a large extent. We show that by calibrating the full hierarchical Bayesian prior, we can obtain the classical Akaike information criterion, Bayesian information criterion, and minimum description length model selection criteria within a penalized likelihood framework. Finally, we present a geometric convergence theorem for the algorithm with homogeneous transition kernel and a convergence theorem for the reversible-jump MCMC simulated annealing method.
引用
收藏
页码:2359 / 2407
页数:49
相关论文
共 54 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]   Sequential MCMC for Bayesian model selection [J].
Andrieu, C ;
De Freitas, N ;
Doucet, A .
PROCEEDINGS OF THE IEEE SIGNAL PROCESSING WORKSHOP ON HIGHER-ORDER STATISTICS, 1999, :130-134
[3]  
ANDRIEU C, IN PRESS SIGNAL PROC
[4]  
ANDRIEU C, 1999, 346 CUEDFINFENGTR
[5]  
ANDRIEU C, 1998, THESIS U CERGY POINT
[6]  
Andrieu C., 1999, SEQUENTIAL BAYESIAN
[7]  
[Anonymous], 1992, Stochastic Stability of Markov chains
[8]  
[Anonymous], 1987, SIMULATED ANNEALING
[9]  
Bernardo J.M., 2009, Bayesian Theory, V405
[10]   HYBRID MONTE-CARLO SIMULATIONS THEORY AND INITIAL COMPARISON WITH MOLECULAR-DYNAMICS [J].
BRASS, A ;
PENDLETON, BJ ;
CHEN, Y ;
ROBSON, B .
BIOPOLYMERS, 1993, 33 (08) :1307-1315