Variational free energy and the Laplace approximation

被引:601
作者
Friston, Karl J.
Mattout, Jeremie
Trujillo-Barreto, Nelson
Ashburner, John
Penny, Will
机构
[1] UCL, Inst Neurol, Wellcome Dept Imaging Neurosci, London WC1N 3BG, England
[2] Cuban Neurosci Ctr, Havana, Cuba
基金
英国惠康基金;
关键词
Variational Bayes; free energy; expectation maximisation; restricted maximum likelihood; model selection; automatic relevance determination; relevance vector machines;
D O I
10.1016/j.neuroimage.2006.08.035
中图分类号
Q189 [神经科学];
学科分类号
071006 ;
摘要
This note derives the variational free energy under the Laplace approximation, with a focus on accounting for additional model complexity induced by increasing the number of model parameters. This is relevant when using the free energy as an approximation to the log-evidence in Bayesian model averaging and selection. By setting restricted maximum likelihood (ReML) in the larger context of variational learning and expectation maximisation (EM), we show how the ReML objective function can be adjusted to provide an approximation to the log-evidence for a particular model. This means ReML can be used for model selection, specifically to select or compare models with different covariance components. This is useful in the context of hierarchical models because it enables a principled selection of priors that, under simple hyperpriors, can be used for automatic model selection and relevance determination (ARD). Deriving the ReML objective function, from basic variational principles, discloses the simple relationships among Variational Bayes, EM and ReML. Furthermore, we show that EM is formally identical to a full variational treatment when the precisions are linear in the hyperparameters. Finally, we also consider, briefly, dynamic models and how these inform the regularisation of free energy ascent schemes, like EM and ReML. (c) 2006 Elsevier Inc. All rights reserved.
引用
收藏
页码:220 / 234
页数:15
相关论文
共 44 条
[1]  
ADAMI KZ, 2003, PHYSTAT2003 SLAC STA
[2]  
[Anonymous], 1998, LEARNING GRAPHICAL M
[3]  
[Anonymous], PRACTICAL GUIDE NEUR
[4]   Unified segmentation [J].
Ashburner, J ;
Friston, KJ .
NEUROIMAGE, 2005, 26 (03) :839-851
[5]  
BEAL MJ, 2003, BAYESIAN STAT, pCH7
[6]  
BEAL MJ, 1998, VARIATIONAL ALGORITH, P58
[7]  
Berger JO., 1985, STAT DECISION THEORY, DOI DOI 10.1007/978-1-4757-4286-2
[8]  
BISHOP CM, 1999, LEARNING GRAPHICAL M
[9]   Dynamic causal modeling of evoked responses in EEG and MEG [J].
David, Olivier ;
Kiebel, Stefan J. ;
Harrison, Lee M. ;
Mattout, Jrmie ;
Kilner, James M. ;
Friston, Karl J. .
NEUROIMAGE, 2006, 30 (04) :1255-1272
[10]   MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].
DEMPSTER, AP ;
LAIRD, NM ;
RUBIN, DB .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38