Classical model selection via simulated annealing

被引:44
作者
Brooks, SP
Friel, N
King, R
机构
[1] Univ Cambridge, Stat Lab, Cambridge CB3 0WB, England
[2] Univ Glasgow, Glasgow, Lanark, Scotland
关键词
autoregressive time series; capture-recapture; classical statistics; information criteria; logistic regression; Markov chain Monte Carlo methods; optimization; reversible jump Markov chain Monte Carlo sampling; variable selection;
D O I
10.1111/1467-9868.00399
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
The classical approach to statistical analysis is usually based upon finding values for model parameters that maximize the likelihood function. Model choice in this context is often also based on the likelihood function, but with the addition of a penalty term for the number of parameters. Though models may be compared pairwise by using likelihood ratio tests for example, various criteria such as the Akaike information criterion have been proposed as alternatives when multiple models need to be compared. In practical terms, the classical approach to model selection usually involves maximizing the likelihood function associated with each competing model and then calculating the corresponding criteria value(s). However, when large numbers of models are possible, this quickly becomes infeasible unless a method that simultaneously maximizes over both parameter and model space is available. We propose an extension to the traditional simulated annealing algorithm that allows for moves that not only change parameter values but also move between competing models. This transdimensional simulated annealing algorithm can therefore be used to locate models and parameters that minimize criteria such as the Akaike information criterion, but within a single algorithm, removing the need for large numbers of simulations to be run. We discuss the implementation of the transdimensional simulated annealing algorithm and use simulation studies to examine its performance in realistically complex modelling situations. We illustrate our ideas with a pedagogic example based on the analysis of an autoregressive time series and two more detailed examples: one on variable selection for logistic regression and the other on model selection for the analysis of integrated recapture-recovery data.
引用
收藏
页码:503 / 520
页数:18
相关论文
共 45 条
[1]   NEW LOOK AT STATISTICAL-MODEL IDENTIFICATION [J].
AKAIKE, H .
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 1974, AC19 (06) :716-723
[2]  
ANDRIEU C, 2001, IN PRESS NEUR COMPUT
[3]  
[Anonymous], 1996, BAYESIAN ANAL STAT E
[4]  
[Anonymous], 1997, TABU SEARCH
[5]  
Besag J, 1989, ANAL STAT INFORM, P491
[6]  
BOX GEP, 1970, TIME SERIES ANAL FOR
[7]  
Brent R.P., 1973, ALGORITHMS MINIMIZIN
[8]   Efficient construction of reversible jump Markov chain Monte Carlo proposal distributions [J].
Brooks, SP ;
Giudici, P ;
Roberts, GO .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 2003, 65 :3-39
[9]  
Brooks SP, 1998, J ROY STAT SOC D-STA, V47, P69, DOI 10.1111/1467-9884.00117
[10]   Finite mixture models for proportions [J].
Brooks, SP ;
Morgan, BJT ;
Ridout, MS ;
Pack, SE .
BIOMETRICS, 1997, 53 (03) :1097-1115