Predictive variable selection in generalized linear models

被引:22
作者
Meyer, MC [1 ]
Laud, PW
机构
[1] Univ Georgia, Dept Stat, Athens, GA 30602 USA
[2] Med Coll Wisconsin, Milwaukee, WI 53226 USA
关键词
conjugate prior; Gibbs sampling; L criterion; normal prior; predictive distribution;
D O I
10.1198/016214502388618654
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
Here we extend predictive method for model selection of Laud and Ibrahim to the generalized linear model. This prescription avoids the need to directly specify prior probabilities of models and prior densities for the parameters. Instead, a prior prediction for the response induces the required priors. We propose normal and conjugate priors for generalized linear models, each using a single prior prediction for the mean response to induce suitable priors for each variable-subset model. In this way, an informative prior is used to select a subset of variables. In addition to producing a ranking of models by size of the predictive criterion. the standard deviation of the criterion is used as a calibration number to produce a set of equally good models. A straightforward Markov chain Monte Carlo algorithm is used to accomplish the necessary computations. We illustrate this method with real and simulated datasets and compare results with the Bayes factors and the Akaike information and Bayes information model selection criteria. The simulation results confirm the efficacy of the method, because the correct model is known. An illustrative application demonstrates selection of important predictors of success in identifying the sentinel lymph node during surgical treatment of breast cancer. A forward selection procedure is described to avoid a full search over the 2(18) possible models in this case.
引用
收藏
页码:859 / 871
页数:13
相关论文
共 38 条
[1]  
AHRENDT GM, 2001, UNPUB DOES BREAST TU
[2]  
Aitchison J., 1975, Statistical Prediction Analysis
[3]  
Akaike H., 1973, 2 INT S INFORM THEOR, P267, DOI [DOI 10.1007/978-1-4612-1694-0_15, 10.1007/978-1-4612-1694-0_15]
[4]   BAYESIAN-ANALYSIS OF BINARY AND POLYCHOTOMOUS RESPONSE DATA [J].
ALBERT, JH ;
CHIB, S .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (422) :669-679
[5]  
BARNDORFFNIELSE.O, 1978, INFORMATION EXPONENT
[6]   A new perspective on priors for generalized linear models [J].
Bedrick, EJ ;
Christensen, R ;
Johnson, W .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1996, 91 (436) :1450-1460
[7]   Power prior distributions for generalized linear models [J].
Chen, MH ;
Ibrahim, JG ;
Shao, QM .
JOURNAL OF STATISTICAL PLANNING AND INFERENCE, 2000, 84 (1-2) :121-137
[8]   Prior elicitation, variable selection and Bayesian computation for logistic regression models [J].
Chen, MH ;
Ibrahim, JG ;
Yiannoutsos, C .
JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-STATISTICAL METHODOLOGY, 1999, 61 :223-242
[9]   Analysis of multivariate probit models [J].
Chib, S ;
Greenberg, E .
BIOMETRIKA, 1998, 85 (02) :347-361
[10]  
Damien P, 1999, J ROY STAT SOC B, V61, P331