Penalized Regression, Standard Errors, and Bayesian Lassos

被引:384
作者
Kyung, Minjung [1 ]
Gill, Jeff [2 ]
Ghosh, Malay [1 ]
Casella, George [1 ]
机构
[1] Univ Florida, Dept Stat, Gainesville, FL 32611 USA
[2] Washington Univ, Dept Polit Sci, St Louis, MO USA
来源
BAYESIAN ANALYSIS | 2010年 / 5卷 / 02期
基金
美国国家科学基金会;
关键词
Hierarchical Models; Gibbs Sampling; Geometric Ergodicity; Variable Selection; VARIABLE SELECTION; MODEL SELECTION; GIBBS; BOOTSTRAP;
D O I
10.1214/10-BA607
中图分类号
O1 [数学];
学科分类号
0701 ; 070101 ;
摘要
Penalized regression methods for simultaneous variables election and coefficient estimation, especially those based on the lasso of Tibshirani (1996), have received a great deal of attention in recent years, mostly through frequentist models. Properties such as consistency have been studied, and are achieved by different lasso variations. Here we look at a fully Bayesian formulation of the problem, which is flexible enough to encompass most versions of the lasso that have been previously considered. The advantages of the hierarchical Bayesian formulations are many. In addition to the usual ease-of-interpretation of hierarchical models, the Bayesian formulation produces valid standard errors (which can be problematic for the frequentist lasso), and is based on a geometrically ergodic Markov chain. We compare the performance of the Bayesian lassos to their frequentist counterparts using simulations, data sets that previous lasso papers have used, and a difficult modeling problem for predicting the collapse of governments around the world. In terms of prediction mean squared error, the Bayesian lasso performance is similar to and, in some cases, better than, the frequentist lasso.
引用
收藏
页码:369 / 411
页数:43
相关论文
共 42 条
[1]  
ANDREWS DF, 1974, J ROY STAT SOC B MET, V36, P99
[2]  
[Anonymous], 2006, Journal of the Royal Statistical Society, Series B
[3]   Gene selection using a two-level hierarchical Bayesian model [J].
Bae, K ;
Mallick, BK .
BIOINFORMATICS, 2004, 20 (18) :3423-3430
[4]   ESTIMATED SAMPLING DISTRIBUTIONS - THE BOOTSTRAP AND COMPETITORS [J].
BERAN, R .
ANNALS OF STATISTICS, 1982, 10 (01) :212-225
[5]  
Casella G, 2001, Biostatistics, V2, P485, DOI 10.1093/biostatistics/2.4.485
[6]  
CASELLA G, 2009, RECONCILING MODEL SE
[7]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499
[8]   1977 RIETZ LECTURE - BOOTSTRAP METHODS - ANOTHER LOOK AT THE JACKKNIFE [J].
EFRON, B .
ANNALS OF STATISTICS, 1979, 7 (01) :1-26
[9]  
ESTY DC, 1999, 5 SCI APPL INT CORP
[10]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360