Estimation and Accuracy After Model Selection

被引:229
作者
Efron, Bradley [1 ]
机构
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
基金
美国国家科学基金会;
关键词
Bootstrap smoothing; C-p; Importance sampling; ABC intervals; Model averaging; Bagging; Lasso; EXPONENTIAL-FAMILIES; CONFIDENCE-INTERVALS; STANDARD ERRORS; BOOTSTRAP; LASSO; REGRESSION; STATISTICS; SUPERNOVAE; JACKKNIFE; INFERENCE;
D O I
10.1080/01621459.2013.823775
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
070103 [概率论与数理统计]; 140311 [社会设计与社会创新];
摘要
Classical statistical theory ignores model selection in assessing estimation accuracy. Here we consider bootstrap methods for computing standard errors and confidence intervals that take model selection into account. The methodology involves bagging, also known as bootstrap smoothing, to tame the erratic discontinuities of selection-based estimators. A useful new formula for the accuracy of bagging then provides standard errors for the smoothed estimators. Two examples, nonparametric and parametric, are carried through in detail: a regression model where the choice of degree (linear, quadratic, cubic, horizontal ellipsis ) is determined by the C-p criterion and a Lasso-based estimation problem.
引用
收藏
页码:991 / 1007
页数:17
相关论文
共 29 条
[1]
[Anonymous], 1993, An introduction to the bootstrap
[2]
Berk R., 2012, ANN STAT
[3]
Breiman L, 1996, MACH LEARN, V24, P123, DOI 10.1023/A:1018054314350
[4]
Model selection: An integral part of inference [J].
Buckland, ST ;
Burnham, KP ;
Augustin, NH .
BIOMETRICS, 1997, 53 (02) :603-618
[5]
Bühlmann P, 2002, ANN STAT, V30, P927
[6]
Buja A, 2006, STAT SINICA, V16, P323
[7]
Bootstrapping Lasso Estimators [J].
Chatterjee, A. ;
Lahiri, S. N. .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2011, 106 (494) :608-625
[8]
DICICCIO T, 1992, BIOMETRIKA, V79, P231, DOI 10.1093/biomet/79.2.231
[9]
Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499
[10]
EFRON B, 1992, J ROY STAT SOC B MET, V54, P83