On the distribution of penalized maximum likelihood estimators: The LASSO, SCAD, and thresholding

被引:105
作者
Poetscher, Benedikt M. [1 ]
Leeb, Hannes [2 ]
机构
[1] Univ Vienna, Dept Stat, A-1010 Vienna, Austria
[2] Yale Univ, Dept Stat, New Haven, CT 06520 USA
关键词
Penalized maximum likelihood; LASSO; SCAD; Thresholding; Post-model-selection estimator; Finite-sample distribution; Asymptotic distribution; Oracle property; Estimation of distribution; Uniform consistency; MODEL-SELECTION ESTIMATORS; ORACLE PROPERTIES; ADAPTIVE LASSO; REGRESSION; BOOTSTRAP; SHRINKAGE; CONSISTENCY; INFERENCE;
D O I
10.1016/j.jmva.2009.06.010
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
070103 [概率论与数理统计]; 140311 [社会设计与社会创新];
摘要
We study the distributions of the LASSO, SCAD, and thresholding estimators, in finite samples and in the large-sample limit. The asymptotic distributions are derived for both the case where the estimators are tuned to perform consistent model selection and for the case where the estimators are tuned to perform conservative model selection. Our findings complement those of Knight and Fu [K. Knight, W. Fu, Asymptotics for lasso-type estimators, Annals of Statistics 28 (2000) 1356-1378] and Fan and Li [J. Fan, R. Li, Variable selection via non-concave penalized likelihood and its oracle properties, journal of the American Statistical Association 96 (2001) 1348-1360]. We show that the distributions are typically highly non-normal regardless of how the estimator is tuned, and that this property persists in large samples. The uniform convergence rate of these estimators is also obtained, and is shown to be slower than n(-1/2) in case the estimator is tuned to perform consistent model selection, An impossibility result regarding estimation of the estimators' distribution function is also provided. (C) 2009 Elsevier Inc. All rights reserved.
引用
收藏
页码:2065 / 2082
页数:18
相关论文
共 27 条
[1]
[Anonymous], 1998, THEORY POINT ESTIMAT
[2]
Bauer P., 1988, STATISTICS-ABINGDON, V19, P39, DOI 10.1080/02331888808802068
[3]
Diagnosing bootstrap success [J].
Beran, R .
ANNALS OF THE INSTITUTE OF STATISTICAL MATHEMATICS, 1997, 49 (01) :1-24
[4]
Understanding WaveShrink: Variance and bias estimation [J].
Bruce, AG ;
Gao, HY .
BIOMETRIKA, 1996, 83 (04) :727-745
[5]
Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499
[6]
Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[7]
A STATISTICAL VIEW OF SOME CHEMOMETRICS REGRESSION TOOLS [J].
FRANK, IE ;
FRIEDMAN, JH .
TECHNOMETRICS, 1993, 35 (02) :109-135
[8]
Judge GG., 1978, STAT IMPLICATIONS PR
[9]
THE EFFECT OF MODEL SELECTION ON CONFIDENCE-REGIONS AND PREDICTION REGIONS [J].
KABAILA, P .
ECONOMETRIC THEORY, 1995, 11 (03) :537-549
[10]
Knight K, 2000, ANN STAT, V28, P1356