Sparse estimators and the oracle property, or the return of Hodges' estimator

被引:126
作者
Leeb, Hannes [2 ]
Poetscher, Benedikt M. [1 ]
机构
[1] Univ Vienna, Dept Stat, A-1010 Vienna, Austria
[2] Yale Univ, Dept Stat, New Haven, CT 06520 USA
关键词
oracle property; sparsity; penalized maximum likelihood; penalized least squares; Hodges' estimator; SCAD; lasso; bridge estimator; hard-thresholding; maximal risk; maximal absolute bias; nonuniform limits;
D O I
10.1016/j.jeconom.2007.05.017
中图分类号
F [经济];
学科分类号
02 [经济学];
摘要
We point out some pitfalls related to the concept of an oracle property as used in Fan and Li [2001. Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association 96, 1348-1360; 2002. Variable selection for Cox's proportional hazards model and frailty model. Annals of Statistics 30, 74-99; 2004. New estimation and model selection procedures for semiparametric modeling in longitudinal data analysis. Journal of the American Statistical Association 99, 710-723] which are reminiscent of the well-known pitfalls related to Hodges' estimator. The oracle property is often a consequence of sparsity of an estimator. We show that any estimator satisfying a sparsity property has maximal risk that converges to the supremum of the loss function; in particular, the maximal risk diverges to infinity whenever the loss function is unbounded. For ease of presentation the result is set in the framework of a linear regression model, but generalizes far beyond that setting. In a Monte Carlo study we also assess the extent of the problem in finite samples for the smoothly clipped absolute deviation (SCAD) estimator introduced in Fan and Li [2001. Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association 96, 1348-1360]. We find that this estimator can perform rather poorly in finite samples and that its worst-case performance relative to maximum likelihood deteriorates with increasing sample size when the estimator is tuned to sparsity. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:201 / 211
页数:11
相关论文
共 27 条
[1]
[Anonymous], FDN STAT INFERENCE
[2]
Covariate selection for semiparametric hazard function regression models [J].
Bunea, F ;
McKeague, IW .
JOURNAL OF MULTIVARIATE ANALYSIS, 2005, 92 (01) :186-204
[3]
Consistent covariate selection and post model selection inference in semiparametric regression [J].
Bunea, F .
ANNALS OF STATISTICS, 2004, 32 (03) :898-927
[4]
Vairiable selection for multivariate failure time data [J].
Cai, JW ;
Fan, JQ ;
Li, RZ ;
Zhou, HB .
BIOMETRIKA, 2005, 92 (02) :303-316
[5]
New estimation and model selection procedures for semiparametric modeling in longitudinal data analysis [J].
Fan, JQ ;
Li, R .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2004, 99 (467) :710-723
[6]
Nonconcave penalized likelihood with a diverging number of parameters [J].
Fan, JQ ;
Peng, H .
ANNALS OF STATISTICS, 2004, 32 (03) :928-961
[7]
Fan JQ, 2002, ANN STAT, V30, P74
[8]
Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[9]
THE RISK INFLATION CRITERION FOR MULTIPLE-REGRESSION [J].
FOSTER, DP ;
GEORGE, EI .
ANNALS OF STATISTICS, 1994, 22 (04) :1947-1975
[10]
A STATISTICAL VIEW OF SOME CHEMOMETRICS REGRESSION TOOLS [J].
FRANK, IE ;
FRIEDMAN, JH .
TECHNOMETRICS, 1993, 35 (02) :109-135