Sparse estimators and the oracle property, or the return of Hodges' estimator

被引：126

作者：

Leeb, Hannes ^{[2
]}

Poetscher, Benedikt M. ^{[1
]}

机构：

[1] Univ Vienna, Dept Stat, A-1010 Vienna, Austria

[2] Yale Univ, Dept Stat, New Haven, CT 06520 USA

来源：

JOURNAL OF ECONOMETRICS | 2008年 / 142卷 / 01期

关键词：

oracle property; sparsity; penalized maximum likelihood; penalized least squares; Hodges' estimator; SCAD; lasso; bridge estimator; hard-thresholding; maximal risk; maximal absolute bias; nonuniform limits;

D O I：

10.1016/j.jeconom.2007.05.017

中图分类号：

F [经济];

学科分类号：

02 [经济学];

摘要：

We point out some pitfalls related to the concept of an oracle property as used in Fan and Li [2001. Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association 96, 1348-1360; 2002. Variable selection for Cox's proportional hazards model and frailty model. Annals of Statistics 30, 74-99; 2004. New estimation and model selection procedures for semiparametric modeling in longitudinal data analysis. Journal of the American Statistical Association 99, 710-723] which are reminiscent of the well-known pitfalls related to Hodges' estimator. The oracle property is often a consequence of sparsity of an estimator. We show that any estimator satisfying a sparsity property has maximal risk that converges to the supremum of the loss function; in particular, the maximal risk diverges to infinity whenever the loss function is unbounded. For ease of presentation the result is set in the framework of a linear regression model, but generalizes far beyond that setting. In a Monte Carlo study we also assess the extent of the problem in finite samples for the smoothly clipped absolute deviation (SCAD) estimator introduced in Fan and Li [2001. Variable selection via nonconcave penalized likelihood and its oracle properties. Journal of the American Statistical Association 96, 1348-1360]. We find that this estimator can perform rather poorly in finite samples and that its worst-case performance relative to maximum likelihood deteriorates with increasing sample size when the estimator is tuned to sparsity. (C) 2007 Elsevier B.V. All rights reserved.

引用

页码：201 / 211

页数：11

共 27 条

[1]

[Anonymous], FDN STAT INFERENCE

[2]

Covariate selection for semiparametric hazard function regression models [J].