Least angle regression - Rejoinder

Cited by: 5916
Authors
Efron, B [1]
Hastie, T [1]
Johnstone, I [1]
Tibshirani, R [1]
Affiliations
[1] Stanford Univ, Dept Stat, Stanford, CA 94305 USA
Keywords
Boosting; Coefficient paths; Lasso; Linear regression; Variable selection
DOI
10.1214/009053604000000067
Chinese Library Classification (CLC)
O21 [Probability Theory and Mathematical Statistics]; C8 [Statistics]
Subject Classification Codes
020208; 070103; 0714
Abstract
The purpose of model selection algorithms such as All Subsets, Forward Selection and Backward Elimination is to choose a linear model on the basis of the same set of data to which the model will be applied. Typically we have available a large collection of possible covariates from which we hope to select a parsimonious set for the efficient prediction of a response variable. Least Angle Regression (LARS), a new model selection algorithm, is a useful and less greedy version of traditional forward selection methods. Three main properties are derived: (1) A simple modification of the LARS algorithm implements the Lasso, an attractive version of ordinary least squares that constrains the sum of the absolute regression coefficients; the LARS modification calculates all possible Lasso estimates for a given problem, using an order of magnitude less computer time than previous methods. (2) A different LARS modification efficiently implements Forward Stagewise linear regression, another promising new model selection method; this connection explains the similar numerical results previously observed for the Lasso and Stagewise, and helps us understand the properties of both methods, which are seen as constrained versions of the simpler LARS algorithm. (3) A simple approximation for the degrees of freedom of a LARS estimate is available, from which we derive a Cp estimate of prediction error; this allows a principled choice among the range of possible LARS estimates. LARS and its variants are computationally efficient: the paper describes a publicly available algorithm that requires only the same order of magnitude of computational effort as ordinary least squares applied to the full set of covariates. © Institute of Mathematical Statistics, 2004.
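Point (3) of the abstract describes a C_p-type stopping rule built on a simple degrees-of-freedom approximation. The following is a minimal sketch of that rule, assuming scikit-learn's lars_path and the diabetes data used in the paper; the full-model OLS noise-variance estimate and the df ≈ number-of-active-covariates approximation follow the abstract's description, while the variable names and centering details are illustrative, not the authors' reference implementation.

```python
# Minimal sketch: trace the LARS/Lasso coefficient path, then pick the
# step minimizing C_p = RSS / sigma^2 - n + 2 * df, with df approximated
# by the number of active covariates at that step (per the abstract).
import numpy as np
from sklearn.datasets import load_diabetes
from sklearn.linear_model import lars_path

X, y = load_diabetes(return_X_y=True)   # columns of X are standardized
n, p = X.shape
y_c = y - y.mean()                      # center the response

# Full coefficient path; coefs has one column per step of the algorithm.
alphas, active, coefs = lars_path(X, y_c, method="lasso")

# Noise variance estimated from the full OLS fit (an assumption here,
# consistent with the paper's use of the saturated model).
beta_ols, *_ = np.linalg.lstsq(X, y_c, rcond=None)
sigma2 = np.sum((y_c - X @ beta_ols) ** 2) / (n - p - 1)

# C_p at each step of the path.
cp = []
for k in range(coefs.shape[1]):
    rss = np.sum((y_c - X @ coefs[:, k]) ** 2)
    df = np.count_nonzero(coefs[:, k])  # df approx = active set size
    cp.append(rss / sigma2 - n + 2 * df)

best = int(np.argmin(cp))
print(f"C_p-selected step: {best}, "
      f"active covariates: {np.count_nonzero(coefs[:, best])}")
```

Because lars_path returns the entire coefficient path in a single pass, evaluating C_p at every step adds essentially nothing to the OLS-order computational cost the abstract advertises.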
Pages: 494-499
Page count: 6