Model selection and estimation in regression with grouped variables

被引:1136
作者
Yuan, M [1 ]
Lin, Y
机构
[1] Georgia Inst Technol, Sch Ind & Syst Engn, Atlanta, GA 30332 USA
[2] Univ Wisconsin, Madison, WI USA
关键词
analysis of variance; Lasso; least angle regression; non-negative garrotte; piecewise linear solution path;
D O I
10.1111/j.1467-9868.2005.00532.x
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
We consider the problem of selecting grouped variables (factors) for accurate prediction in regression. Such a problem arises naturally in many practical situations with the multifactor analysis-of-variance problem as the most important and well-known example. Instead of selecting factors by stepwise backward elimination, we focus on the accuracy of estimation and consider extensions of the lasso, the LARS algorithm and the non-negative garrotte for factor selection. The lasso, the LARS algorithm and the non-negative garrotte are recently proposed regression methods that can be used to select individual variables. We study and propose efficient algorithms for the extensions of these methods for factor selection and show that these extensions give superior performance to the traditional stepwise backward elimination method in factor selection problems. We study the similarities and the differences between these methods. Simulations and real examples are used to illustrate the methods.
引用
收藏
页码:49 / 67
页数:19
相关论文
共 14 条
[1]  
Bakin S., 1999, Ph.D. Thesis
[2]   BETTER SUBSET REGRESSION USING THE NONNEGATIVE GARROTE [J].
BREIMAN, L .
TECHNOMETRICS, 1995, 37 (04) :373-384
[3]   Least angle regression - Rejoinder [J].
Efron, B ;
Hastie, T ;
Johnstone, I ;
Tibshirani, R .
ANNALS OF STATISTICS, 2004, 32 (02) :494-499
[4]   Variable selection via nonconcave penalized likelihood and its oracle properties [J].
Fan, JQ ;
Li, RZ .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360
[5]   THE RISK INFLATION CRITERION FOR MULTIPLE-REGRESSION [J].
FOSTER, DP ;
GEORGE, EI .
ANNALS OF STATISTICS, 1994, 22 (04) :1947-1975
[6]   Penalized regressions: The bridge versus the lasso [J].
Fu, WJJ .
JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS, 1998, 7 (03) :397-416
[7]   VARIABLE SELECTION VIA GIBBS SAMPLING [J].
GEORGE, EI ;
MCCULLOCH, RE .
JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1993, 88 (423) :881-889
[8]   Calibration and empirical Bayes variable selection [J].
George, EI ;
Foster, DP .
BIOMETRIKA, 2000, 87 (04) :731-747
[9]  
Hosmer D. W., 1989, APPL LOGISTIC REGRES, DOI DOI 10.1097/00019514-200604000-00003
[10]  
LIN Y, 2003, 1072 U WISC DEP STAT