Variable selection using MM algorithms

被引：339

作者：

Hunter, DR ^{[1
]}

Li, RZ ^{[1
]}

机构：

[1] Penn State Univ, Dept Stat, University Pk, PA 16802 USA

来源：

ANNALS OF STATISTICS | 2005年 / 33卷 / 04期

关键词：

AIC; BIC; EM algorithm; LASSO; MM algorithm; penalized likelihood; oracle estimator; SCAD;

D O I：

10.1214/009053605000000200

中图分类号：

O21 [概率论与数理统计]; C8 [统计学];

学科分类号：

020208 ; 070103 ; 0714 ;

摘要：

Variable selection is fundamental to high-dimensional statistical modeling. Many variable selection techniques may be implemented by maximum penalized likelihood using various penalty functions. Optimizing the penalized likelihood function is often challenging because it may be nondifferentiable and/or nonconcave. This article proposes a new class of algorithms for finding a maximizer of the penalized likelihood for a broad class of penalty functions. These algorithms operate by perturbing the penalty function slightly to render it differentiable, then optimizing this differentiable function using a minorize-maximize (MM) algorithm. MM algorithms are useful extensions of the well-known class of EM algorithms, a fact that allows us to analyze the local and global convergence of the proposed algorithm using some of the techniques employed for EM algorithms. In particular, we prove that when our MM algorithms converge, they must converge to a desirable point; we also discuss conditions under which this convergence may be guaranteed. We exploit the Newton-Raphson-like aspect of these algorithms to propose a sandwich estimator for the standard errors of the estimators. Our method performs well in numerical tests.

引用

页码：1617 / 1642

页数：26

共 24 条

[1] Regularization of wavelet approximations - Rejoinder [J].

Antoniadis, A ;

Fan, J .

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (455) :964-967

[2]

Antoniadis A., 1997, J ITALIAN STAT ASS, V6, P97, DOI DOI 10.1007/BF03178905

[3]

CAI J, 2005, IN PRESS BIOMETRIKA

[4] PARTIAL LIKELIHOOD [J].

COX, DR .

BIOMETRIKA, 1975, 62 (02) :269-276

[5] SMOOTHING NOISY DATA WITH SPLINE FUNCTIONS [J].

WAHBA, G .

NUMERISCHE MATHEMATIK, 1975, 24 (05) :383-393

[6] MAXIMUM LIKELIHOOD FROM INCOMPLETE DATA VIA EM ALGORITHM [J].

DEMPSTER, AP ;

LAIRD, NM ;

RUBIN, DB .

JOURNAL OF THE ROYAL STATISTICAL SOCIETY SERIES B-METHODOLOGICAL, 1977, 39 (01) :1-38

[7] New estimation and model selection procedures for semiparametric modeling in longitudinal data analysis [J].

Fan, JQ ;

Li, R .

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2004, 99 (467) :710-723

[8] Nonconcave penalized likelihood with a diverging number of parameters [J].

Fan, JQ ;

Peng, H .

ANNALS OF STATISTICS, 2004, 32 (03) :928-961

[9]

Fan JQ, 2002, ANN STAT, V30, P74

[10] Variable selection via nonconcave penalized likelihood and its oracle properties [J].

Fan, JQ ;

Li, RZ .

JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 2001, 96 (456) :1348-1360

← 1 2 3 →