Gradient methods for minimizing composite functions

被引:983
作者
Nesterov, Yu [1 ]
机构
[1] Catholic Univ Louvain, CORE, B-1348 Louvain, Belgium
关键词
Local optimization; Convex Optimization; Nonsmooth optimization; Complexity theory; Black-box model; Optimal methods; Structural optimization; l(1)-Regularization; MINIMIZATION;
D O I
10.1007/s10107-012-0629-5
中图分类号
TP31 [计算机软件];
学科分类号
081202 ; 0835 ;
摘要
In this paper we analyze several new methods for solving optimization problems with the objective function formed as a sum of two terms: one is smooth and given by a black-box oracle, and another is a simple general convex function with known structure. Despite the absence of good properties of the sum, such problems, both in convex and nonconvex cases, can be solved with efficiency typical for the first part of the objective. For convex problems of the above structure, we consider primal and dual variants of the gradient method (with convergence rate O (1/k)), and an accelerated multistep version with convergence rate O (1/k(2)), where k is the iteration counter. For nonconvex problems with this structure, we prove convergence to a point from which there is no descent direction. In contrast, we show that for general nonsmooth, nonconvex problems, even resolving the question of whether a descent direction exists from a point is NP-hard. For all methods, we suggest some efficient "line search" procedures and show that the additional computational work necessary for estimating the unknown problem class parameters can only multiply the complexity of each iteration by a small constant factor. We present also the results of preliminary computational experiments, which confirm the superiority of the accelerated scheme.
引用
收藏
页码:125 / 161
页数:37
相关论文
共 20 条
[1]  
[Anonymous], 2007, 200776 CORE
[2]   Atomic decomposition by basis pursuit [J].
Chen, SSB ;
Donoho, DL ;
Saunders, MA .
SIAM JOURNAL ON SCIENTIFIC COMPUTING, 1998, 20 (01) :33-61
[3]   ROBUST MODELING WITH ERRATIC DATA [J].
CLAERBOUT, JF ;
MUIR, F .
GEOPHYSICS, 1973, 38 (05) :826-844
[4]  
Figueiredo M., GRADIENT PROJE UNPUB
[5]   A GENERALIZED PROXIMAL POINT ALGORITHM FOR CERTAIN NONCONVEX MINIMIZATION PROBLEMS [J].
FUKUSHIMA, M ;
MINE, H .
INTERNATIONAL JOURNAL OF SYSTEMS SCIENCE, 1981, 12 (08) :989-1000
[6]  
Kim S.-J., 2007, RES REPORT
[7]   RECONSTRUCTION OF A SPARSE SPIKE TRAIN FROM A PORTION OF ITS SPECTRUM AND APPLICATION TO HIGH-RESOLUTION DECONVOLUTION [J].
LEVY, S ;
FULLAGAR, PK .
GEOPHYSICS, 1981, 46 (09) :1235-1243
[8]  
Miller A., 2002, Subset Selection in Regression
[9]  
Nemirovsky A., 1983, INFORM COMPLEXITY EF
[10]   Smooth minimization of non-smooth functions [J].
Nesterov, Y .
MATHEMATICAL PROGRAMMING, 2005, 103 (01) :127-152