On the entropic regularization method for solving min-max problems with applications

被引:93
作者
Li, XS [1 ]
Fang, SC
机构
[1] Dalian Univ Technol, Res Inst Engn Mech, Dalian 116024, Peoples R China
[2] N Carolina State Univ, Raleigh, NC 27695 USA
关键词
min-max problem; linear and nonlinear programming; entropy optimization principles;
D O I
10.1007/BF01199466
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
Consider a min-max problem in the form of min(x is an element of max1 less than or equal to i less than or equal to m){f(i)(x)}. It is well-known that the non-differentiability of the max function F(x) equivalent to max(1 less than or equal to i less than or equal to m){f(i)(x)} presents difficulty in finding an optimal solution. An entropic regularization procedure provides a smooth approximation F-p(x) that uniformly converges to F(x) over X with a difference bounded by ln(m)/p, for p > 0. In this way, with p being sufficiently large, minimizing the smooth function F,(x) over X provides a very accurate solution to the min-max problem. The same procedure can be applied to solve systems of inequalities, linear programming problems, and constrained min-max problems.
引用
收藏
页码:119 / 130
页数:12
相关论文
共 20 条
[1]  
BENTAL A, 1989, LECT NOTES MATH, V1405, P1
[2]  
Bertsekas D. P., 2019, Reinforcement learning and optimal control
[3]   EFFICIENT METHOD TO SOLVE MINIMAX PROBLEM DIRECTLY [J].
CHARALAMBOUS, C ;
CONN, AR .
SIAM JOURNAL ON NUMERICAL ANALYSIS, 1978, 15 (01) :162-187
[4]   A SMOOTH METHOD FOR THE FINITE MINIMAX PROBLEM [J].
DIPILLO, G ;
GRIPPO, L ;
LUCIDI, S .
MATHEMATICAL PROGRAMMING, 1993, 60 (02) :187-214
[5]  
Fang S.-C., 1993, Linear Optimization and Extensions: Theory and Algorithms, VFirst
[6]  
Fletcher R., 1981, Practical methods of optimization, volume 2, Constrained Optimization, V2
[7]   A REGULARIZATION METHOD FOR SOLVING THE FINITE CONVEX MIN-MAX PROBLEM [J].
GIGOLA, C ;
GOMEZ, S .
SIAM JOURNAL ON NUMERICAL ANALYSIS, 1990, 27 (06) :1621-1634
[8]  
Hiriart-Urruty J. B., 1996, CONVEX ANAL MINIMIZA, V305
[9]  
Huard P., 1967, NONLINEAR PROGRAMMIN, P207
[10]   INFORMATION THEORY AND STATISTICAL MECHANICS [J].
JAYNES, ET .
PHYSICAL REVIEW, 1957, 106 (04) :620-630