Comparative study of stochastic algorithms for system optimization based on gradient approximations

被引:74
作者
Chin, DC
机构
[1] Applied Physics Laboratory, Johns Hopkins University, Laurel
来源
IEEE TRANSACTIONS ON SYSTEMS MAN AND CYBERNETICS PART B-CYBERNETICS | 1997年 / 27卷 / 02期
关键词
asymptotic normality; convergence rate; gradient approximation; Kiefer-Wolfowitz algorithm; optimization; stochastic approximation;
D O I
10.1109/3477.558808
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Stochastic approximation (SA) algorithms can be used in system optimization problems for which only noisy measurements of the system are available and the gradient of the loss function is not. This type of problem can be found in adaptive control, neural network training, experimental design, stochastic optimization, and many other areas. This paper studies three types of SA algorithms in a multivariate Kiefer-Wolfowitz setting, which uses only noisy measurements of the loss function (i.e., no loss function gradient measurements). The algorithms considered are: the standard finite-difference SA (FDSA) and two accelerated algorithms, the random-directions SA (RDSA) and the simultaneous-perturbation SA (SPSA). RDSA and SPSA use randomized gradient approximations based on (generally) far fewer function measurements than FDSA in each iteration, This paper describes the asymptotic error distribution for a class of RDSA algorithms, and compares the RDSA, SPSA, and FDSA algorithms theoretically (using mean-square errors computed from asymptotic distributions) and numerically. Based on the theoretical and numerical results, SPSA is the preferable algorithm to use.
引用
收藏
页码:244 / 249
页数:6
相关论文
共 20 条
[1]  
[Anonymous], 1971, S OPTIMIZING METHODS
[2]  
Bazaraa MokhtarS., 1979, Nonlinear Programming: Theory and Algorithms
[3]   MULTIDIMENSIONAL STOCHASTIC APPROXIMATION METHODS [J].
BLUM, JR .
ANNALS OF MATHEMATICAL STATISTICS, 1954, 25 (04) :737-744
[4]   ON A CLASS OF STOCHASTIC-APPROXIMATION PROCESSES [J].
BURKHOLDER, DL .
ANNALS OF MATHEMATICAL STATISTICS, 1956, 27 (04) :1044-1059
[5]  
CHIN DC, 1990, P STAT COMPUT SECTIO, P223
[6]   ON ASYMPTOTIC NORMALITY IN STOCHASTIC APPROXIMATION [J].
FABIAN, V .
ANNALS OF MATHEMATICAL STATISTICS, 1968, 39 (04) :1327-&
[7]  
Goldstein L., 1988, J THEORETICAL PROBAB, V1, P189
[8]  
HO YC, 1991, PERTURBATION ANAL DI
[9]   STOCHASTIC ESTIMATION OF THE MAXIMUM OF A REGRESSION FUNCTION [J].
KIEFER, J ;
WOLFOWITZ, J .
ANNALS OF MATHEMATICAL STATISTICS, 1952, 23 (03) :462-466
[10]  
KUSHNER HJ, 1978, STOCHASTIC APPROXIMA