ESTIMATION OF THE DERIVATIVE OF A STATIONARY MEASURE WITH RESPECT TO A CONTROL PARAMETER

Cited by: 26
Authors
VAZQUEZABAD, FJ
KUSHNER, HJ
Keywords
MONTE-CARLO OPTIMIZATION; SYSTEM SENSITIVITY; STATIONARY SYSTEMS OPTIMIZATION; PARAMETRIC ERGODIC OPTIMIZATION; STOCHASTIC APPROXIMATION
DOI
10.2307/3214571
Chinese Library Classification
O21 [Probability theory and mathematical statistics]; C8 [Statistics]
Discipline classification codes
020208; 070103; 0714
Abstract
The paper deals with a problem which arises in the Monte Carlo optimization of steady-state or ergodic systems which can be modelled by Markov chains. The transition probability depends on a parameter, and one wishes to find the parameter value at which some performance function is minimized. The only available data are obtained from either simulation or actual operating information. For such a problem one needs good statistical estimates of the derivatives. Conditions are given for the existence of the derivative of the stationary measure with respect to the parameter, in the sense that the derivative is a signed measure, and is the limit of the natural approximating sequence. Some properties and a useful characterization of the derivative are obtained. It is also shown that, under appropriate conditions, the derivative of the n-step transition function converges to the derivative of the stationary measure as n tends to infinity. This latter result is of particular importance whether one is simply estimating or is actually optimizing via some sequential Monte Carlo procedure, since the basic observations are always taken over a finite time interval.
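The convergence statement in the abstract can be illustrated on a toy case. The sketch below is not taken from the paper: it assumes a hypothetical two-state chain whose transition matrix is P(theta) = [[1 - theta, theta], [b, 1 - b]] with b held fixed, for which the stationary measure and its derivative are available in closed form. All function names and parameter values are illustrative assumptions. The derivative of the n-step transition matrix is built with the product rule, d/dtheta P^(k+1) = (d/dtheta P^k) P + P^k (d/dtheta P), and each of its rows is compared with the derivative of the stationary measure.

```python
# Toy illustration (assumed two-state chain, not the paper's general setting).

def matmul(A, B):
    """Product of two square matrices stored as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def matadd(A, B):
    """Entrywise sum of two matrices."""
    return [[a + b for a, b in zip(ra, rb)] for ra, rb in zip(A, B)]

def transition(theta, b=0.3):
    """P(theta): state 0 leaves with probability theta, state 1 with b."""
    return [[1.0 - theta, theta], [b, 1.0 - b]]

def transition_derivative(theta, b=0.3):
    """dP/dtheta; only the first row depends on theta."""
    return [[-1.0, 1.0], [0.0, 0.0]]

def stationary(theta, b=0.3):
    """Closed-form stationary measure pi(theta) of the two-state chain."""
    s = theta + b
    return [b / s, theta / s]

def stationary_derivative(theta, b=0.3):
    """Closed-form d pi / d theta; a signed measure with total mass zero."""
    s = theta + b
    return [-b / s**2, b / s**2]

def n_step_derivative(theta, n, b=0.3):
    """d/dtheta of P(theta)^n, accumulated with the product rule."""
    P = transition(theta, b)
    dP = transition_derivative(theta, b)
    Pn = [[1.0, 0.0], [0.0, 1.0]]   # P^0
    Dn = [[0.0, 0.0], [0.0, 0.0]]   # derivative of P^0
    for _ in range(n):
        Dn = matadd(matmul(Dn, P), matmul(Pn, dP))  # uses P^k before update
        Pn = matmul(Pn, P)
    return Dn

if __name__ == "__main__":
    theta = 0.4
    target = stationary_derivative(theta)
    for n in (1, 5, 20, 50):
        D = n_step_derivative(theta, n)
        gap = max(abs(D[i][j] - target[j]) for i in range(2) for j in range(2))
        print(n, gap)
```

In this toy case the gap shrinks geometrically because the second eigenvalue of P(theta) is 1 - theta - b, which has modulus below one. The paper's conditions for general state spaces are not reproduced here.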
Pages: 343-352
Page count: 10