AGGREGATION OF THE POLICY ITERATION METHOD FOR NEARLY COMPLETELY DECOMPOSABLE MARKOV-CHAINS

被引:23
作者
ALDHAHERI, RW
KHALIL, HK
机构
[1] KING ABDULAZIZ UNIV,DEPT ELECT ENGN,JEDDAH 21413,SAUDI ARABIA
[2] MICHIGAN STATE UNIV,DEPT ELECT ENGN,E LANSING,MI 48824
关键词
D O I
10.1109/9.67293
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper studies a steady-state optimal control problem for nearly completely decomposable Markov chains. In order to apply the policy iteration method of Howard, a high-dimensional ill-conditioned system of algebraic equations must be solved in the value determination step. Although algorithms exist for aggregation of the steady-state probability distribution problem, they only provide methods for computing the cost but not the dual variables. Using a singular perturbation approach, an aggregation method for the value determination equation is developed. The aggregation method is developed in three steps. First, a class of similarity transformations that transform the system into a singularly perturbed form is developed. Second, an aggregation method to compute the steady-state probability distribution is derived. Third, this aggregation method is appled to the value determination step of Howard's method.
引用
收藏
页码:178 / 187
页数:10
相关论文
共 32 条
[1]  
ALDAHERI RW, 1988, THESIS MICHIGAN STAT
[2]  
Bertsekas D.P., 1987, ABSTRACT DYNAMIC PRO
[3]   ITERATIVE AGGREGATION DISAGGREGATION TECHNIQUES FOR NEARLY UNCOUPLED MARKOV-CHAINS [J].
CAO, WL ;
STEWART, WJ .
JOURNAL OF THE ACM, 1985, 32 (03) :702-719
[4]   HIERARCHICAL AGGREGATION OF SINGULARLY PERTURBED FINITE STATE MARKOV PROCESSES. [J].
Coderch, M. ;
Willsky, A.S. ;
Sastry, S.S. ;
Castanon, D.A. .
Stochastics, 1983, 8 (04) :259-289
[5]  
COHEN AI, 1982, RES DEV UNIFIED APPR
[6]   ERROR ANALYSIS IN NEARLY COMPLETELY DECOMPOSABLE STOCHASTIC SYSTEMS [J].
COURTOIS, PJ .
ECONOMETRICA, 1975, 43 (04) :691-709
[7]   BLOCK ITERATIVE ALGORITHMS FOR STOCHASTIC MATRICES [J].
COURTOIS, PJ ;
SEMAL, P .
LINEAR ALGEBRA AND ITS APPLICATIONS, 1986, 76 :59-70
[8]   DECOMPOSABILITY, INSTABILITIES, AND SATURATION IN MULTIPROGRAMMING SYSTEMS [J].
COURTOIS, PJ .
COMMUNICATIONS OF THE ACM, 1975, 18 (07) :371-377
[9]  
COURTOIS PJ, 1977, DECOMPOSABILITY
[10]   A UNIFIED VIEW OF AGGREGATION AND COHERENCY IN NETWORKS AND MARKOV-CHAINS [J].
DELEBECQUE, F ;
QUADRAT, JP ;
KOKOTOVIC, PV .
INTERNATIONAL JOURNAL OF CONTROL, 1984, 40 (05) :939-952