BILINEAR-PROGRAMMING AND STRUCTURED STOCHASTIC GAMES

被引:11
作者
FILAR, JA [1 ]
SCHULTZ, TA [1 ]
机构
[1] JOHNS HOPKINS UNIV,DEPT MATH SCI,BALTIMORE,MD 21218
关键词
BILINEAR PROGRAMMING - DISCOUNTED STOCHASTIC GAMES - ONE-STEP ALGORITHMS - STATIONARY STRATEGIES - STRUCTURED STOCHASTIC GAMES;
D O I
10.1007/BF00938818
中图分类号
C93 [管理学]; O22 [运筹学];
学科分类号
070105 ; 12 ; 1201 ; 1202 ; 120202 ;
摘要
One-step algorithms are presented for two classes of structured stochastic games, namely, those with additive rewards and transitions and those which have switching controllers. Solutions to such classes of games under the average reward criterion can be derived from optimal solutions to appropriate bilinear programs. The validity of using bilinear programming as a solution method follows from two preliminary theorems, the first of which is a complete classification of undiscounted stochastic games with optimal stationary strategies. The second of these preliminary theorems relaxes the conditions of the classification theorem for certain classes of stochastic games and provides the basis for the bilinear programming results. Analogous results hold for the discounted stochastic games with the above special structures.
引用
收藏
页码:85 / 104
页数:20
相关论文
共 28 条
[1]  
[Anonymous], 1966, MANAGEMENT SCI, DOI 10.1287/mnsc.12.5.359
[2]  
Bewley T., 1976, Mathematics of Operations Research, V1, P197, DOI 10.1287/moor.1.3.197
[3]  
Bewley T., 1978, Mathematics of Operations Research, V3, P104, DOI 10.1287/moor.3.2.104
[4]  
Bewley T., 1976, Mathematics of Operations Research, V1, P321, DOI 10.1287/moor.1.4.321
[5]   BIG MATCH [J].
BLACKWELL, D ;
FERGUSON, TS .
ANNALS OF MATHEMATICAL STATISTICS, 1968, 39 (01) :159-+
[6]  
BLACKWELL D, 1962, ANN MATH STATISTICS, V33, P104
[7]  
FAIZ A, 1983, MATH OPERATIONS RES, V8, P273
[8]   SUCCESSIVE APPROXIMATION METHODS IN UNDISCOUNTED STOCHASTIC GAMES [J].
FEDERGRUEN, A .
OPERATIONS RESEARCH, 1980, 28 (03) :794-809
[9]  
Filar J.A., 1986, MATH PROGRAM, V35, P243
[10]   ORDERED FIELD PROPERTY FOR STOCHASTIC GAMES WHEN THE PLAYER WHO CONTROLS TRANSITIONS CHANGES FROM STATE TO STATE [J].
FILAR, JA .
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1981, 34 (04) :503-515