STATE-OF-THE-ART - A SURVEY OF PARTIALLY OBSERVABLE MARKOV DECISION-PROCESSES - THEORY, MODELS, AND ALGORITHMS

被引:508
作者
MONAHAN, GE
机构
关键词
COMPUTER PROGRAMMING - Subroutines;
D O I
10.1287/mnsc.28.1.1
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
This study surveys models and algorithms dealing with partially observable Markov decision processes. A partially observable Markov decision process (POMDP)is a generalization of a Markov decision process which permits uncertainty regarding the state of a Markov process and allows for state information acquisition. A general framework for finite state and action POMDP's is presented. There is also a brief discussion of the development of POMDP's and their relationship with other decision processes. A wide range of models in such areas as quality control, machine maintenance, internal auditing, learning, and optimal stopping are discussed within the POMDP-framework. Algorithms for computing optimal solutions to POMDP's are presented.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 87 条
[41]  
PLATZMAN L, 1977, MIT ESLR723 EL SYST
[42]  
PLATZMAN L, 1978, STATE ESTIMATION PAR
[43]   OPTIMAL INFINITE-HORIZON UNDISCOUNTED CONTROL OF FINITE PROBABILISTIC SYSTEMS [J].
PLATZMAN, LK .
SIAM JOURNAL ON CONTROL AND OPTIMIZATION, 1980, 18 (04) :362-380
[44]  
PLATZMAN LK, 1981, FEASIBLE COMPUTATION
[45]   A SIMPLE MODEL OF SEARCH FOR A MOVING TARGET [J].
POLLOCK, SM .
OPERATIONS RESEARCH, 1970, 18 (05) :883-&
[46]   OPTIMALITY OF STRUCTURED POLICIES IN COUNTABLE STAGE DECISION PROCESSES [J].
PORTEUS, EL .
MANAGEMENT SCIENCE, 1975, 22 (02) :148-157
[47]   INCOMPLETE INFORMATION IN MARKOVIAN DECISION MODELS [J].
RHENIUS, D .
ANNALS OF STATISTICS, 1974, 2 (06) :1327-1334
[48]  
RIEDER U, 1975, ADV APPL PROBABILITY, V7, P720
[49]   MARKOVIAN DETERIORATION WITH UNCERTAIN INFORMATION - MORE GENERAL-MODEL [J].
ROSENFIELD, D .
NAVAL RESEARCH LOGISTICS, 1976, 23 (03) :389-405
[50]   MARKOVIAN DETERIORATION WITH UNCERTAIN INFORMATION [J].
ROSENFIELD, D .
OPERATIONS RESEARCH, 1976, 24 (01) :141-155