STATE-OF-THE-ART - A SURVEY OF PARTIALLY OBSERVABLE MARKOV DECISION-PROCESSES - THEORY, MODELS, AND ALGORITHMS

被引:508
作者
MONAHAN, GE
机构
关键词
COMPUTER PROGRAMMING - Subroutines;
D O I
10.1287/mnsc.28.1.1
中图分类号
C93 [管理学];
学科分类号
12 ; 1201 ; 1202 ; 120202 ;
摘要
This study surveys models and algorithms dealing with partially observable Markov decision processes. A partially observable Markov decision process (POMDP)is a generalization of a Markov decision process which permits uncertainty regarding the state of a Markov process and allows for state information acquisition. A general framework for finite state and action POMDP's is presented. There is also a brief discussion of the development of POMDP's and their relationship with other decision processes. A wide range of models in such areas as quality control, machine maintenance, internal auditing, learning, and optimal stopping are discussed within the POMDP-framework. Algorithms for computing optimal solutions to POMDP's are presented.
引用
收藏
页码:1 / 16
页数:16
相关论文
共 87 条
[31]   INSPECTION - MAINTENANCE - REPLACEMENT SCHEDULES UNDER MARKOVIAN DETERIORATION [J].
KLEIN, M .
MANAGEMENT SCIENCE, 1962, 9 (01) :25-32
[32]  
KLEINROCK L, 1975, IEEE T COMMUN, VCO23, P410, DOI 10.1109/TCOM.1975.1092814
[33]   PACKET SWITCHING IN A MULTIACCESS BROADCAST CHANNEL - DYNAMIC CONTROL PROCEDURES [J].
LAM, SS ;
KLEINROCK, L .
IEEE TRANSACTIONS ON COMMUNICATIONS, 1975, 23 (09) :891-904
[34]  
MILLER BL, 1979, UCLA288 WEST MAN SCI
[35]   OPTIMAL STOPPING IN A PARTIALLY OBSERVABLE MARKOV PROCESS WITH COSTLY INFORMATION [J].
MONAHAN, GE .
OPERATIONS RESEARCH, 1980, 28 (06) :1319-1334
[36]  
MONAHAN GE, 1982, UNPUB J APPL PROBABI, V19
[37]  
Nahmias S., 1975, Cahiers du Centre d'Etudes de Recherche Operationelle, V17, P53
[38]  
Paz A., 1971, INTRO PROBABILISTIC
[39]   SURVEY OF MAINTENANCE MODELS - CONTROL AND SURVEILLANCE OF DETERIORATING SYSTEMS [J].
PIERSKALLA, WP ;
VOELKER, JA .
NAVAL RESEARCH LOGISTICS, 1976, 23 (03) :353-388
[40]  
PLATZMAN L, THESIS MIT