STATE-OF-THE-ART - A SURVEY OF PARTIALLY OBSERVABLE MARKOV DECISION-PROCESSES - THEORY, MODELS, AND ALGORITHMS

被引：508

作者：

MONAHAN, GE

机构：

来源：

MANAGEMENT SCIENCE | 1982年 / 28卷 / 01期

关键词：

COMPUTER PROGRAMMING - Subroutines;

D O I：

10.1287/mnsc.28.1.1

中图分类号：

C93 [管理学];

学科分类号：

12 ; 1201 ; 1202 ; 120202 ;

摘要：

This study surveys models and algorithms dealing with partially observable Markov decision processes. A partially observable Markov decision process (POMDP)is a generalization of a Markov decision process which permits uncertainty regarding the state of a Markov process and allows for state information acquisition. A general framework for finite state and action POMDP's is presented. There is also a brief discussion of the development of POMDP's and their relationship with other decision processes. A wide range of models in such areas as quality control, machine maintenance, internal auditing, learning, and optimal stopping are discussed within the POMDP-framework. Algorithms for computing optimal solutions to POMDP's are presented.

引用

页码：1 / 16

页数：16

共 87 条

[1] STRUCTURAL RESULTS FOR PARTIALLY OBSERVABLE MARKOV DECISION-PROCESSES [J].