Neural network and regression spline value function approximations for stochastic dynamic programming

被引:36
作者
Cervellera, Cristiano
Wen, Aihong
Chen, Victoria C. P.
机构
[1] Univ Texas, Dept Ind & Mfg Syst Engn, Arlington, TX 76019 USA
[2] CNR, Inst Intelligent Syst Automat, ISSIA, I-16149 Genoa, Italy
关键词
design of experiments; statistical modeling; Markov decision process; orthogonal array; Latin hypercube; inventory forecasting; water reservoir management; ORTHOGONAL ARRAYS; REAL APPLICATIONS; MODELS; PREDICTION; SCHEMES; DESIGN; MARS;
D O I
10.1016/j.cor.2005.02.043
中图分类号
TP39 [计算机的应用];
学科分类号
080201 [机械制造及其自动化];
摘要
Dynamic programming is a multi-stage optimization method that is applicable to many problems in engineering. A statistical perspective of value function approximation in high-dimensional, continuous-state stochastic dynamic programming (SDP) was first presented using orthogonal array (OA) experimental designs and multivariate adaptive regression splines (MARS). Given the popularity of artificial neural networks (ANNs) for high-dimensional modeling in engineering, this paper presents an implementation of ANNs as an alternative to MARS. Comparisons consider the differences in methodological objectives, computational complexity, model accuracy, and numerical SDP solutions. Two applications are presented: a nine-dimensional inventory forecasting problem and an eight-dimensional water reservoir problem. Both OAs and OA-based Latin hypercube experimental designs are explored, and OA space-filling quality is considered. (c) 2005 Elsevier Ltd. All rights reserved.
引用
收藏
页码:70 / 90
页数:21
相关论文
共 65 条
[1]
Abdelaziz AY, 1997, ENG INTELL SYST ELEC, V5, P35
[2]
Constructing meta-models for computer experiments [J].
Allen, TT ;
Bernshteyn, MA ;
Kabiri-Bamoradian, K .
JOURNAL OF QUALITY TECHNOLOGY, 2003, 35 (03) :264-274
[3]
[Anonymous], 1998, Learning from data-concepts, theory and methods
[4]
[Anonymous], 2001, International Journal of Reliability and Applications
[5]
An aggregate stochastic dynamic programming model of multireservoir systems [J].
Archibald, TW ;
McKinnon, KIM ;
Thomas, LC .
WATER RESOURCES RESEARCH, 1997, 33 (02) :333-340
[6]
BAGLIETTO M, 2001, P IFAC WORKSH AD LEA
[7]
BAGLIETTO M, 2005, IN PRESS TOPICS PART
[8]
UNIVERSAL APPROXIMATION BOUNDS FOR SUPERPOSITIONS OF A SIGMOIDAL FUNCTION [J].
BARRON, AR .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1993, 39 (03) :930-945
[9]
BARRON AR, 1992, P 29 S INT, P192
[10]
DYNAMIC PROGRAMMING [J].
BELLMAN, R .
SCIENCE, 1966, 153 (3731) :34-&