Optimal stopping of Markov processes: Hilbert space theory, approximation algorithms, and an application to pricing high-dimensional financial derivatives

被引:178
作者
Tsitsiklis, JN [1 ]
Van Roy, B [1 ]
机构
[1] MIT, Informat & Decis Syst Lab, Cambridge, MA 02139 USA
基金
美国国家科学基金会;
关键词
complex systems; curse of dimensionality; dynamic programming; function approximation; optimal stopping; stochastic approximation;
D O I
10.1109/9.793723
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
The authors develop a theory characterizing optimal stopping times for discrete-time ergodic Markov processes with discounted rewards. The theory differs from prior work by its view of per-stage and terminal reward functions as elements of a certain Hilbert space. In addition to a streamlined analysis establishing existence and uniqueness of a solution to Bellman's equation, this approach provides an elegant framework for the study of approximate solutions. In particular, the authors propose a stochastic approximation algorithm that tunes weights of a linear combination of basis functions in order to approximate a value function. They prove that this algorithm converges (almost surely) and that the limit of convergence has some desirable properties. The utility of the approximation method is illustrated via a computational case study involving the pricing of a path-dependent financial derivative security that gives rise to an optimal stopping problem with a 100-dimensional state space.
引用
收藏
页码:1840 / 1851
页数:12
相关论文
共 22 条
[1]  
[Anonymous], 1998, THESIS MIT
[2]  
Baird L., 1995, MACHINE LEARNING
[3]  
BARRAQUAND J, IN PRESS NUMERICAL V
[4]  
Benveniste A, 1990, Adaptive algorithms and stochastic approximations
[5]  
Bertsekas D., 1996, NEURO DYNAMIC PROGRA, V1st
[6]  
Bertsekas D. P., 1995, Dynamic Programming and Optimal Control
[7]  
BERTSEKAS DP, 1986, STOCHASTIC OPTIMAL C
[8]   PRICING OF OPTIONS AND CORPORATE LIABILITIES [J].
BLACK, F ;
SCHOLES, M .
JOURNAL OF POLITICAL ECONOMY, 1973, 81 (03) :637-654
[9]   Pricing American-style securities using simulation [J].
Broadie, M ;
Glasserman, P .
JOURNAL OF ECONOMIC DYNAMICS & CONTROL, 1997, 21 (8-9) :1323-1352
[10]  
Harrison J. M., 1981, Stochastic Processes & their Applications, V11, P215, DOI 10.1016/0304-4149(81)90026-0