Optimal stopping of Markov processes: Hilbert space theory, approximation algorithms, and an application to pricing high-dimensional financial derivatives

Cited by: 178

Authors:

Tsitsiklis, JN [1]

Van Roy, B [1]

Affiliations:

[1] MIT, Informat & Decis Syst Lab, Cambridge, MA 02139 USA

Source:

IEEE TRANSACTIONS ON AUTOMATIC CONTROL | 1999 / Vol. 44 / No. 10

Funding:

U.S. National Science Foundation;

Keywords:

complex systems; curse of dimensionality; dynamic programming; function approximation; optimal stopping; stochastic approximation;

DOI:

10.1109/9.793723

Chinese Library Classification number:

TP [Automation technology, computer technology];

Discipline classification code:

0812 ;

Abstract:

The authors develop a theory characterizing optimal stopping times for discrete-time ergodic Markov processes with discounted rewards. The theory differs from prior work by its view of per-stage and terminal reward functions as elements of a certain Hilbert space. In addition to a streamlined analysis establishing existence and uniqueness of a solution to Bellman's equation, this approach provides an elegant framework for the study of approximate solutions. In particular, the authors propose a stochastic approximation algorithm that tunes weights of a linear combination of basis functions in order to approximate a value function. They prove that this algorithm converges (almost surely) and that the limit of convergence has some desirable properties. The utility of the approximation method is illustrated via a computational case study involving the pricing of a path-dependent financial derivative security that gives rise to an optimal stopping problem with a 100-dimensional state space.
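The kind of weight-tuning scheme the abstract describes can be illustrated with a minimal sketch. This is not the authors' code: the five-state ring chain, the terminal reward G(x) = x, the zero per-stage reward, the discount factor 0.9, the step-size schedule, and every function name are assumptions made for illustration. The weights of a linear combination of basis functions are updated along a single simulated trajectory to approximate the value of continuing. Indicator basis functions are used only so the result can be compared with the exact fixed point on this toy chain.

```python
import random

ALPHA = 0.9      # discount factor (assumed value)
N_STATES = 5     # states 0..4 on a ring (toy chain, not from the paper)


def step(x, rng):
    """Sample the next state: stay w.p. 1/2, else move one step left or right."""
    u = rng.random()
    if u < 0.5:
        return x
    return (x + 1) % N_STATES if u < 0.75 else (x - 1) % N_STATES


def terminal_reward(x):
    """Reward G(x) received on stopping in state x (hypothetical choice)."""
    return float(x)


def features(x):
    """Indicator basis functions; another fixed feature map could be swapped in."""
    return [1.0 if i == x else 0.0 for i in range(N_STATES)]


def dot(a, b):
    return sum(ai * bi for ai, bi in zip(a, b))


def fit_weights(n_steps=100_000, seed=0):
    """Tune weights r so that features(x) . r approximates the continuation value."""
    rng = random.Random(seed)
    r = [0.0] * N_STATES
    x = 0
    for t in range(n_steps):
        x_next = step(x, rng)
        phi = features(x)
        # Discounted value of the better option at the next state: stop or continue.
        target = ALPHA * max(terminal_reward(x_next), dot(features(x_next), r))
        td_error = target - dot(phi, r)
        gamma = 100.0 / (100.0 + t)  # diminishing step size (assumed schedule)
        r = [ri + gamma * td_error * pi for ri, pi in zip(r, phi)]
        x = x_next
    return r


def exact_continuation_values(n_iter=1000):
    """Fixed-point iteration on Q = ALPHA * P max(G, Q), for comparison only."""
    q = [0.0] * N_STATES
    for _ in range(n_iter):
        v = [max(terminal_reward(y), q[y]) for y in range(N_STATES)]
        q = [
            ALPHA * (0.5 * v[x]
                     + 0.25 * v[(x + 1) % N_STATES]
                     + 0.25 * v[(x - 1) % N_STATES])
            for x in range(N_STATES)
        ]
    return q
```

Under these assumptions, a stopping rule follows from the fitted weights: stop in state x when `terminal_reward(x)` is at least `dot(features(x), r)`. With fewer basis functions than states the weights can only approximate the continuation value, and the step sizes would likely need rescaling to suit the magnitude of the features.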


Pages: 1840-1851

Number of pages: 12
