A KNOWLEDGE-GRADIENT POLICY FOR SEQUENTIAL INFORMATION COLLECTION

被引：304

作者：

Frazier, Peter I. ^{[1
]}

Powell, Warren B. ^{[1
]}

Dayanik, Savas ^{[1
]}

机构：

[1] Princeton Univ, Dept Operat Res & Financial Engn, Princeton, NJ 08544 USA

来源：

SIAM JOURNAL ON CONTROL AND OPTIMIZATION | 2008年 / 47卷 / 05期

关键词：

ranking and selection; Bayesian statistics; sequential decision analysis;

D O I：

10.1137/070693424

中图分类号：

TP [自动化技术、计算机技术];

学科分类号：

0812 ;

摘要：

In a sequential Bayesian ranking and selection problem with independent normal populations and common known variance, we study a previously introduced measurement policy which we refer to as the knowledge-gradient policy. This policy myopically maximizes the expected increment in the value of information in each time period, where the value is measured according to the terminal utility function. We show that the knowledge-gradient policy is optimal both when the horizon is a single time period and in the limit as the horizon extends to infinity. We show furthermore that, in some special cases, the knowledge-gradient policy is optimal regardless of the length of any given fixed total sampling horizon. We bound the knowledge-gradient policy's suboptimality in the remaining cases, and show through simulations that it performs competitively with or significantly better than other policies.

引用

页码：2410 / 2439

页数：30

共 28 条

[1]

[Anonymous], 2007, Simulation-based algorithms for Markov decision processes

[2]

Bechhofer R. E., 1995, Design and analysis of experiments for statistical selection, screening, and multiple comparisons

[3] New developments in ranking and selection: An empirical comparison of the three main approaches [J].

Branke, J ;

Chick, SE ;

Schmidt, C .

PROCEEDINGS OF THE 2005 WINTER SIMULATION CONFERENCE, VOLS 1-4, 2005, :708-717

[4]

Chen CH, 1995, PROCEEDINGS OF THE 34TH IEEE CONFERENCE ON DECISION AND CONTROL, VOLS 1-4, P2598, DOI 10.1109/CDC.1995.478499

[5] Simulation budget allocation for further enhancing the efficiency of ordinal optimization [J].

Chen, CH ;

Lin, JW ;

Yücesan, E ;

Chick, SE .

DISCRETE EVENT DYNAMIC SYSTEMS-THEORY AND APPLICATIONS, 2000, 10 (03) :251-270

[6] Optimal computing budget allocation for Monte Carlo simulation with application to product design [J].

Chen, CH ;

Donohue, K ;

Yücesan, E ;

Lin, JW .

SIMULATION MODELLING PRACTICE AND THEORY, 2003, 11 (01) :57-74

[7]

CHEN CH, 1996, P 1996 WINT SIM C, P398

[8] Computing efforts allocation for ordinal optimization and discrete event simulation [J].

Chen, HC ;

Chen, CH ;

Yücesan, E .

IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2000, 45 (05) :960-964

[9] Reusing analogous components [J].

Cheng, BHC ;

Jeng, JJ .

IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 1997, 9 (02) :341-349

[10]

CHICK S, INFORMS J COMP UNPUB

← 1 2 3 →