Efficient algorithms for online decision problems

被引:366
作者
Kalai, A
Vempala, S
机构
[1] Toyota Technol Inst, Dept Comp Sci, Chicago, IL 60637 USA
[2] MIT, Cambridge, MA 02139 USA
关键词
online algorithms; Hannan's algorithm; optimization; decision theory;
D O I
10.1016/j.jcss.2004.10.016
中图分类号
TP3 [计算技术、计算机技术];
学科分类号
0812 [计算机科学与技术];
摘要
In an online decision problem, one makes a sequence of decisions without knowledge of the future. Each period, one pays a cost based on the decision and observed state. We give a simple approach for doing nearly as well as the best single decision, where the best is chosen with the benefit of hindsight. A natural idea is to follow the leader, i.e. each period choose the decision which has done best so far. We show that by slightly perturbing the totals and then choosing the best decision, the expected performance is nearly as good as the best decision in hindsight. Our approach, which is very much like Hannan's original game-theoretic approach from the 1950s, yields guarantees competitive with the more modern exponential weighting algorithms like Weighted Majority. More importantly, these follow-the-leader style algorithms extend naturally to a large class of structured online problems for which the exponential algorithms are inefficient. (c) 2004 Elsevier Inc. All rights reserved.
引用
收藏
页码:291 / 307
页数:17
相关论文
共 26 条
[2]
Awerbuch B., 2004, Proceedings of the 36th ACM Symposiuim on Theory of Computing (STOC), P45
[3]
Awerbuch B., 2003, P 22 ANN S PRINCIPLE, P360
[4]
Static optimality and dynamic search-optimality in lists and trees [J].
Blum, A ;
Chawla, S ;
Kalai, A .
ALGORITHMICA, 2003, 36 (03) :249-260
[5]
BLUM A, 1997, CMUCS97163
[6]
Cover TM, 1991, Mathematical Finance, V1, P1, DOI [DOI 10.1111/J.1467-9965.1991.TB00002.X, https://doi.org/10.1111/j.1467-9965.1991.tb00002.x]
[7]
Dunagan J, 2001, LECT NOTES COMPUT SC, V2129, P229
[8]
Feige U., 1998, Proceedings of the Thirtieth Annual ACM Symposium on Theory of Computing, P90, DOI 10.1145/276698.276716
[9]
FOSTER DP, 1999, GAME ECON BEHAV, V29, P1084
[10]
FREUND Y, 1997, P 29 ANN ACM S THEOR, P334