On-line prediction and conversion strategies

被引:23
作者
CesaBianchi, N
Freund, Y
Helmbold, DP
Warmuth, MK
机构
[1] AT&T BELL LABS, MURRAY HILL, NJ 07974 USA
[2] UNIV CALIF SANTA CRUZ, DEPT COMP SCI, SANTA CRUZ, CA 95064 USA
关键词
on-line learning; conversion strategies; noise robustness; binomial weights; exponential weights; weighted majority algorithm; expert advice; mistake bounds; Ulam's game;
D O I
10.1023/A:1018348209754
中图分类号
TP18 [人工智能理论];
学科分类号
081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];
摘要
We study the problem of deterministically predicting boolean values by combining the boolean predictions of several experts. Previous on-line algorithms for this problem predict with the weighted majority of the experts' predictions. These algorithms give each expert an exponential weight beta(m) where beta is a constant in [0, 1) and m is the number of mistakes made by the expert in the past. We show that it is better to use sums of binomials as weights. In particular, we present a deterministic algorithm using binomial weights that has a better worst case mistake bound than the best deterministic algorithm using exponential weights. The binomial weights naturally arise from a version space argument. We also show how both exponential and binomial weighting schemes can be used to make prediction algorithms robust against noise.
引用
收藏
页码:71 / 110
页数:40
相关论文
共 21 条
[1]
Aarts E., 1989, Wiley-Interscience Series in Discrete Mathematics and Optimization
[2]
Alon N., 1992, PROBABILISTIC METHOD
[3]
Angluin D., 1988, Machine Learning, V2, P319, DOI 10.1007/BF00116828
[4]
Aslam Javed A, 1991, P 23 ANN ACM S THEOR, P486, DOI 10.1145/103418.103469
[5]
Auer P., 1994, Proceedings of the Twenty-Sixth Annual ACM Symposium on the Theory of Computing, P263, DOI 10.1145/195058.195152
[6]
AUER P, IN PRESS MACHINE LEA
[7]
BARDZIN JM, 1972, SOV MATH DOKL, V13, P1224
[8]
BERLEKAMP ER, 1968, ERROR CORRECTING COD
[9]
Worst-case quadratic loss bounds for prediction using linear functions and gradient descent [J].
CesaBianchi, N ;
Long, PM ;
Warmuth, MK .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1996, 7 (03) :604-619
[10]
CESABIANCHI N, 1995, IN PRESS J ACM