On-line prediction and conversion strategies

被引：23

作者：

CesaBianchi, N

Freund, Y

Helmbold, DP

Warmuth, MK

机构：

[1] AT&T BELL LABS, MURRAY HILL, NJ 07974 USA

[2] UNIV CALIF SANTA CRUZ, DEPT COMP SCI, SANTA CRUZ, CA 95064 USA

来源：

MACHINE LEARNING | 1996年 / 25卷 / 01期

关键词：

on-line learning; conversion strategies; noise robustness; binomial weights; exponential weights; weighted majority algorithm; expert advice; mistake bounds; Ulam's game;

D O I：

10.1023/A:1018348209754

中图分类号：

TP18 [人工智能理论];

学科分类号：

081104 [模式识别与智能系统]; 0812 [计算机科学与技术]; 0835 [软件工程]; 1405 [智能科学与技术];

摘要：

We study the problem of deterministically predicting boolean values by combining the boolean predictions of several experts. Previous on-line algorithms for this problem predict with the weighted majority of the experts' predictions. These algorithms give each expert an exponential weight beta(m) where beta is a constant in [0, 1) and m is the number of mistakes made by the expert in the past. We show that it is better to use sums of binomials as weights. In particular, we present a deterministic algorithm using binomial weights that has a better worst case mistake bound than the best deterministic algorithm using exponential weights. The binomial weights naturally arise from a version space argument. We also show how both exponential and binomial weighting schemes can be used to make prediction algorithms robust against noise.

引用

页码：71 / 110

页数：40

共 21 条

[1]

Aarts E., 1989, Wiley-Interscience Series in Discrete Mathematics and Optimization

[2]

Alon N., 1992, PROBABILISTIC METHOD

[3]

Angluin D., 1988, Machine Learning, V2, P319, DOI 10.1007/BF00116828

[4]

Aslam Javed A, 1991, P 23 ANN ACM S THEOR, P486, DOI 10.1145/103418.103469

[5]

Auer P., 1994, Proceedings of the Twenty-Sixth Annual ACM Symposium on the Theory of Computing, P263, DOI 10.1145/195058.195152

[6]

AUER P, IN PRESS MACHINE LEA

[7]

BARDZIN JM, 1972, SOV MATH DOKL, V13, P1224

[8]

BERLEKAMP ER, 1968, ERROR CORRECTING COD

[9]

Worst-case quadratic loss bounds for prediction using linear functions and gradient descent [J].

CesaBianchi, N ;

Long, PM ;

Warmuth, MK .

IEEE TRANSACTIONS ON NEURAL NETWORKS, 1996, 7 (03) :604-619

[10]

CESABIANCHI N, 1995, IN PRESS J ACM

← 1 2 3 →