Adaptive and self-confident on-line learning algorithms

被引：127

作者：

Auer, P

Cesa-Bianchi, N

Gentile, C

机构：

[1] Graz Univ Technol, Inst Theoret Comp Sci, A-8010 Graz, Austria

[2] Univ Milan, Dept Informat Technol, I-26013 Crema, Italy

[3] Univ Milan, Dept Informat Sci, I-20135 Milan, Italy

来源：

JOURNAL OF COMPUTER AND SYSTEM SCIENCES | 2002年 / 64卷 / 01期

关键词：

on-line prediction; linear regression; quasi-additive algorithms; learning rate;

D O I：

10.1006/jcss.2001.1795

中图分类号：

TP3 [计算技术、计算机技术];

学科分类号：

0812 ;

摘要：

We study on-line learning in the linear regression framework. Most oh the performance bounds for on-line algorithms in this framework assume a constant learning rate. To achieve these bounds the learning rate trust be optimized based on a posteriori information. This information depends on the whole sequence of examples and thus it is not available to any strictly on-line algorithm. We introduce new techniques for adaptively tuning the learning rate as the data sequence is progressively revealed. Our techniques allow us to prove essentially the same bounds as if we knew the optimal learning rate in advance. Moreover, such techniques apply to a wide class of on-line algorithms, including p-norm algorithms for generalized linear regression and Weighted Majority for linear regression with absolute loss. Our adaptive tunings are radically different from previous techniques, such as the so-called doubling trick. Whereas the doubling trick restarts the on-line algorithm several times using a constant learning rate for each run, our methods save information by changing the value of the learning rate very smoothly. In fact, for Weighted Majority over a finite set of experts our analysis provides a better leading constant than the doubling trick. (C) 2002 Elsevier science (USA).

引用

页码：48 / 75

页数：28

共 37 条

[1]

Angluin D., 1988, Machine Learning, V2, P319, DOI 10.1023/A:1022821128753

[2]

[Anonymous], S MATH THEORY AUTOMA

[3] Tracking the best disjunction [J].

Auer, P ;

Warmuth, MK .

MACHINE LEARNING, 1998, 32 (02) :127-150

[4]

AZOURY K, IN PRESS MACH LEARNI

[5] PERCEPTRON - A MODEL FOR BRAIN FUNCTIONING .1. [J].

BLOCK, HD .

REVIEWS OF MODERN PHYSICS, 1962, 34 (01) :123-&

[6]

Bregman LM, 1967, USSR Computational Mathematics and Mathematical Physics, V7, P200

[7]

BYLANDER T, 1997, P 8 ANN C COMP LEARN

[8] AN ITERATIVE ROW-ACTION METHOD FOR INTERVAL CONVEX-PROGRAMMING [J].

CENSOR, Y ;

LENT, A .

JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 1981, 34 (03) :321-353

[9]

CENSOR Y, 1997, PARALLEL OPTIZIMATIO

[10] Analysis of two gradient-based algorithms for on-line regression [J].

Cesa-Bianchi, N .

JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1999, 59 (03) :392-411

← 1 2 3 4 →