Two faces of active learning

被引:160
作者
Dasgupta, Sanjoy [1 ]
机构
[1] Univ Calif San Diego, Dept Comp Sci & Engn, San Diego, CA 92103 USA
关键词
Active learning; Learning theory;
D O I
10.1016/j.tcs.2010.12.054
中图分类号
TP301 [理论、方法];
学科分类号
080201 [机械制造及其自动化];
摘要
An active learner has a collection of data points, each with a label that is initially hidden but can be obtained at some cost. Without spending too much, it wishes to find a classifier that will accurately map points to labels. There are two common intuitions about how this learning process should be organized: (i) by choosing query points that shrink the space of candidate classifiers as rapidly as possible; and (ii) by exploiting natural clusters in the (unlabeled) data set. Recent research has yielded learning algorithms for both paradigms that are efficient, work with generic hypothesis classes, and have rigorously characterized labeling requirements. Here we survey these advances by focusing on two representative algorithms and discussing their mathematical properties and empirical performance. (C) 2011 Elsevier B.V. All rights reserved.
引用
收藏
页码:1767 / 1781
页数:15
相关论文
共 20 条
[1]
Angluin D., 2001, Algorithmic Learning Theory. Ed. by, P12, DOI DOI 10.1007/3-540-45583-3_3
[2]
[Anonymous], INT C MACH LEARN
[3]
[Anonymous], C LEARN THEOR
[4]
[Anonymous], 2003, ICML WORKSH CONT LAB
[5]
[Anonymous], INT C MACH LEARN
[6]
BALCAN MF, 2008, P 21 ANN C LEARN THE
[7]
Bousquet O, 2004, LECT NOTES ARTIF INT, V3176, P169
[8]
Minimax bounds for active learning [J].
Castro, Rui M. ;
Nowak, Robert D. .
IEEE TRANSACTIONS ON INFORMATION THEORY, 2008, 54 (05) :2339-2353
[9]
Cesa-Bianchi N., 2004, Advances in neural information processing systems
[10]
IMPROVING GENERALIZATION WITH ACTIVE LEARNING [J].
COHN, D ;
ATLAS, L ;
LADNER, R .
MACHINE LEARNING, 1994, 15 (02) :201-221