Minimax bounds for active learning

被引:104
作者
Castro, Rui M. [1 ]
Nowak, Robert D. [1 ]
机构
[1] Univ Wisconsin, Dept Elect & Comp Engn, Madison, WI 53706 USA
基金
美国国家科学基金会;
关键词
active learning; adaptive sampling; minimax lower bounds; statistical learning theory;
D O I
10.1109/TIT.2008.920189
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
This paper analyzes the potential advantages and theoretical challenges of "active learning" algorithms. Active learning involves sequential sampling procedures that use information gleaned from previous samples in order to focus the sampling and accelerate the learning process relative to "passive learning" algorithms, which are based on nonadaptive (usually random) samples. There are a number of empirical and theoretical results suggesting that in certain situations active learning can be significantly more effective than passive learning. However, the fact that active learning algorithms are feedback systems makes their theoretical analysis very challenging. This paper aims to shed light on achievable limits in active learning. Using minimax analysis techniques, we study the achievable rates of classification error convergence for broad classes of distributions characterized by decision boundary regularity and noise conditions. The results clearly indicate the conditions under which one can expect significant gains through active learning. Furthermore, we show that the learning rates derived are tight for "boundary fragment" classes in d-dimensional feature spaces when the feature marginal density is bounded from above and below.
引用
收藏
页码:2339 / 2353
页数:15
相关论文
共 31 条
[1]  
[Anonymous], 2004, Advances in Neural Information Processing Systems
[2]  
[Anonymous], 2006, P 23 INT C MACHINE L
[3]  
[Anonymous], 2004, MATH APPL
[4]   NEURAL NET ALGORITHMS THAT LEARN IN POLYNOMIAL-TIME FROM EXAMPLES AND QUERIES [J].
BAUM, EB .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1991, 2 (01) :5-19
[5]   Hierarchical testing designs for pattern recognition [J].
Blanchard, G ;
Geman, D .
ANNALS OF STATISTICS, 2005, 33 (03) :1155-1202
[6]  
BRYAN B, 2005, ADV NEURAL INFORM PR
[7]  
Burnashev M. V., 1974, Problems of Information Transmission, V10, P223
[8]  
CASTRO R, 2006, P 44 ANN ALL C COMM, P225
[9]  
Castro Rui, 2005, ADV NEURAL INFORM PR
[10]   Nonparametric estimation of regression level sets [J].
Cavalier, L .
STATISTICS, 1997, 29 (02) :131-160