Confidence-based active learning

被引:172
作者
Li, Mingkun
Sethi, Ishwar K.
机构
[1] Lawrence Berkeley Natl Lab, DOE Joint Genome Inst, Walnut Creek, CA 94598 USA
[2] Oakland Univ, Dept Comp Sci & Engn, Rochester, MI 48309 USA
关键词
active learning; error estimation; pattern classification; support vector machines;
D O I
10.1109/TPAMI.2006.156
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
This paper proposes a new active learning approach, confidence- based active learning, for training a wide range of classifiers. This approach is based on identifying and annotating uncertain samples. The uncertainty value of each sample is measured by its conditional error. The approach takes advantage of current classifiers' probability preserving and ordering properties. It calibrates the output scores of classifiers to conditional error. Thus, it can estimate the uncertainty value for each input sample according to its output score from a classifier and select only samples with uncertainty value above a user- defined threshold. Even though we cannot guarantee the optimality of the proposed approach, we find it to provide good performance. Compared with existing methods, this approach is robust without additional computational effort. A new active learning method for support vector machines ( SVMs) is implemented following this approach. A dynamic bin width allocation method is proposed to accurately estimate sample conditional error and this method adapts to the underlying probabilities. The effectiveness of the proposed approach is demonstrated using synthetic and real data sets and its performance is compared with the widely used least certain active learning method.
引用
收藏
页码:1251 / 1261
页数:11
相关论文
共 40 条
[1]  
[Anonymous], 1999, The Nature Statist. Learn. Theory
[2]  
[Anonymous], P 17 INT C MACH LEAR
[3]  
[Anonymous], 2002, P 8 INT C KNOWL DISC
[4]  
[Anonymous], ADV LARGE MARGIN CLA
[5]  
Bain L., 1991, Introduction to probability and mathematical statistics
[6]  
Baram Y, 2004, J MACH LEARN RES, V5, P255
[7]  
Bazaraa M. S., 2013, NONLINEAR PROGRAMMIN
[8]  
Blake C.L., 1998, UCI repository of machine learning databases
[9]  
Blum A, 1998, LECT NOTES COMPUT SC, V1442, P306, DOI 10.1007/BFb0029575
[10]  
Brinker K., 2003, Machine Learning, Proceedings of the Twentieth International Conference (ICML 2003), August 21-24, 2003, P59