A probabilistic active support vector learning algorithm

被引:68
作者
Mitra, P [1 ]
Murthy, CA [1 ]
Pal, SK [1 ]
机构
[1] Indian Stat Inst, Machine Intelligence Unit, Kolkata 700108, W Bengal, India
关键词
data mining; learning theory; query learning; incremental learning; statistical query model; classification;
D O I
10.1109/TPAMI.2004.1262340
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
The paper describes a probabilistic active learning strategy for support vector machine (SVM) design in large data applications. The learning strategy is motivated by the statistical query model. While most existing methods of active SVM learning query for points based on their proximity to the current separating hyperplane, the proposed method queries for a set of points according to a distribution as determined by the current separating hyperplane and a newly defined concept of an adaptive confidence factor. This enables the algorithm to have more robust and efficient learning capabilities. The confidence factor is estimated from local information using the k nearest neighbor principle. The effectiveness of the method is demonstrated on real-life data sets both in terms of generalization performance, query complexity, and training time.
引用
收藏
页码:413 / 418
页数:6
相关论文
共 18 条
[1]  
Blake C.L., 1998, UCI repository of machine learning databases
[2]   A tutorial on Support Vector Machines for pattern recognition [J].
Burges, CJC .
DATA MINING AND KNOWLEDGE DISCOVERY, 1998, 2 (02) :121-167
[3]  
Campbell Colin, 2000, ICML, V20
[4]   Active learning with statistical models [J].
Cohn, DA ;
Ghahramani, Z ;
Jordan, MI .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1996, 4 :129-145
[5]  
Kaufman L, 1999, ADVANCES IN KERNEL METHODS, P147
[6]  
Kearns M., 1993, Proceedings of the Twenty-Fifth Annual ACM Symposium on the Theory of Computing, P392, DOI 10.1145/167088.167200
[7]   INFORMATION-BASED OBJECTIVE FUNCTIONS FOR ACTIVE DATA SELECTION [J].
MACKAY, DJC .
NEURAL COMPUTATION, 1992, 4 (04) :590-604
[8]   DETERMINING THE SHAPE OF A PATTERN CLASS FROM SAMPLED POINTS IN R2 [J].
MANDAL, DP ;
MURTHY, CA ;
PAL, SK .
INTERNATIONAL JOURNAL OF GENERAL SYSTEMS, 1992, 20 (04) :307-339
[9]  
Mitra P, 2000, INT C PATT RECOG, P708, DOI 10.1109/ICPR.2000.906173
[10]  
Platt JC, 1999, ADVANCES IN KERNEL METHODS, P185