Active learning with multiple views

被引:142
作者
Muslea, Ion
Minton, Steven
Knoblock, Craig A.
机构
[1] Language Weaver Inc, Marina Del Rey, CA 90292 USA
[2] Fetch Technol Inc, El Segundo, CA 90245 USA
[3] Univ So Calif, Marina Del Rey, CA 90292 USA
基金
美国国家科学基金会;
关键词
D O I
10.1613/jair.2005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Active learners alleviate the burden of labeling large amounts of data by detecting and asking the user to label only the most informative examples in the domain. We focus here on active learning for multi-view domains, in which there are several disjoint subsets of features (views), each of which is sufficient to learn the target concept. In this paper we make several contributions. First, we introduce Co-Testing, which is the first approach to multi-view active learning. Second, we extend the multi-view learning framework by also exploiting weak views, which are adequate only for learning a concept that is more general/specific than the target concept. Finally, we empirically show that Co-Testing outperforms existing active learners on a variety of real world domains such as wrapper induction, Web page classification, advertisement removal, and discourse tree parsing.
引用
收藏
页码:203 / 233
页数:31
相关论文
共 84 条
[11]  
ANGLUIN D, 1994, YALEUDCSTR1019
[12]  
[Anonymous], P 17 INT C MACH LEAR
[13]  
[Anonymous], 1994, SIGIR
[14]  
[Anonymous], 2000, NIPS
[15]   Committee-based sample selection for probabilistic classifiers [J].
Argamon-Engelson, S ;
Dagan, I .
JOURNAL OF ARTIFICIAL INTELLIGENCE RESEARCH, 1999, 11 :335-360
[16]   NEURAL NET ALGORITHMS THAT LEARN IN POLYNOMIAL-TIME FROM EXAMPLES AND QUERIES [J].
BAUM, EB .
IEEE TRANSACTIONS ON NEURAL NETWORKS, 1991, 2 (01) :5-19
[17]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[18]   Learning with unreliable boundary queries [J].
Blum, A ;
Chalasani, P ;
Goldman, SA ;
Slonim, DK .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 1998, 56 (02) :209-222
[19]  
Blum A., 1994, Proceedings of the Twenty-Sixth Annual ACM Symposium on the Theory of Computing, P253, DOI 10.1145/195058.195147
[20]  
Brefeld U, 2004, Proceedings of the 21st International Conference on Machine Learning, P16