Active learning with multiple views

被引:142
作者
Muslea, Ion
Minton, Steven
Knoblock, Craig A.
机构
[1] Language Weaver Inc, Marina Del Rey, CA 90292 USA
[2] Fetch Technol Inc, El Segundo, CA 90245 USA
[3] Univ So Calif, Marina Del Rey, CA 90292 USA
基金
美国国家科学基金会;
关键词
D O I
10.1613/jair.2005
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Active learners alleviate the burden of labeling large amounts of data by detecting and asking the user to label only the most informative examples in the domain. We focus here on active learning for multi-view domains, in which there are several disjoint subsets of features (views), each of which is sufficient to learn the target concept. In this paper we make several contributions. First, we introduce Co-Testing, which is the first approach to multi-view active learning. Second, we extend the multi-view learning framework by also exploiting weak views, which are adequate only for learning a concept that is more general/specific than the target concept. Finally, we empirically show that Co-Testing outperforms existing active learners on a variety of real world domains such as wrapper induction, Web page classification, advertisement removal, and discourse tree parsing.
引用
收藏
页码:203 / 233
页数:31
相关论文
共 84 条
[1]  
ABE N, 1998, P 15 INT C MACH LEAR, P1
[2]  
Abney S, 2002, 40TH ANNUAL MEETING OF THE ASSOCIATION FOR COMPUTATIONAL LINGUISTICS, PROCEEDINGS OF THE CONFERENCE, P360
[3]   TOLERATING NOISY, IRRELEVANT AND NOVEL ATTRIBUTES IN INSTANCE-BASED LEARNING ALGORITHMS [J].
AHA, DW .
INTERNATIONAL JOURNAL OF MAN-MACHINE STUDIES, 1992, 36 (02) :267-287
[4]  
Amoth T. R., 1999, Proceedings of the Twelfth Annual Conference on Computational Learning Theory, P323, DOI 10.1145/307400.307486
[5]  
Amoth T. R., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P175, DOI 10.1145/279943.279980
[6]  
Angluin D., 1988, Machine Learning, V2, P319, DOI 10.1007/BF00116828
[7]   A NOTE ON THE NUMBER OF QUERIES NEEDED TO IDENTIFY REGULAR LANGUAGES [J].
ANGLUIN, D .
INFORMATION AND CONTROL, 1981, 51 (01) :76-87
[8]   RANDOMLY FALLIBLE TEACHERS - LEARNING MONOTONE DNF WITH AN INCOMPLETE MEMBERSHIP ORACLE [J].
ANGLUIN, D ;
SLONIM, DK .
MACHINE LEARNING, 1994, 14 (01) :7-26
[9]   Malicious omissions and errors in answers to membership queries [J].
Angluin, D ;
Krikis, M ;
Sloan, RH ;
Turan, G .
MACHINE LEARNING, 1997, 28 (2-3) :211-255
[10]  
ANGLUIN D, 1994, YALEUDCSTR1020