A theory of transfer learning with applications to active learning

被引:108
作者
Yang, Liu [1 ]
Hanneke, Steve [2 ]
Carbonell, Jaime [3 ]
机构
[1] Carnegie Mellon Univ, Machine Learning Dept, Pittsburgh, PA 15213 USA
[2] Carnegie Mellon Univ, Dept Stat, Pittsburgh, PA 15213 USA
[3] Carnegie Mellon Univ, Language Technol Inst, Pittsburgh, PA 15213 USA
基金
美国国家科学基金会;
关键词
Transfer learning; Multi-task learning; Active learning; Statistical learning theory; Bayesian learning; Sample complexity; MULTIPLE TASKS; CONVERGENCE; RATES; MODEL;
D O I
10.1007/s10994-012-5310-y
中图分类号
TP18 [人工智能理论];
学科分类号
140502 [人工智能];
摘要
We explore a transfer learning setting, in which a finite sequence of target concepts are sampled independently with an unknown distribution from a known family. We study the total number of labeled examples required to learn all targets to an arbitrary specified expected accuracy, focusing on the asymptotics in the number of tasks and the desired accuracy. Our primary interest is formally understanding the fundamental benefits of transfer learning, compared to learning each target independently from the others. Our approach to the transfer problem is general, in the sense that it can be used with a variety of learning protocols. As a particularly interesting application, we study in detail the benefits of transfer for self-verifying active learning; in this setting, we find that the number of labeled examples required for learning with transfer is often significantly smaller than that required for learning each target independently.
引用
收藏
页码:161 / 189
页数:29
相关论文
共 48 条
[1]
Ando RK, 2005, J MACH LEARN RES, V6, P1817
[2]
[Anonymous], ACM SIGKDD C KNOWL D
[3]
[Anonymous], 2004, P ADV NEUR INF PROC
[4]
[Anonymous], P 46 ANN ALL C COMM
[5]
[Anonymous], P 17 INT C ALG LEARN
[6]
[Anonymous], 1996, Advances in Neural Information Processing Systems
[7]
[Anonymous], P 22 C LEARN THEOR
[8]
[Anonymous], 2007, P 20 C LEARN THEOR
[9]
The true sample complexity of active learning [J].
Balcan, Maria-Florina ;
Hanneke, Steve ;
Vaughan, Jennifer Wortman .
MACHINE LEARNING, 2010, 80 (2-3) :111-139
[10]
Agnostic active learning [J].
Balcan, Maria-Florina ;
Beygelzimer, Alina ;
Langford, John .
JOURNAL OF COMPUTER AND SYSTEM SCIENCES, 2009, 75 (01) :78-89