Multitask learning

被引:4365
作者
Caruana, R
机构
[1] School of Computer Science, Carnegie Mellon University, Pittsburgh
基金
美国国家科学基金会;
关键词
inductive transfer; parallel transfer; multitask learning; backpropagation; k-nearest neighbor; kernel regression; supervised learning; generalization;
D O I
10.1023/A:1007379606734
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Multitask Learning is an approach to inductive transfer that improves generalization by using the domain information contained in the training signals of related tasks as an inductive bias. It does this by learning tasks in parallel while using a shared representation; what is learned For each task can help other tasks be learned better. This paper reviews prior work on MTL, presents new evidence that MTL in backprop nets discovers task relatedness without the need of supervisory signals, and presents new results for MTL with k-nearest neighbor and kernel regression. In this paper we demonstrate multitask learning in three domains. We explain how multitask learning works, and show that there are many opportunities for multitask learning in real domains. We present an algorithm and results for multitask learning with case-based methods like k-nearest neighbor and kernel regression, and sketch an algorithm for multitask learning in decision trees. Because multitask learning works, can be applied to many different kinds of domains, and can be used with different learning algorithms, we conjecture there will be many opportunities for its use on real-world problems.
引用
收藏
页码:41 / 75
页数:35
相关论文
共 62 条
  • [1] Abu-Mostafa Y. S., 1990, Journal of Complexity, V6, P192, DOI 10.1016/0885-064X(90)90006-Y
  • [2] HINTS
    ABUMOSTAFA, YS
    [J]. NEURAL COMPUTATION, 1995, 7 (04) : 639 - 671
  • [3] ABUMOSTAFA YS, 1993, NEURAL COMPUTATION, V5
  • [4] [Anonymous], P 10 INT C MACH LEAR
  • [5] [Anonymous], THESIS CARNEGIE MELL
  • [6] [Anonymous], [No title captured]
  • [7] [Anonymous], 1986, JHUEECS8601
  • [8] [Anonymous], 1996, EXPLANATION BASED NE
  • [9] Baluja S, 1995, INT JOINT CONF ARTIF, P133
  • [10] BAXTER J, 1995, P 8 ACM C COMP LEARN