Active learning with statistical models

Cited by: 1023
Authors
Cohn, DA
Ghahramani, Z
Jordan, MI
Affiliation
[1] Center for Biological and Computational Learning, Department of Brain and Cognitive Sciences, Massachusetts Institute of Technology, Cambridge, MA
DOI
10.1613/jair.295
CLC number
TP18 [Theory of artificial intelligence];
Subject classification codes
081104; 0812; 0835; 1405
Abstract
For many types of machine learning algorithms, one can compute the statistically "optimal" way to select training data. In this paper, we review how optimal data selection techniques have been used with feedforward neural networks. We then show how the same principles may be used to select data for two alternative, statistically based learning architectures: mixtures of Gaussians and locally weighted regression. While the techniques for neural networks are computationally expensive and approximate, the techniques for mixtures of Gaussians and locally weighted regression are both efficient and accurate. Empirically, we observe that the optimality criterion sharply decreases the number of training examples the learner needs in order to achieve good performance.
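
The data-selection idea summarized in the abstract can be illustrated with a small sketch. The toy example below is not taken from the paper: where Cohn, Ghahramani, and Jordan select queries that minimize the learner's expected (integrated) predictive variance, this simplified stand-in queries the candidate input where the current predictive variance of a locally weighted regression fit is largest. The names `target` and `lwr_predict`, the kernel bandwidth, and the variance formula are illustrative assumptions.

```python
# Minimal, hypothetical sketch of variance-guided query selection with locally
# weighted regression on a synthetic 1-D problem. Not the paper's exact
# procedure: it queries where predictive variance is highest rather than
# minimizing expected integrated variance.
import numpy as np

rng = np.random.default_rng(0)

def target(x):
    # Hypothetical noisy 1-D function standing in for the unknown process.
    return np.sin(3 * x) + 0.1 * rng.normal(size=np.shape(x))

def lwr_predict(x_query, X, y, h=0.3):
    """Locally weighted linear regression at x_query with a Gaussian kernel of
    bandwidth h; returns (prediction, rough predictive-variance estimate)."""
    w = np.exp(-0.5 * ((X - x_query) / h) ** 2)            # kernel weights
    W = np.sum(w)
    mu_x, mu_y = np.sum(w * X) / W, np.sum(w * y) / W      # weighted means
    var_x = np.sum(w * (X - mu_x) ** 2) / W + 1e-12
    cov_xy = np.sum(w * (X - mu_x) * (y - mu_y)) / W
    beta = cov_xy / var_x                                   # local slope
    y_hat = mu_y + beta * (x_query - mu_x)
    resid = y - (mu_y + beta * (X - mu_x))
    noise = np.sum(w * resid ** 2) / W                      # local noise level
    # Variance of the local fit, shrinking with the effective weight mass near x_query.
    pred_var = noise * (1.0 / W + (x_query - mu_x) ** 2 / (W * var_x))
    return y_hat, pred_var

# Seed with a handful of random labelled points, then query actively.
X = rng.uniform(-2.0, 2.0, size=5)
y = target(X)
candidates = np.linspace(-2.0, 2.0, 201)

for _ in range(20):
    variances = np.array([lwr_predict(c, X, y)[1] for c in candidates])
    x_new = candidates[np.argmax(variances)]   # most uncertain candidate
    X = np.append(X, x_new)                    # ask the oracle to label it
    y = np.append(y, target(x_new))

worst = max(lwr_predict(c, X, y)[1] for c in candidates)
print(f"labelled {len(X)} points; remaining max predictive variance = {worst:.4f}")
```

The same loop structure would apply to a mixture-of-Gaussians learner: only the scoring function changes, with the local-regression variance estimate replaced by that model's predictive variance at each candidate input.
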
Pages: 129-145
Number of pages: 17