Semi-supervised learning with an imperfect supervisor

被引:22
作者
Amini, MR [1 ]
Gallinari, P [1 ]
机构
[1] Univ Paris 06, Dept Comp Sci, F-75015 Paris, France
关键词
semi-supervised learnin; imperfect supervision; Classification Expectation Maximisation; Classification Maximum Likelihood;
D O I
10.1007/s10115-005-0219-4
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Real-life applications may involve huge data sets with misclassified or partially classified training data. Semi-supervised learning and learning in the presence of label noise have recently emerged as new paradigms in the machine learning community to cope with this kind of problems. This paper describes a new discriminant algorithm for semi-supervised learning. This algorithm optimizes the classification maximum likelihood (CML) of a set of labeled-unlabeled data, using a discriminant extension of the Classification Expectation Maximization algorithm. We further propose to extend this algorithm by modeling imperfections in the estimated class labels for unlabeled data. The parameters of this label-error model are learned together with the semi-supervised classifier parameters. We demonstrate the effectiveness of the approach using extensive experiments on different datasets.
引用
收藏
页码:385 / 413
页数:29
相关论文
共 48 条
  • [1] AITCHISON J, 1976, BIOMETRIKA, V63, P1
  • [2] AMBROISE C, 2000, P 7 INT FED CLASS SO, P161
  • [3] Amini M.-R., 2002, Proceedings of SIGIR 2002. Twenty-Fifth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, P105
  • [4] Amini M.-R., 2003, IJCAI 03 P 18 INT JO, P555
  • [5] Anderson J.A., 1982, Handbook of Statistics, V2, P169
  • [6] ANDERSON JA, 1979, BIOMETRIKA, V66, P17, DOI 10.1093/biomet/66.1.17
  • [7] [Anonymous], P 18 INT C MACH LEAR
  • [8] [Anonymous], P JOINT SIGDAT C EMP
  • [9] [Anonymous], 2000, LEARNING LABELED UNL
  • [10] ATCHISON J, 1986, STAT ANAL COMPOSITIO