An iterative algorithm for extending learners to a semi-supervised setting

被引:30
作者
Culp, Mark [1 ]
Michailidis, George [2 ]
机构
[1] W Virginia Univ, Dept Stat, Morgantown, WV 26506 USA
[2] Univ Michigan, Dept Stat, Ann Arbor, MI 48109 USA
关键词
convergence; iterative algorithm; linear smoothers; semi-supervised learning;
D O I
10.1198/106186008X344748
中图分类号
O21 [概率论与数理统计]; C8 [统计学];
学科分类号
020208 ; 070103 ; 0714 ;
摘要
In this article, we present an iterative self-training algorithm whose objective is to extend learners from a supervised setting into a semi-supervised setting. The algorithm is based on using the predicted values for observations where the response is missing (unlabeled data) and then incorporating the predictions appropriately at subsequent stages. Convergence properties of the algorithm are investigated for particular learners, such as linear/logistic regression and linear smoothers with particular emphasis on kernel smoothers. Further, implementation issues of the algorithm with other learners such as generalized additive models, tree partitioning methods, partial least squares, etc. are also addressed. The connection between the proposed algorithm and graph-based semi-supervised learning methods is also discussed. The algorithm is illustrated on a number of real datasets using a varying degree of labeled responses.
引用
收藏
页码:545 / 571
页数:27
相关论文
共 30 条
[1]  
ABNEY S, 2004, COMPUTATIONAL LINGUI, V3, P365
[2]  
[Anonymous], SEMI SUPERVISED LEAR
[3]  
BALUJA S, 1999, NIPS, P854
[4]  
Blum A., 1998, Proceedings of the Eleventh Annual Conference on Computational Learning Theory, P92, DOI 10.1145/279943.279962
[5]  
Blum A., 2001, P 18 INT C MACH LEAR, P19, DOI DOI 10.1184/R1/6606860.V1
[6]   LINEAR SMOOTHERS AND ADDITIVE-MODELS - REJOINDER [J].
BUJA, A ;
HASTIE, T ;
TIBSHIRANI, R .
ANNALS OF STATISTICS, 1989, 17 (02) :543-555
[7]   The relative value of labeled and unlabeled samples in pattern recognition with an unknown mixing parameter [J].
Castelli, V ;
Cover, TM .
IEEE TRANSACTIONS ON INFORMATION THEORY, 1996, 42 (06) :2102-2117
[8]  
Chapelle O., 2006, ICML
[9]  
Chapelle O., 2006, SEMISUPERVISED LEARN, P1
[10]  
CLEVELAND J, 1983, GRAPHICAL METHODS DA