An iterative algorithm for extending learners to a semi-supervised setting

Cited by: 30

Authors:

Culp, Mark [1]

Michailidis, George [2]

Affiliations:

[1] W Virginia Univ, Dept Stat, Morgantown, WV 26506 USA

[2] Univ Michigan, Dept Stat, Ann Arbor, MI 48109 USA

Source:

JOURNAL OF COMPUTATIONAL AND GRAPHICAL STATISTICS | 2008 / Vol. 17 / No. 3

Keywords:

convergence; iterative algorithm; linear smoothers; semi-supervised learning

DOI:

10.1198/106186008X344748

Chinese Library Classification:

O21 [Probability theory and mathematical statistics]; C8 [Statistics]

Discipline classification codes:

020208; 070103; 0714

Abstract:

In this article, we present an iterative self-training algorithm whose objective is to extend learners from a supervised setting into a semi-supervised setting. The algorithm is based on using the predicted values for observations where the response is missing (unlabeled data) and then incorporating the predictions appropriately at subsequent stages. Convergence properties of the algorithm are investigated for particular learners, such as linear/logistic regression and linear smoothers with particular emphasis on kernel smoothers. Further, implementation issues of the algorithm with other learners such as generalized additive models, tree partitioning methods, partial least squares, etc. are also addressed. The connection between the proposed algorithm and graph-based semi-supervised learning methods is also discussed. The algorithm is illustrated on a number of real datasets using a varying degree of labeled responses.
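The self-training idea in the abstract can be illustrated with a minimal sketch. This is not the authors' published implementation: it assumes a Gaussian kernel smoother as the learner, assumes unlabeled responses start at zero, and assumes a simple max-change stopping rule. The names `kernel_smoother_matrix`, `self_train`, `bandwidth`, and `tol` are hypothetical and chosen only for illustration.

```python
import numpy as np


def kernel_smoother_matrix(X, bandwidth=1.0):
    """Row-normalized Gaussian kernel weights, i.e. a linear smoother matrix S."""
    # Pairwise squared Euclidean distances between all rows of X.
    sq_dists = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
    W = np.exp(-sq_dists / (2.0 * bandwidth ** 2))
    return W / W.sum(axis=1, keepdims=True)


def self_train(X, y, labeled, bandwidth=1.0, tol=1e-8, max_iter=1000):
    """Iteratively fill in unlabeled responses with the smoother's predictions.

    X       : (n, d) feature array
    y       : (n,) responses; entries where labeled is False are ignored
    labeled : (n,) boolean mask, True where the response is observed
    """
    S = kernel_smoother_matrix(X, bandwidth)
    # Assumption: start the unlabeled responses at zero.
    y_work = np.where(labeled, y, 0.0)
    for _ in range(max_iter):
        fitted = S @ y_work
        # Keep observed responses fixed; replace the rest with predictions.
        y_next = np.where(labeled, y, fitted)
        if np.max(np.abs(y_next - y_work)) < tol:
            return y_next
        y_work = y_next
    return y_work
```

In this sketch, labeled responses are reset to their observed values after every pass, so only the unlabeled entries change between iterations. With strictly positive kernel weights, each unlabeled prediction is a weighted average that always places some weight on labeled points, which is why the iteration settles to a fixed point in this simple setting.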


Pages: 545-571

Page count: 27

