泛化误差的各种交叉验证估计方法综述

被引：69

作者：

杨柳 ^{[1
]}

王钰 ^{[2
]}

机构：

[1] 山西财经大学应用数学学院

[2] 山西大学计算机中心

来源：

计算机应用研究 | 2015年 / 32卷 / 05期

关键词：

机器学习; 泛化误差; 交叉验证; 偏差; 方差;

D O I：

暂无

中图分类号：

TP181 [自动推理、机器学习];

学科分类号：

摘要：

在机器学习中,泛化误差(预测误差)是用于算法性能度量最常用的指标,然而由于数据的分布未知,泛化误差不能被直接计算,实际中常常通过各种形式的交叉验证方法来估计泛化误差。详细地分析了泛化误差的各交叉验证估计方法的优缺点,对照了各种方法之间的差异,提出和分析了各方法中有待进一步研究的问题和方向。

引用

页码：1287 / 1290+1297 +1297

页数：5

共 11 条

[1] 汉语框架语义角色的自动标注 [J].

李济洪 ;

王瑞波 ;

王蔚林 ;

李国臣 .

软件学报, 2010, 21 (04) :597-611

[2]

基于生物信息数据的几种交叉验证方法比较[D]. 胡军艳.山西大学 2013

[3]

统计学习理论的本质[M]. 清华大学出版社 , (美)VladimirN.Vapnik著, 2000

[4] Blocked 3x2 Cross-Validated t-Test for Comparing Supervised Classification Learning Algorithms [J].

Wang Yu ;

Wang Ruibo ;

Jia Huichen ;

Li Jihong .

NEURAL COMPUTATION, 2014, 26 (01) :208-235

[5] Model selection by bootstrap penalization for classification [J].

Magalie Fromont .

Machine Learning, 2007, 66 :165-207

[6] Inference for the generalization error [J].

Nadeau, C ;

Bengio, Y .

MACHINE LEARNING, 2003, 52 (03) :239-281

[7]

Bayesian measures of model complexity and fit[J] . David J.Spiegelhalter,Nicola G.Best,Bradley P.Carlin,AngelikaVan Der Linde.Journal of the Royal Statistical Society: Series B （Statistical Methodology） . 2002 (4)

[8] Bayesian model selection and model averaging [J].

Wasserman, L .

JOURNAL OF MATHEMATICAL PSYCHOLOGY, 2000, 44 (01) :92-107

[9]

Smoothing noisy data with spline functions[J] . Peter Craven,Grace Wahba.Numerische Mathematik . 1978 (4)

[10]

Omnivariate rule induction using a novel pairwise statistical test. Yildiz O T. IEEE Transactions on Knowledge and Data Engineering . 2013

← 1 2 →