Sensitivity Analysis of k-Fold Cross Validation in Prediction Error Estimation

Cited by: 1340

Authors:

Diego Rodriguez, Juan [1]

Perez, Aritz [1]

Antonio Lozano, Jose [1]

Affiliation:

[1] Univ Basque Country UPV EHU, Fac Comp Sci, Intelligent Syst Grp, E-20018 Donostia San Sebastian, Gipuzkoa, Spain

Source:

IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE | 2010 / Vol. 32 / No. 03

Keywords:

k-fold cross validation; prediction error; error estimation; bias and variance; decomposition of the variance; sources of sensitivity; supervised classification; VARIANCE; BIAS;

DOI:

10.1109/TPAMI.2009.187

Chinese Library Classification number:

TP18 [Artificial intelligence theory];

Discipline classification codes:

081104 ; 0812 ; 0835 ; 1405 ;

Abstract:

In the machine learning field, the performance of a classifier is usually measured in terms of prediction error. In most real-world problems, the error cannot be calculated exactly and must be estimated. Therefore, it is important to choose an appropriate estimator of the error. This paper analyzes the statistical properties, bias and variance, of the k-fold cross-validation classification error estimator (k-cv). Our main contribution is a novel theoretical decomposition of the variance of the k-cv considering its sources of variance: sensitivity to changes in the training set and sensitivity to changes in the folds. The paper also compares the bias and variance of the estimator for different values of k. The experimental study has been performed in artificial domains because they allow the exact computation of the quantities involved and the conditions of experimentation can be specified rigorously. The experimentation has been performed for two classifiers (naive Bayes and nearest neighbor), different numbers of folds, sample sizes, and training sets coming from assorted probability distributions. We conclude by including some practical recommendations on the use of k-fold cross validation.
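As an illustration of the estimator the abstract discusses, the following is a minimal sketch, not the authors' experimental code. It assumes a toy one-dimensional domain with two overlapping Gaussian classes and a hand-written 1-nearest-neighbor classifier; the function names (`make_sample`, `k_fold_cv_error`, `nearest_neighbor_predict`) and all parameter values are hypothetical. It holds the training set fixed and repeats k-fold cross-validation over different random partitions, so the spread of the estimates reflects only the sensitivity to changes in the folds.

```python
import random
import statistics

def nearest_neighbor_predict(train, x):
    """Return the label of the training point closest to x (1-NN, one feature)."""
    return min(train, key=lambda point: abs(point[0] - x))[1]

def k_fold_cv_error(data, k, rng):
    """Estimate the 1-NN classification error with k-fold cross-validation."""
    indices = list(range(len(data)))
    rng.shuffle(indices)  # draw a random partition into k folds
    folds = [indices[i::k] for i in range(k)]
    errors = 0
    for fold in folds:
        held_out = set(fold)
        train = [data[i] for i in indices if i not in held_out]
        # Count misclassified held-out points for this fold.
        errors += sum(
            nearest_neighbor_predict(train, data[i][0]) != data[i][1]
            for i in fold
        )
    return errors / len(data)

def make_sample(n, rng):
    """Draw n labelled points from two overlapping Gaussian classes (assumed toy domain)."""
    sample = []
    for _ in range(n):
        label = rng.randint(0, 1)
        sample.append((rng.gauss(2.0 * label, 1.0), label))
    return sample

rng = random.Random(0)
data = make_sample(60, rng)

# Same training set, 20 different random partitions into 10 folds.
estimates = [k_fold_cv_error(data, 10, rng) for _ in range(20)]
mean_estimate = statistics.mean(estimates)
fold_variance = statistics.pvariance(estimates)
print(mean_estimate, fold_variance)
```

Drawing a fresh sample with `make_sample` for each repetition, instead of reusing `data`, would add the second source the abstract names, sensitivity to changes in the training set. With k equal to the sample size (leave-one-out) the partition no longer matters, so the fold-induced spread vanishes.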


Pages: 569-575

Number of pages: 7

References: 28 in total

