Blocked 3x2 Cross-Validated t-Test for Comparing Supervised Classification Learning Algorithms

被引:32
作者
Wang Yu [1 ]
Wang Ruibo [1 ]
Jia Huichen [2 ]
Li Jihong [1 ]
机构
[1] Shanxi Univ, Ctr Comp, Taiyuan 030006, Peoples R China
[2] Shanxi Univ, Sch Math Sci, Taiyuan 030006, Peoples R China
关键词
VARIANCE;
D O I
10.1162/NECO_a_00532
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
In the research of machine learning algorithms for classification tasks, the comparison of the performances of algorithms is extremely important, and a statistical test of significance for generalization error is often used to perform it in the machine learning literature. In view of the randomness of partitions in cross-validation, a new blocked 3x2 cross-validation is proposed to estimate generalization error in this letter. We then conduct an analysis of variance of the blocked 3x2 cross-validated estimator. A relatively conservative variance estimator that considers the correlation between any two two-fold cross-validations, and was previously neglected in 5x2 cross-validated t and F-tests is put forward. A corresponding test using this variance estimator is presented to compare the performances of algorithms. Simulated results show that the performance of our test is comparable with that of 5x2 cross-validated tests but with less computation complexity.
引用
收藏
页码:208 / 235
页数:28
相关论文
共 14 条
[1]   Combined 5 x 2 cv F test for comparing supervised classification learning algorithms [J].
Alpaydin, E .
NEURAL COMPUTATION, 1999, 11 (08) :1885-1892
[2]  
Alpaydin E., 2010, Introduction to Machine Learning, V2
[3]  
Bengio Y, 2004, J MACH LEARN RES, V5, P1089
[4]  
Bouckaert R. R., 2005, P ICML 2005 WORKSH M
[5]  
Bouckaert RR, 2004, LECT NOTES ARTIF INT, V3056, P3
[6]   Classifier variability: Accounting for training and testing [J].
Chen, Weijie ;
Gallas, Brandon D. ;
Yousef, Waleed A. .
PATTERN RECOGNITION, 2012, 45 (07) :2661-2671
[7]   Approximate statistical tests for comparing supervised classification learning algorithms [J].
Dietterich, TG .
NEURAL COMPUTATION, 1998, 10 (07) :1895-1923
[8]  
Grandvalet Y., 2006, 1285 U MONTR
[9]  
Li J., 2010, THESIS SHANXI U
[10]  
Li Jun-cai, 2010, Xibei Zhiwu Xuebao, V30, P30