Combined 5 x 2 cv F test for comparing supervised classification learning algorithms

被引:319
作者
Alpaydin, E
机构
[1] IDIAP, CH-1290 Martigny, Switzerland
[2] Bogazici Univ, Dept Comp Engn, TR-80815 Istanbul, Turkey
关键词
D O I
10.1162/089976699300016007
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Dietterich (1998) reviews five statistical tests and proposes the 5 x 2 cv t test for determining whether there is a significant difference between the error rates of two classifiers. In our experiments, we noticed that the 5 x 2 cv t test result may vary depending on factors that should not affect the test, and we propose a variant, the combined 5 x 2 cv F test, that combines multiple statistics to get a more robust test. Simulation results show that this combined version of the test has lower type I error and higher power than 5 x 2 cv proper.
引用
收藏
页码:1885 / 1892
页数:8
相关论文
共 3 条