A critical analysis of variants of the AUC

Cited by: 28
Authors
Vanderlooy, Stijn [2]
Huellermeier, Eyke [1]
Affiliations
[1] Univ Marburg, Dept Math & Comp Sci, Marburg, Germany
[2] Maastricht Univ, Dept Comp Sci, MICC, Maastricht, Netherlands
Keywords
ROC analysis; area under the ROC curve; ranking performance; bias-variance analysis; AUC variants; AUC maximization
DOI
10.1007/s10994-008-5070-x
Chinese Library Classification number
TP18 [Artificial intelligence theory]
Discipline classification codes
081104; 0812; 0835; 1405
Abstract
The area under the ROC curve, or AUC, has been widely used to assess the ranking performance of binary scoring classifiers. Given a sample, the metric considers the ordering of positive and negative instances, i.e., the sign of the corresponding score differences. From a model evaluation and selection point of view, it may appear unreasonable to ignore the absolute value of these differences. For this reason, several variants of the AUC metric that take score differences into account have recently been proposed. In this paper, we present a unified framework for these metrics and provide a formal analysis. We conjecture that, despite their intuitive appeal, actually none of the variants is effective, at least with regard to model evaluation and selection. An extensive empirical analysis corroborates this conjecture. Our findings also shed light on recent research dealing with the construction of AUC-optimizing classifiers.
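
The distinction the abstract draws, between using only the sign of a score difference and using its magnitude, can be sketched in a few lines of Python. This is a minimal illustration, not the paper's framework: `auc` is the standard pairwise (Wilcoxon-Mann-Whitney) form of the metric, while `margin_weighted_auc` is a hypothetical variant introduced here only to show what "taking score differences into account" could look like. It is not claimed to match any specific variant analyzed in the paper.

```python
from itertools import product


def auc(pos_scores, neg_scores):
    """Fraction of (positive, negative) pairs ordered correctly.

    Only the sign of each score difference matters; ties count as 0.5.
    """
    total = 0.0
    for p, n in product(pos_scores, neg_scores):
        if p > n:
            total += 1.0
        elif p == n:
            total += 0.5
    return total / (len(pos_scores) * len(neg_scores))


def margin_weighted_auc(pos_scores, neg_scores):
    """Hypothetical variant (an assumption for illustration only).

    Each correctly ordered pair contributes its score difference
    instead of a constant 1, so larger margins score higher.
    """
    total = 0.0
    for p, n in product(pos_scores, neg_scores):
        if p > n:
            total += p - n
    return total / (len(pos_scores) * len(neg_scores))


# Two made-up models that rank the same four instances perfectly,
# but with very different margins between positives and negatives.
wide_pos, wide_neg = [0.9, 0.8], [0.2, 0.1]
narrow_pos, narrow_neg = [0.6, 0.55], [0.5, 0.45]

print(auc(wide_pos, wide_neg), auc(narrow_pos, narrow_neg))
print(margin_weighted_auc(wide_pos, wide_neg),
      margin_weighted_auc(narrow_pos, narrow_neg))
```

Both models receive the same AUC because every positive outranks every negative, whereas the margin-weighted score separates them. Whether that extra sensitivity actually helps model evaluation and selection is the question the paper examines.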
Pages: 247-262
Number of pages: 16