ROC analysis in ordinal regression learning

被引:76
作者
Waegeman, Willem
De Baets, Bernard
Boullart, Luc
机构
[1] Univ Ghent, Dept Elect Energy Syst & Automat, B-9052 Ghent, Belgium
[2] Univ Ghent, Dept Appl Math Biometr & Proc Control, B-9000 Ghent, Belgium
关键词
ROC analysis; ranking; ordinal regression; unbalanced learning problems; performance measures; machine learning;
D O I
10.1016/j.patrec.2007.07.019
中图分类号
TP18 [人工智能理论];
学科分类号
081104 ; 0812 ; 0835 ; 1405 ;
摘要
Nowadays the area under the receiver operating characteristics (ROC) curve, which corresponds to the Wilcoxon-Mann-Whitney test statistic, is increasingly used as a performance measure for binary classification systems. In this article we present a natural generalization of this concept for more than two ordered categories, a setting known as ordinal regression. Our extension of the Wilcoxon-Mann-Whitney statistic now corresponds to the volume under an r-dimensional surface (VUS) for r ordered categories and differs from extensions recently proposed for multi-class classification. VUS rather evaluates the ranking returned by an ordinal regression model instead of measuring the error rate, a way of thinking which has especially advantages with skew class or cost distributions. We give theoretical and experimental evidence of the advantages and different behavior of VUS compared to error rate, mean absolute error and other ranking-based performance measures for ordinal regression. The results demonstrate that the models produced by ordinal regression algorithms minimizing the error rate or a preference learning based loss, not necessarily impose a good ranking on the data. (C) 2007 Elsevier B.V. All rights reserved.
引用
收藏
页码:1 / 9
页数:9
相关论文
共 23 条
[1]  
Agarwal S, 2005, J MACH LEARN RES, V6, P393
[2]  
[Anonymous], 2011, Categorical data analysis
[3]  
Chu W, 2005, J MACH LEARN RES, V6, P1019
[4]  
CHU W, 2005, P INT C MACH LEARN B, P321
[5]  
Cortes Corinna, 2003, Advances in neural information processing systems, V16
[6]  
Crammer K, 2002, ADV NEUR IN, V14, P641
[7]  
Cristianini N., 2000, Intelligent Data Analysis: An Introduction
[8]   Comparing three-class diagnostic tests by three-way ROC analysis [J].
Dreiseitl, S ;
Ohno-Machado, L ;
Binder, M .
MEDICAL DECISION MAKING, 2000, 20 (03) :323-331
[9]   Multi-class ROC analysis from a multi-objective optimisation perspective [J].
Everson, Richard M. ;
Fieldsend, Jonathan E. .
PATTERN RECOGNITION LETTERS, 2006, 27 (08) :918-927
[10]  
Ferri C, 2003, LECT NOTES ARTIF INT, V2837, P108