The hypervolume under the ROC hypersurface of "near-guessing" and "near-perfect" observers in N-class classification tasks

被引:37
作者
Edwards, DC [1 ]
Metz, CE [1 ]
Nishikawa, RM [1 ]
机构
[1] Univ Chicago, Dept Radiol, Chicago, IL 60637 USA
关键词
N-class classification; ROC analysis; ROC performance metrics;
D O I
10.1109/TMI.2004.841227
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We express the performance of the N-class "guessing" observer in terms of the N-2 - N conditional probabilities which make up an N-class receiver operating characteristic (ROC) space, in a formulation in which sensitivities are eliminated in constructing the ROC space (equivalent to using false-negative fraction and false-positive fraction in a two-class task). We then show that the "guessing" observer's performance in terms of these conditional probabilities is completely described by a degenerate hypersurface with only N - I degrees of freedom (as opposed to the N-2 - N - 1 required, in general, to achieve a true hypersurface in such a ROC space). It readily follows that the hypervolume under such a degenerate hypersurface must be zero when N > 2. We then consider a "near-guessing" task; that is, a task in which the N underlying data probability density functions (pdfs) are nearly identical, controlled by N - 1 parameters which may vary continuously to zero (at which point the pdfs become identical). With this approach, we show that the hypervolume under the ROC hypersurface of an observer in an N-class classification task tends continuously to zero as the underlying data pdfs converge continuously to identity (a "guessing" task). The hypervolume under the ROC hypersurface of a "perfect" ideal observer (in a task in which the N data pdfs never overlap) is also found to be zero in the ROC space formulation under consideration. This suggests that hypervolume may not be a useful performance metric in N-class classification tasks for N > 2, despite the utility of the area under the ROC curve for two-class tasks.
引用
收藏
页码:293 / 299
页数:7
相关论文
共 22 条
[1]  
[Anonymous], 1992, BAYESIAN METHODS ADA
[2]  
[Anonymous], 1968, SER DETECTION ESTIMA
[3]   AUTOMATED SEGMENTATION OF DIGITIZED MAMMOGRAMS [J].
BICK, U ;
GIGER, ML ;
SCHMIDT, RA ;
NISHIKAWA, RM ;
WOLVERTON, DE ;
DOI, K .
ACADEMIC RADIOLOGY, 1995, 2 (01) :1-9
[4]   Design of three-class classifiers in computer-aided diagnosis: Monte Carlo simulation study [J].
Chan, HP ;
Sahiner, B ;
Hadjiiski, LM ;
Petrick, N ;
Zhou, C .
MEDICAL IMAGING 2003: IMAGE PROCESSING, PTS 1-3, 2003, 5032 :567-578
[5]   Comparing three-class diagnostic tests by three-way ROC analysis [J].
Dreiseitl, S ;
Ohno-Machado, L ;
Binder, M .
MEDICAL DECISION MAKING, 2000, 20 (03) :323-331
[6]   Ideal observers and optimal ROC hypersurfaces in N-class classification [J].
Edwards, DC ;
Metz, CE ;
Kupinski, MA .
IEEE TRANSACTIONS ON MEDICAL IMAGING, 2004, 23 (07) :891-895
[7]   Estimating three-class ideal observer decision variables for computerized detection and classification of mammographic mass lesions [J].
Edwards, DC ;
Lan, L ;
Metz, CE ;
Giger, ML ;
Nishikawa, RM .
MEDICAL PHYSICS, 2004, 31 (01) :81-90
[8]   Estimation of three-class ideal observer decision functions with a Bayesian artificial neural network [J].
Edwards, DC ;
Metz, CE ;
Nishikawa, RM .
MEDICAL IMAGING 2002: IMAGE PERCEPTION, OBSERVER PERFORMANCE, AND TECHNOLOGY ASSESSMENT, 2002, 4686 :1-12
[9]  
GROSSMAN S, 1986, MULTIVARIABLE CALCUL
[10]   Breast cancer: Effectiveness of computer-aided diagnosis - Observer study with independent database of mammograms [J].
Huo, ZM ;
Giger, ML ;
Vyborny, CJ ;
Metz, CE .
RADIOLOGY, 2002, 224 (02) :560-568