An empirical comparison of discrete ratings and subjective probability ratings

被引:14
作者
Berbaum, KS [1 ]
Dorfman, DD
Franken, EA
Caldwell, RT
机构
[1] Univ Iowa, Med Labs 3170, Dept Radiol, Iowa City, IA 52242 USA
[2] Univ Iowa, Med Labs 3170, Dept Psychol, Iowa City, IA 52242 USA
关键词
diagnostic radiology; observer performance; images; interpretation; quality assurance; receiver operating characteristic curve (ROC);
D O I
10.1016/S1076-6332(03)80344-6
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Rationale and Objectives. The authors compared receiver operating characteristic (ROC) data from a five-category discrete scale with that from a 101-category subjective probability scale to determine how well the latter categories define the ROC curve. Materials and Methods. The authors analyzed data from a pilot study performed for another purpose in which 10 radiologists provided both a five-point confidence rating and a subjective probability rating of abnormality for each interpretation. ROC operating points were plotted for a five-category scale and a 101-category scale to determine how well the observed points covered the range of false-positive probabilities. ROC curves were fitted to the subjective probability data according to the standard ROC model. Results. For these data, subjective probability ratings were somewhat more effective in populating the range of false-positive probability with ROC points. For three observers, the ROC curves inappropriately crossed the chance line. For another four, prevention of such crossing seemed to depend on one or two ROC points near the upper right corner of the ROC space, points based on discriminations within the discrete category "no abnormality to report." Conclusion. Subjective probability rating should provide substantially better coverage of the ROC space with operating points, preventing inappropriate crossing of the chance line. Unfortunately, the protection offered by subjective probability ratings was unreliable and depended on ROC points derived from discriminations not directly related to apparent abnormality. The use of proper ROC models to fit data may offer a better solution.
引用
收藏
页码:756 / 763
页数:8
相关论文
共 23 条
[1]   Proper ROC analysis and joint ROC analysis of the satisfaction of search effect in chest radiology [J].
Berbaum, KS ;
Dorfman, DD ;
Franken, EA ;
Caldwell, RT .
ACADEMIC RADIOLOGY, 2000, 7 (11) :945-958
[2]   Gaze dwell times on acute trauma injuries missed because of satisfaction of search [J].
Berbaum, KS ;
Brandser, EA ;
Franken, EA ;
Dorfman, DD ;
Caldwell, RT ;
Krupinski, EA .
ACADEMIC RADIOLOGY, 2001, 8 (04) :304-314
[3]  
Dorfman D. D., 1982, EVALUATION DIAGNOSTI, P212
[4]   MAXIMUM-LIKELIHOOD ESTIMATION OF PARAMETERS OF SIGNAL-DETECTION THEORY AND DETERMINATION OF CONFIDENCE INTERVALS - RATING-METHOD DATA [J].
DORFMAN, DD ;
ALF, E .
JOURNAL OF MATHEMATICAL PSYCHOLOGY, 1969, 6 (03) :487-&
[5]   DEGENERACY AND DISCRETE RECEIVER OPERATING CHARACTERISTIC RATING DATA [J].
DORFMAN, DD ;
BERBAUM, KS .
ACADEMIC RADIOLOGY, 1995, 2 (10) :907-915
[6]   Proper receiver operating characteristic analysis: The bigamma model [J].
Dorfman, DD ;
Berbaum, KS ;
Metz, CE ;
Lenth, RV ;
Hanley, JA ;
AbuDagga, H .
ACADEMIC RADIOLOGY, 1997, 4 (02) :138-149
[7]   A contaminated binormal model for ROC data - Part I. Some interesting examples of binormal degeneracy [J].
Dorfman, DD ;
Berbaum, KS ;
Brandser, EA .
ACADEMIC RADIOLOGY, 2000, 7 (06) :420-426
[8]   A contaminated binormal model for ROC data - Part III. Initial evaluation with detection ROC data [J].
Dorfman, DD ;
Berbaum, KS .
ACADEMIC RADIOLOGY, 2000, 7 (06) :438-447
[9]   A contaminated binormal model for ROC data - Part II. A formal model [J].
Dorfman, DD ;
Berbaum, KS .
ACADEMIC RADIOLOGY, 2000, 7 (06) :427-437
[10]  
EGAN JP, 1975, SIGNAL DETECTION THE, P37