Observer studies involving detection and localization: Modeling, analysis, and validation

被引:303
作者
Chakraborty, DP
Berbaum, KS
机构
[1] Univ Pittsburgh, Dept Radiol, Pittsburgh, PA 15213 USA
[2] Univ Iowa, Dept Radiol, Iowa City, IA 52242 USA
关键词
observer performance; ROC analysis; FROC analysis; localization; statistical power;
D O I
10.1118/1.1769352
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Although the receiver operating characteristic (ROC) paradigm is the accepted method for evaluation of diagnostic imaging systems, it has some serious shortcomings inasmuch as it is restricted to one observer report per image. By contrast the free-response ROC (FROC) paradigm and associated analysis method allows the observer to report multiple abnormalities within each imaging study, and uses the location of reported abnormalities to improve the measurement. Because the ROC method cannot accommodate multiple responses or use location information, its statistical power will suffer. The FROC paradigm/analysis has not enjoyed widespread acceptance because of concern about whether responses made to the same diagnostic study can be treated as independent. We propose a new jackknife FROC analysis method (JAFROC) that does not make the independence assumption. The new analysis method combines elements of FROC and the Dorfman -Berbaum-Metz (DBM) methods. To compare JAFROC to an earlier free-response analysis method (specifically the alternative free-response, or AFROC method), and to the DBM method, which uses conventional ROC scoring, we developed a model for generating simulated FROC data. The simulation model is based on an eye-movement model of how experts evaluate images. It allowed us to examine null hypothesis (NH) behavior and statistical power of the different methods. We found that AFROC analysis did not pass the NH test, being unduly conservative. Both the JAFROC method and the DBM method passed the NH test, but JAFROC had more statistical power than the DBM method. The results of this comparison suggest that future studies of diagnostic performance may enjoy improved statistical power or reduced sample size requirements through the use of the JAFROC method. (C) 2004 American Association of Physicists in Medicine.
引用
收藏
页码:2313 / 2330
页数:18
相关论文
共 39 条
[1]  
Berbaum K. S., 1994, Emergency Radiology, V1, P242, DOI [10.1007/BF02614935, DOI 10.1007/BF02614935]
[2]   Role of faulty visual search in the satisfaction of search effect in chest radiography [J].
Berbaum, KS ;
Franken, EA ;
Dorfman, DD ;
Miller, EM ;
Caldwell, RT ;
Kuehn, DM ;
Berbaum, ML .
ACADEMIC RADIOLOGY, 1998, 5 (01) :9-19
[3]   SATISFACTION OF SEARCH IN DIAGNOSTIC-RADIOLOGY [J].
BERBAUM, KS ;
FRANKEN, EA ;
DORFMAN, DD ;
ROOHOLAMINI, SA ;
KATHOL, MH ;
BARLOON, TJ ;
BEHLKE, FM ;
SATO, Y ;
LU, CH ;
ELKHOURY, GY ;
FLICKINGER, FW ;
MONTGOMERY, WJ .
INVESTIGATIVE RADIOLOGY, 1990, 25 (02) :133-140
[4]  
Botsco M, 1999, MAMMOGRAPHY QUALITY
[5]  
BUNCH PC, 1978, J APPL PHOTOGR ENG, V4, P166
[6]   COMPARISON OF RECEIVER OPERATING CHARACTERISTIC AND FORCED-CHOICE OBSERVER PERFORMANCE-MEASUREMENT METHODS [J].
BURGESS, AE .
MEDICAL PHYSICS, 1995, 22 (05) :643-655
[7]   Statistical power in observer-performance studies: Comparison of the receiver operating characteristic and free-response methods in tasks involving localization [J].
Chakraborty, D .
ACADEMIC RADIOLOGY, 2002, 9 (02) :147-156
[8]   Data analysis for detection and localization of multiple abnormalities with application to mammography -: Authors:: Nancy A.!Obuchowski, PhD, Michael L.!Lieber, MS, Kimerly A.!Powell, PhD -: Publication:: Acad Radiol 2000; 7:516-525 [J].
Chakraborty, DP .
ACADEMIC RADIOLOGY, 2000, 7 (07) :553-554
[9]   Proposed solution to the FROC problem and an invitation to collaborate [J].
Chakraborty, DP .
MEDICAL IMAGING 2003: IMAGE PERCEPTION, OBSERVER PERFORMANCE, AND TECHNOLOGY ASSESSMENT, 2003, 5034 :204-212
[10]   MAXIMUM-LIKELIHOOD ANALYSIS OF FREE-RESPONSE RECEIVER OPERATING CHARACTERISTIC (FROC) DATA [J].
CHAKRABORTY, DP .
MEDICAL PHYSICS, 1989, 16 (04) :561-568