MULTIREADER RECEIVER OPERATING CHARACTERISTIC STUDIES - A COMPARISON OF STUDY DESIGNS

Cited by: 83
Authors
OBUCHOWSKI, NA [1]
Affiliation
[1] CLEVELAND CLIN FDN, DEPT RADIOL, CLEVELAND, OH 44195
Keywords
RECEIVER OPERATING CHARACTERISTIC CURVE; MULTIREADER STUDIES; STUDY DESIGN; STATISTICAL POWER
DOI
10.1016/S1076-6332(05)80441-6
Chinese Library Classification (CLC) number
R8 [Special Medicine]; R445 [Diagnostic Imaging]
Discipline classification code
1002; 100207; 1009
Abstract
Rationale and Objectives. Traditionally, multireader receiver operating characteristic (ROC) studies have used a "paired-case, paired-reader" design. The statistical power of such a design for inferences about the relative accuracies of the tests was assessed and compared with alternative designs. Methods. The noncentrality parameter of an F statistic was used to compute power as a function of the reader and patient sample sizes and the variability and correlations between readings. Results. For a fixed power and Type I error rate, the traditional design reduces the number of verified cases required. A hybrid design, in which each reader interprets a different sample of patients, reduces the number of readers, total readings, and readings required per reader. The drawback is a substantial increase in the number of verified cases. Conclusion. The ultimate choice of study design depends on the nature of the tests being compared, limiting resources, a priori knowledge of the magnitude of the correlations and variability, and logistic complexity.
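Note on the Methods. The power computation mentioned in the abstract can be written in generic form as below. This is a sketch under stated assumptions, not the paper's own formula: the symbols \nu_1, \nu_2, \lambda, and \alpha are notation introduced here, and the abstract gives neither the degrees of freedom nor the exact form of the noncentrality parameter. It says only that the parameter depends on the reader and patient sample sizes and on the variability and correlations between readings.

```latex
% Generic power of an F test (notation assumed, not taken from the paper):
%   \nu_1, \nu_2 : numerator and denominator degrees of freedom
%   \alpha       : Type I error rate
%   \lambda      : noncentrality parameter under the alternative hypothesis
\[
  \text{Power}
  = \Pr\!\left( F_{\nu_1,\,\nu_2;\,\lambda} > F_{1-\alpha;\,\nu_1,\,\nu_2} \right)
\]
% F_{1-\alpha;\,\nu_1,\,\nu_2} is the (1-\alpha) quantile of the central F
% distribution; F_{\nu_1,\,\nu_2;\,\lambda} is a noncentral F random variable.
% With \lambda = 0 the expression reduces to \alpha, and for fixed
% \nu_1, \nu_2 the power increases with \lambda.
```

Comparing study designs at a fixed power and Type I error rate then amounts to finding, for each design, the sample sizes whose \lambda reaches the target power.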
Pages: 709-716
Number of pages: 8