Confidence intervals for the receiver operating characteristic area in studies with small samples

Cited by: 74
Authors
Obuchowski, NA [1]
Lieber, ML [1]
Affiliations
[1] Cleveland Clin Fdn, Dept Biostat & Epidemiol, Cleveland, OH 44195 USA
Keywords
receiver operating characteristic (ROC) curve
DOI
10.1016/S1076-6332(98)80208-0
Chinese Library Classification (CLC) number
R8 [Special Medicine]; R445 [Diagnostic Imaging];
Discipline classification code
1002 ; 100207 ; 1009 ;
Abstract
Rationale and Objectives. The authors performed this study to address two practical questions. First, how large does the sample size need to be for confidence intervals (CIs) based on the usual asymptotic methods to be appropriate? Second, when the sample size is smaller than this threshold, what alternative method of CI construction should be used? Materials and Methods. The authors performed a Monte Carlo simulation study in which 95% CIs were constructed for the receiver operating characteristic (ROC) area and for the difference between two ROC areas, for rating and continuous test results and for ROC areas of moderate and high accuracy, by using both parametric and nonparametric estimation methods. Alternative methods evaluated included several bootstrap CIs and CIs with the Student t distribution. Results. For the difference between two ROC areas, CIs based on the asymptotic theory provided adequate coverage even when the sample size was very small (20 patients). In contrast, for a single ROC area, the asymptotic methods do not provide adequate CI coverage for small samples; for ROC areas of high accuracy, the sample size must be large (more than 200 patients) for the asymptotic methods to be applicable. The recommended alternative (bootstrap percentile, bootstrap t, or bootstrap bias-corrected accelerated method) depends on the estimation approach, format of the test results, and ROC area. Conclusion. Currently, there is not a single best alternative for constructing CIs for a single ROC area for small samples.
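To make one of the alternatives named in the abstract concrete, the following is a minimal sketch of a bootstrap percentile CI for a single nonparametric ROC area. It is not the authors' code: the function names, the sample scores, the number of resamples, and the choice to resample diseased and nondiseased patients separately are all assumptions made for illustration.

```python
import random


def auc_mann_whitney(neg, pos):
    """Nonparametric ROC area: P(pos > neg) + 0.5 * P(pos == neg)."""
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5  # ties count as half a win
    return wins / (len(pos) * len(neg))


def bootstrap_percentile_ci(neg, pos, n_boot=2000, alpha=0.05, seed=0):
    """Percentile CI for the ROC area.

    Assumption: patients are resampled with replacement within each
    group (nondiseased and diseased), keeping group sizes fixed.
    """
    rng = random.Random(seed)
    stats = []
    for _ in range(n_boot):
        neg_b = rng.choices(neg, k=len(neg))
        pos_b = rng.choices(pos, k=len(pos))
        stats.append(auc_mann_whitney(neg_b, pos_b))
    stats.sort()
    # Take the alpha/2 and 1 - alpha/2 empirical quantiles.
    lo_idx = max(0, int(round((alpha / 2) * n_boot)))
    hi_idx = min(n_boot - 1, int(round((1 - alpha / 2) * n_boot)) - 1)
    return stats[lo_idx], stats[hi_idx]


# Hypothetical continuous test results for 20 patients (10 per group).
nondiseased = [0.8, 1.1, 1.3, 1.9, 2.0, 2.2, 2.7, 3.0, 3.4, 4.1]
diseased = [1.7, 2.5, 2.9, 3.3, 3.8, 4.0, 4.6, 5.2, 5.9, 6.3]

area = auc_mann_whitney(nondiseased, diseased)
ci_low, ci_high = bootstrap_percentile_ci(nondiseased, diseased)
print(f"ROC area = {area:.3f}, 95% percentile CI = ({ci_low:.3f}, {ci_high:.3f})")
```

The bootstrap t and bias-corrected accelerated variants mentioned in the abstract start from the same resampled areas but adjust the quantiles differently; they are not shown here.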
Pages: 561-571
Number of pages: 11
Related papers
15 in total
[1]   COMPARING THE AREAS UNDER 2 OR MORE CORRELATED RECEIVER OPERATING CHARACTERISTIC CURVES - A NONPARAMETRIC APPROACH [J].
DELONG, ER ;
DELONG, DM ;
CLARKEPEARSON, DI .
BIOMETRICS, 1988, 44 (03) :837-845
[2]   MAXIMUM-LIKELIHOOD ESTIMATION OF PARAMETERS OF SIGNAL-DETECTION THEORY AND DETERMINATION OF CONFIDENCE INTERVALS - RATING-METHOD DATA [J].
DORFMAN, DD ;
ALF, E .
JOURNAL OF MATHEMATICAL PSYCHOLOGY, 1969, 6 (03) :487-&
[3]   MAXIMUM LIKELIHOOD ESTIMATION OF PARAMETERS OF SIGNAL DETECTION THEORY - A DIRECT SOLUTION [J].
DORFMAN, DD ;
ALF, E .
PSYCHOMETRIKA, 1968, 33 (01) :117-&
[4]   RECEIVER OPERATING CHARACTERISTIC RATING ANALYSIS - GENERALIZATION TO THE POPULATION OF READERS AND PATIENTS WITH THE JACKKNIFE METHOD [J].
DORFMAN, DD ;
BERBAUM, KS ;
METZ, CE .
INVESTIGATIVE RADIOLOGY, 1992, 27 (09) :723-731
[5]  
Efron B., 1994, INTRO BOOTSTRAP, V57, DOI 10.1201/9780429246593
[6]  
Efron B., 1982, SOC IND APPL MATH CB, V38, DOI 10.1137/1.9781611970319
[7]   A comparison of parametric and nonparametric approaches to ROC analysis of quantitative diagnostic tests [J].
HajianTilaki, KO ;
Hanley, JA ;
Joseph, L ;
Collet, JP .
MEDICAL DECISION MAKING, 1997, 17 (01) :94-102
[8]   A METHOD OF COMPARING THE AREAS UNDER RECEIVER OPERATING CHARACTERISTIC CURVES DERIVED FROM THE SAME CASES [J].
HANLEY, JA ;
MCNEIL, BJ .
RADIOLOGY, 1983, 148 (03) :839-843
[9]   THE MEANING AND USE OF THE AREA UNDER A RECEIVER OPERATING CHARACTERISTIC (ROC) CURVE [J].
HANLEY, JA ;
MCNEIL, BJ .
RADIOLOGY, 1982, 143 (01) :29-36
[10]   ANALYZING A PORTION OF THE ROC CURVE [J].
MCCLISH, DK .
MEDICAL DECISION MAKING, 1989, 9 (03) :190-195