Bias in sensitivity and specificity caused by data-driven selection of optimal cutoff values: Mechanisms, magnitude, and solutions

Cited by: 256
Authors
Leeflang, Mariska M. G. [1 ]
Moons, Karel G. M. [2 ]
Reitsma, Johannes B. [1 ]
Zwinderman, Aielko H. [1 ]
Affiliations
[1] Univ Amsterdam, Acad Med Ctr, Dept Clin Epidemiol Biostat & Bioinformat, NL-1100 DE Amsterdam, Netherlands
[2] Univ Med Ctr, Julius Ctr Hlth Sci & Gen Practice, Utrecht, Netherlands
DOI
10.1373/clinchem.2007.096032
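Chinese Library Classification (CLC) number
R446 [Laboratory diagnosis]; R-33 [Experimental medicine, medical experiments]
Discipline classification code
1001
Abstract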
BACKGROUND: Optimal cutoff values for test results involving continuous variables are often derived in a data-driven way. This approach, however, may lead to overly optimistic measures of diagnostic accuracy. We evaluated the magnitude of the bias in sensitivity and specificity associated with data-driven selection of cutoff values and examined potential solutions to reduce this bias.

METHODS: Different sample sizes, distributions, and prevalences were used in a simulation study. We compared data-driven estimates of accuracy based on the Youden index with the true values and calculated the median bias. Three alternative approaches (assuming a specific distribution, leave-one-out, smoothed ROC curve) were examined for their ability to reduce this bias.

RESULTS: The magnitude of bias caused by data-driven optimization of cutoff values was inversely related to sample size. If the true values for sensitivity and specificity are both 84%, the estimates in studies with a sample size of 40 will be approximately 90%. If the sample size increases to 200, the estimates will be 86%. The distribution of the test results had little impact on the amount of bias when sample size was held constant. More robust methods of optimizing cutoff values were less prone to bias, but their performance deteriorated if the underlying assumptions were not met.

CONCLUSIONS: Data-driven selection of the optimal cutoff value can lead to overly optimistic estimates of sensitivity and specificity, especially in small studies. Alternative methods can reduce this bias, but finding robust estimates for cutoff values and accuracy requires considerable sample sizes. © 2008 American Association for Clinical Chemistry.
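The mechanism described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' code or design. It assumes two unit-variance normal distributions separated by 2 standard deviations (so that true sensitivity and specificity at the midpoint cutoff are both about 84%), a prevalence of 50%, and a "positive if value >= cutoff" rule. The function names, the number of simulation runs, and the seed are illustrative choices. In each simulated study the cutoff is chosen to maximize the Youden index (sensitivity + specificity - 1) on the same sample that is then used to estimate accuracy.

```python
import math
import numpy as np

# Assumed setup: healthy ~ N(0, 1), diseased ~ N(SHIFT, 1).
SHIFT = 2.0
# True accuracy at the population-optimal cutoff (the midpoint SHIFT / 2).
TRUE_SENS = 0.5 * (1.0 + math.erf((SHIFT / 2.0) / math.sqrt(2.0)))
TRUE_YOUDEN = 2.0 * TRUE_SENS - 1.0


def apparent_accuracy(diseased, healthy):
    """Choose the cutoff that maximizes the Youden index in this sample and
    return the apparent (same-sample) sensitivity and specificity."""
    candidates = np.unique(np.concatenate([diseased, healthy]))
    # A result counts as positive when it is >= the cutoff.
    sens = (diseased[None, :] >= candidates[:, None]).mean(axis=1)
    spec = (healthy[None, :] < candidates[:, None]).mean(axis=1)
    best = np.argmax(sens + spec - 1.0)
    return sens[best], spec[best]


def mean_apparent_youden(n_total, prevalence=0.5, n_sim=500, seed=0):
    """Average apparent Youden index over n_sim simulated studies."""
    rng = np.random.default_rng(seed)
    n_dis = int(round(n_total * prevalence))
    n_hea = n_total - n_dis
    values = []
    for _ in range(n_sim):
        diseased = rng.normal(SHIFT, 1.0, n_dis)
        healthy = rng.normal(0.0, 1.0, n_hea)
        sens, spec = apparent_accuracy(diseased, healthy)
        values.append(sens + spec - 1.0)
    return float(np.mean(values))


if __name__ == "__main__":
    print(f"true Youden index: {TRUE_YOUDEN:.3f}")
    for n in (40, 200):
        print(f"n = {n}: mean apparent Youden index = {mean_apparent_youden(n):.3f}")
```

Under these assumptions the apparent Youden index should exceed the true value, with a larger gap at n = 40 than at n = 200, which matches the direction of the bias reported in the abstract. The exact numbers depend on the assumed distributions, the summary statistic (mean here, median in the abstract), and the seed.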
Pages: 729-737
Number of pages: 9
Related papers
16 in total
[1]   DANGERS OF USING OPTIMAL CUTPOINTS IN THE EVALUATION OF PROGNOSTIC FACTORS [J].
ALTMAN, DG ;
LAUSEN, B ;
SAUERBREI, W ;
SCHUMACHER, M .
JOURNAL OF THE NATIONAL CANCER INSTITUTE, 1994, 86 (11) :829-835
[2]   Sample sizes of studies on diagnostic accuracy: literature survey [J].
Bachmann, LM ;
Puhan, MA ;
ter Riet, G ;
Bossuyt, PM .
BRITISH MEDICAL JOURNAL, 2006, 332 (7550) :1127-1129
[3]   Post hoc choice of cut points introduced bias to diagnostic research [J].
Ewald, Ben .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 2006, 59 (08) :798-801
[4]   Estimation of the Youden index and its associated cutoff point [J].
Fluss, R ;
Faraggi, D ;
Reiser, B .
BIOMETRICAL JOURNAL, 2005, 47 (04) :458-472
[5]   GENERALIZATION OF ONE-SIDED 2-SAMPLE KOLMOGOROV-SMIRNOV STATISTIC FOR EVALUATING DIAGNOSTIC TESTS [J].
GAIL, MH ;
GREEN, SB .
BIOMETRICS, 1976, 32 (03) :561-570
[6]   Principles and practical application of the receiver-operating characteristic analysis for diagnostic tests [J].
Greiner, M ;
Pfeiffer, D ;
Smith, RD .
PREVENTIVE VETERINARY MEDICINE, 2000, 45 (1-2) :23-41
[7]   Methods to estimate the optimal threshold for normally or log-normally distributed biological tests [J].
Jund, J ;
Rabilloud, M ;
Wallon, M ;
Ecochard, R .
MEDICAL DECISION MAKING, 2005, 25 (04) :406-415
[8]   A solution for the most basic optimization problem associated with an ROC curve [J].
Le, Chap T. .
STATISTICAL METHODS IN MEDICAL RESEARCH, 2006, 15 (06) :571-584
[9]   Impact of adjustment for quality on results of metaanalyses of diagnostic accuracy [J].
Leeflang, Mariska ;
Reitsma, Johannes ;
Scholten, Rob ;
Rutjes, Anne ;
Di Nisio, Marcello ;
Deeks, Jon ;
Bossuyt, Patrick .
CLINICAL CHEMISTRY, 2007, 53 (02) :164-172
[10]  
LINNET K, 1986, CLIN CHEM, V32, P1341