Redundancy of single diagnostic test evaluation

被引:64
作者
Moons, KGM
van Es, GA
Michel, BC
Büller, HR
Habbema, JDF
Grobbee, DE
机构
[1] Univ Utrecht, Julius Ctr Patient Oriented Res, NL-3508 GA Utrecht, Netherlands
[2] Erasmus Univ, Sch Med, Dept Epidemiol & Biostat, NL-3000 DR Rotterdam, Netherlands
[3] Erasmus Univ, Sch Med, Dept Publ Hlth, Ctr Clin Decis Sci, Rotterdam, Netherlands
[4] Cardialysis, Rotterdam, Netherlands
[5] Erasmus Univ, Sch Med, Ctr Haemostasis, Rotterdam, Netherlands
[6] Amsterdam Med Ctr, Ctr Haemostasis Thrombosis Atherosclerosis & Infl, Amsterdam, Netherlands
关键词
diagnosis; multivariable; pulmonary embolism; test parameters; univariable; ROC curves;
D O I
10.1097/00001648-199905000-00015
中图分类号
R1 [预防医学、卫生学];
学科分类号
1004 ; 120402 ;
摘要
Diagnostic research and diagnostic practice frequently do not cohere. Studies commonly evaluate whether a single test discriminates between disease presence and absence, whereas in practice a test is always judged in the context of other information. This study illustrates drawbacks of single-test evaluation and discusses principles of diagnostic research. We used data on 140 patients suspected of pulmonary embolism who had an inconclusive ventilation-perfusion lung scan. We evaluated three tests: partial pressure of oxygen in arterial blood (PaO2), x-ray film of the thorax, and leg ultrasound. On the basis of single-test evaluations, ultrasound was most informative. Given a prior probability of 0.27, it had a much better combination of positive and negative predictive value (0.71 and 0.21, respectively) relative to thorax x-ray (0.33 and 0.11) and PaO2 (0.35 and 0.27). The combination of positive and negative likelihood ratio was also more promising for ultrasound (7.3 and 0.7) than for thorax x-ray (1.3 and 0.3) and PaO2 (1.3 and 0.9). As the tests are always performed after the history and physical, we judged their added. value using multivariable logistic modeling with receiver operating characteristic (ROC) analyses. The ROC areas of the model, including history and physical, with additional PaO2, thorax x-ray, or ultrasound, were 0.75, 0.77, 0.81, and 0.81, respectively, which indicated similar added value of thorax x-ray and ultrasound. Application of the models to patient subgroups also yielded added predictive value for thorax x-ray film. Thus, the results of single-test evaluations may be very misleading. As no diagnosis is based on one test, single-test evaluations have limited value in diagnostic research and only have relevance in the context of screening and the initial phase of test development. Diagnostic research should always apply an approach of constructing, extending, and validating diagnostic models in agreement with routine clinical work up using logistic regression analyses.
引用
收藏
页码:276 / 281
页数:6
相关论文
共 40 条
[1]   EXPERIMENTAL-DESIGN OF MEDICAL IMAGING TRIALS - ISSUES AND OPTIONS [J].
BEGG, CB .
INVESTIGATIVE RADIOLOGY, 1989, 24 (11) :934-936
[2]  
BEGG CB, 1986, CRC CRIT R MED INFOR, V1, P1
[3]   ANOTHER VIEW OF POLYCHOTOMOUS ANALYSES - RESPONSE [J].
BEGG, CB ;
MCNEIL, BJ .
MEDICAL DECISION MAKING, 1985, 5 (01) :123-126
[4]  
DELONG ER, 1988, BIOMETRICS, V44, P817
[5]   REVEREND BAYES SILENT MAJORITY - AN ALTERNATIVE FACTOR AFFECTING SENSITIVITY AND SPECIFICITY OF EXERCISE ELECTROCARDIOGRAPHY [J].
DIAMOND, GA .
AMERICAN JOURNAL OF CARDIOLOGY, 1986, 57 (13) :1175-1180
[6]   CLINICAL EPISTEMOLOGY OF SENSITIVITY AND SPECIFICITY - DISSENT [J].
DIAMOND, GA .
JOURNAL OF CLINICAL EPIDEMIOLOGY, 1992, 45 (01) :9-13
[7]   CLINICAL JUDGMENT REVISITED - THE DISTRACTION OF QUANTITATIVE MODELS [J].
FEINSTEIN, AR .
ANNALS OF INTERNAL MEDICINE, 1994, 120 (09) :799-805
[8]   EVALUATING AND COMPARING IMAGING TECHNIQUES - A REVIEW AND CLASSIFICATION OF STUDY DESIGNS [J].
FREEDMAN, LS .
BRITISH JOURNAL OF RADIOLOGY, 1987, 60 (719) :1071-1081
[9]   THE EFFICACY OF DIAGNOSTIC-IMAGING [J].
FRYBACK, DG ;
THORNBURY, JR .
MEDICAL DECISION MAKING, 1991, 11 (02) :88-94
[10]  
FRYBACK DG, 1995, 13 NCRP