ENHANCING AND EVALUATING DIAGNOSTIC-ACCURACY

被引:96
作者
SWETS, JA
GETTY, DJ
PICKETT, RM
DORSI, CJ
SELTZER, SE
MCNEIL, BJ
机构
[1] BRIGHAM & WOMENS HOSP,DEPT RADIOL,BOSTON,MA 02115
[2] HARVARD UNIV,SCH MED,DEPT HLTH CARE POLICY,BOSTON,MA 02115
关键词
COMPUTER-AIDED DIAGNOSIS; EXPERT SYSTEMS; TECHNOLOGY ASSESSMENT; QUALITY ASSURANCE; DIAGNOSTIC ACCURACY; ROC ANALYSIS; FEATURE ANALYSIS; COGNITIVE PROCESSES; PERCEPTION;
D O I
10.1177/0272989X9101100102
中图分类号
R19 [保健组织与事业(卫生事业管理)];
学科分类号
摘要
Techniques that may enhance diagnostic accuracy in clinical settings were tested in the context of mammography. Statistical information about the relevant features among those visible in a mammogram and about their relative importances in the diagnosis of breast cancer was the basis of two decision aids for radiologists: a checklist that guides the radiologist in assigning a scale value to each significant feature of the images of a particular case, and a computer program that merges those scale values optimally to estimate a probability of malignancy. A test set of approximately 150 proven cases (including normals and benign and malignant lesions) was interpreted by six radiologists, first in their usual manner and later with the decision aids. The enhancing effect of these feature-analytic techniques was analyzed across subsets of cases that were restricted progressively to more and more difficult cases, where difficulty was defined in terms of the radiologists' judgments in the standard reading condition. Accuracy in both standard and enhanced conditions decreased regularly and substantially as case difficulty increased, but differentially, such that the enhancement effect grew regularly and substantially. For the most difficult case sets, the observed increases in accuracy translated into an increase of about 0.15 in sensitivity (true-positive proportion) for a selected specificity (true-negative proportion) of 0.85 or a similar increase in specificity for a selected sensitivity of 0.85. That measured accuracy can depend on case-set difficulty to different degrees for two diagnostic approaches has general implications for evaluation in clinical medicine. Comparative, as well as absolute, assessments of diagnostic performances-for example, of alternative imaging techniques-may be distorted by inadequate treatments of this experimental variable. Subset analysis, as defined and illustrated here, can be useful in alleviating the problem.
引用
收藏
页码:9 / 18
页数:10
相关论文
共 9 条