PERFORMANCE OF 4 COMPUTER-BASED DIAGNOSTIC SYSTEMS

被引:198
作者
BERNER, ES
WEBSTER, GD
SHUGERMAN, AA
JACKSON, JR
ALGINA, J
BAKER, AL
BALL, EV
COBBS, CG
DENNIS, VW
FRENKEL, EP
HUDSON, LD
MANCALL, EL
RACKLEY, CE
TAUNTON, D
机构
[1] INFORMED INC,ST DAVIDS,PA
[2] UNIV FLORIDA,GAINESVILLE,FL
[3] UNIV CHICAGO,CHICAGO,IL 60637
[4] CLEVELAND CLIN FDN,CLEVELAND,OH 44195
[5] UNIV WASHINGTON,SEATTLE,WA 98195
[6] HAHNEMANN UNIV,PHILADELPHIA,PA 19102
[7] GEORGETOWN UNIV,WASHINGTON,DC
[8] BAPTIST MED CTR MONTCLAIR,BIRMINGHAM,AL
[9] UNIV TEXAS,DALLAS,TX 75230
关键词
D O I
10.1056/NEJM199406233302506
中图分类号
R5 [内科学];
学科分类号
1002 ; 100201 ;
摘要
Background. Computer-based diagnostic systems are available commercially, but there has been limited evaluation of their performance. We assessed the diagnostic capabilities of four internal medicine diagnostic systems: Dxplain, Iliad, Meditel, and QMR. Methods. Ten expert clinicians created a set of 105 diagnostically challenging clinical case summaries involving actual patients. Clinical data were entered into each program with the vocabulary provided by the program's developer. Each of the systems produced a ranked list of possible diagnoses for each patient, as did the group of experts. We calculated scores on several performance measures for each computer program. Results. No single computer program scored better than the others on all performance measures. Among all cases and all programs, the proportion of correct diagnoses ranged from 0.52 to 0.71, and the mean proportion of relevant diagnoses ranged from 0.19 to 0.37. On average, less than half the diagnoses on the experts' original list of reasonable diagnoses were suggested by any of the programs. However, each program suggested an average of approximately two additional diagnoses per case that the experts found relevant but had not originally considered. Conclusions. The results provide a profile of the strengths and limitations of these computer programs. The programs should be used by physicians who can identify and use the relevant information and ignore the irrelevant information that can be produced.
引用
收藏
页码:1792 / 1796
页数:5
相关论文
共 50 条
  • [31] COMPUTER-ASSISTED PEDIATRIC DIAGNOSIS
    OSHEA, JS
    [J]. AMERICAN JOURNAL OF DISEASES OF CHILDREN, 1975, 129 (02): : 199 - 202
  • [32] RAND TG, 1992, ACP OBS, V12, P10
  • [33] REGGIA JA, 1985, COMPUTER ASSISTED ME
  • [34] Salomon G., 1991, EDUC RES-UK, V20, P2, DOI DOI 10.3102/0013189X020003002
  • [35] ARTIFICIAL-INTELLIGENCE IN MEDICINE - WHERE DO WE STAND
    SCHWARTZ, WB
    PATIL, RS
    SZOLOVITS, P
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 1987, 316 (11) : 685 - 688
  • [36] MODIFIED SEQUENTIALLY REJECTIVE MULTIPLE TEST PROCEDURES
    SHAFFER, JP
    [J]. JOURNAL OF THE AMERICAN STATISTICAL ASSOCIATION, 1986, 81 (395) : 826 - 831
  • [37] COMPUTER-PROGRAMS TO SUPPORT CLINICAL DECISION-MAKING
    SHORTLIFFE, EH
    [J]. JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 1987, 258 (01): : 61 - 66
  • [38] THE ADOLESCENCE OF AI IN MEDICINE - WILL THE FIELD COME OF AGE IN THE 90S
    SHORTLIFFE, EH
    [J]. ARTIFICIAL INTELLIGENCE IN MEDICINE, 1993, 5 (02) : 93 - 106
  • [39] SHORTLIFFE EH, 1990, MED INFORMATICS COMP
  • [40] COMPUTER-ASSISTED DIAGNOSIS
    SWENDER, PT
    TUNNESSEN, WW
    OSKI, FA
    [J]. AMERICAN JOURNAL OF DISEASES OF CHILDREN, 1974, 127 (06): : 859 - 861