A comparison of classification algorithms to automatically identify chest X-ray reports that support pneumonia

被引:60
作者
Chapman, WW [1 ]
Fizman, M
Chapman, BE
Huag, PJ
机构
[1] Univ Pittsburgh, Ctr Biomed Informat, Pittsburgh, PA 15213 USA
[2] Univ Pittsburgh, Dept Radiol, Pittsburgh, PA 15213 USA
[3] Univ Utah, LDS Hosp, Dept Med Informat, Salt Lake City, UT USA
关键词
D O I
10.1006/jbin.2001.1000
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
We compared the performance of expert-crafted rules, a Bayesian network, and a decision tree at automatically identifying chest X-ray reports that support acute bacterial pneumonia. We randomly selected 292 chest X-ray reports, 75 (25%) of which were from patients with a hospital discharge diagnosis of bacterial pneumonia. The reports were encoded by our natural language processor and then manually corrected for mistakes. The encoded observations were analyzed by three expert systems to determine whether the reports supported pneumonia. The reference standard for radiologic support of pneumonia was the majority vote of three physicians. We compared (a) the performance of thr expert systems against each other and ib) the performance of the expert systems against that of four physicians who were not part of the gold standard. Output from the expert systems and the physicians a as transformed so that comparisons could be made with both binary and probabilistic output. Metrics of comparison For binary output were sensitivity (sens), precision (prec). and specificity (spec). The metric of comparison for probabilistic output was the area under the receiver operator characteristic (ROC) curve. We used McNemar's test to determine statistical significance for binary output and univariate z-tests for probabilistic output. Measures of performance of the expert systems for binary (probabilistic) output were as follows: Rules-sens. 0.92; prec, 0.80; spec, 0.86 (A(z), 0.960); Bayesian network-sens, 0.90; prec, 0.72; spec, 0.78 (A(z), 0.945); decision tree-sens, 0.86; prec. 0.85; spec, 0.91 (A(z), 0.940). Comparisons of the expert systems against each other using binary output showed a significant difference between the rules and the Bayesian network and between the decision tree and the Bayesian network. Comparisons of expert systems using probabilistic output showed no significant differences. Comparisons of binary output against physicians showed differences between the Bayesian network and two physicians. Comparisons of probabilistic output against physicians showed a difference between the decision tree and one physician. The expert systems performed similarly for the probabilistic output but differed in measures of sensitivity, precision, and specificity produced by the binary output. All three expert systems performed similarly to physicians. (C) 2001 Academic Press.
引用
收藏
页码:4 / 14
页数:11
相关论文
共 37 条
  • [1] STATISTICS NOTES - ABSENCE OF EVIDENCE IS NOT EVIDENCE OF ABSENCE
    ALTMAN, DG
    BLAND, JM
    [J]. BRITISH MEDICAL JOURNAL, 1995, 311 (7003) : 485 - 485
  • [2] [Anonymous], 1991, A primer on generalizability theory
  • [3] Aronsky D, 2000, J AM MED INFORM ASSN, P12
  • [4] Aronsky D, 1998, J AM MED INFORM ASSN, P632
  • [5] Ceusters W, 1997, ST HEAL T, V43, P396
  • [6] A computer-assisted management program for antibiotics and other antiinfective agents
    Evans, RS
    Pestotnik, SL
    Classen, DC
    Clemmer, TP
    Weaver, LK
    Orme, JF
    Lloyd, JF
    Burke, JP
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 1998, 338 (04) : 232 - 238
  • [7] Automatic detection of acute bacterial pneumonia from chest x-ray reports
    Fiszman, M
    Chapman, WW
    Aronsky, D
    Evans, RS
    Haug, PJ
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2000, 7 (06) : 593 - 604
  • [8] Fiszman M, 2000, J AM MED INFORM ASSN, P235
  • [9] Fiszman M, 1999, J AM MED INFORM ASSN, P67
  • [10] Fiszman M, 1998, J AM MED INFORM ASSN, P860