Creating a text classifier to detect radiology reports describing mediastinal findings associated with inhalational anthrax and other disorders

被引:29
作者
Chapman, WW
Cooper, GF
Hanbury, P
Chapman, BE
Harrison, LH
Wagner, MM
机构
[1] Univ Pittsburgh, Ctr Biomed Informat, Pittsburgh, PA 15213 USA
[2] Univ Pittsburgh, RODS Lab, Pittsburgh, PA USA
[3] Univ Pittsburgh, Dept Radiol, Pittsburgh, PA 15260 USA
[4] Univ Pittsburgh, Dept Epidemiol, Pittsburgh, PA 15260 USA
[5] Univ Pittsburgh, Dept Med, Infect Dis Epidemiol Res Unit, Pittsburgh, PA 15260 USA
关键词
D O I
10.1197/jamia.M1330
中图分类号
TP [自动化技术、计算机技术];
学科分类号
0812 ;
摘要
Objective: The aim of this study was to create a classifier for automatic detection of chest radiograph reports consistent with the mediastinal findings of inhalational anthrax. Design: The authors used the Identify Patient Sets (IPS) system to create a key word classifier for detecting reports describing mediastinal findings consistent with anthrax and compared their performances on a test set of 79,032 chest radiograph reports. Measurements: Area under the ROC curve was the main outcome measure of the IPS classifier. Sensitivity and specificity of an initial IPS model were calculated based on an existing key word search and were compared against a Boolean version of the IPS classifier. Results: The IPS classifier received an area under the ROC curve of 0.677 (90% Cl = 0.628 to 0.772) with a specificity of 0.99 and maximum sensitivity of 0.35. The initial IPS model attained a specificity of 1.0 and a sensitivity of 0.04. Conclusion: The IPS system is a useful tool for helping domain experts create a statistical key word classifier for textual reports that is a potentially useful component in surveillance of radiographic findings suspicious for anthrax.
引用
收藏
页码:494 / 503
页数:10
相关论文
共 37 条
  • [1] PATHOLOGY OF INHALATIONAL ANTHRAX IN 42 CASES FROM THE SVERDLOVSK OUTBREAK OF 1979
    ABRAMOVA, FA
    GRINBERG, LM
    YAMPOLSKAYA, OV
    WALKER, DH
    [J]. PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 1993, 90 (06) : 2291 - 2294
  • [2] [Anonymous], 1999, P KDD, DOI [10.1145/312129.312195, DOI 10.1016/J.EC0LENG.2010.11.031]
  • [3] Aronis JM, 1999, J AM MED INFORM ASSN, P658
  • [4] Chapman WW, 2001, J AM MED INFORM ASSN, P105
  • [5] A simple algorithm for identifying negated findings and diseases in discharge summaries
    Chapman, WW
    Bridewell, W
    Hanbury, P
    Cooper, GF
    Buchanan, BG
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2001, 34 (05) : 301 - 310
  • [6] A comparison of classification algorithms to automatically identify chest X-ray reports that support pneumonia
    Chapman, WW
    Fizman, M
    Chapman, BE
    Huag, PJ
    [J]. JOURNAL OF BIOMEDICAL INFORMATICS, 2001, 34 (01) : 4 - 14
  • [7] CHAPMAN WW, 2003, IN PRESS J URBAN H S
  • [8] Cooper GF, 1998, J AM MED INFORM ASSN, P180
  • [9] Anthrax
    Dixon, TC
    Meselson, M
    Guillemin, J
    Hanna, PC
    [J]. NEW ENGLAND JOURNAL OF MEDICINE, 1999, 341 (11) : 815 - 826
  • [10] MULTIREADER, MULTICASE RECEIVER OPERATING CHARACTERISTIC METHODOLOGY - A BOOTSTRAP ANALYSIS
    DORFMAN, DD
    BERBAUM, KS
    LENTH, RV
    [J]. ACADEMIC RADIOLOGY, 1995, 2 (07) : 626 - 633