A simple algorithm for identifying negated findings and diseases in discharge summaries

被引:607
作者
Chapman, WW
Bridewell, W
Hanbury, P
Cooper, GF
Buchanan, BG
机构
[1] Univ Pittsburgh, Ctr Biomed Informat, Pittsburgh, PA 15213 USA
[2] Univ Pittsburgh, Dept Comp Sci, Pittsburgh, PA 15213 USA
关键词
text classification; pertinent negatives; negation narrative; medical reports; natural language processing; artificial intelligence;
D O I
10.1006/jbin.2001.1029
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Narrative reports in medical records contain a wealth of information that may augment structured data for managing patient information and predicting trends in diseases. Pertinent negatives are evident in text but are not usually indexed in structured databases. The objective of the study reported here was to test a simple algorithm for determining whether a finding or disease mentioned within narrative medical reports is present or absent. We developed a simple regular expression algorithm called NegEx that implements several phrases indicating negation, filters out sentences containing phrases that falsely appear to be negation phrases, and limits the scope of the negation phrases. We compared NegEx against a baseline algorithm that has a limited set of negation phrases and a simpler notion of scope. In a test of 1235 findings and diseases in 1000 sentences taken from discharge summaries indexed by physicians, NegEx had a specificity of 94.5% (versus 85.3% for the baseline), a positive predictive value of 84.5% (versus 68.4% for the baseline) while maintaining a reasonable sensitivity of 77.8% (versus 88.3% for the baseline). We conclude that with little implementation effort a simple regular expression algorithm for determining whether a finding or disease is absent can identify a large portion of the pertinent negatives from discharge summaries. (C) 2001 Elsevier Science (USA).
引用
收藏
页码:301 / 310
页数:10
相关论文
共 24 条
  • [1] Aronis JM, 1999, J AM MED INFORM ASSN, P658
  • [2] Aronson A. R., 1994, P RIAO, V1, P197
  • [3] Chapman WW, 2001, J AM MED INFORM ASSN, P105
  • [4] Cooper GF, 1998, J AM MED INFORM ASSN, P180
  • [5] Automatic detection of acute bacterial pneumonia from chest x-ray reports
    Fiszman, M
    Chapman, WW
    Aronsky, D
    Evans, RS
    Haug, PJ
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2000, 7 (06) : 593 - 604
  • [6] Fiszman M, 2000, J AM MED INFORM ASSN, P235
  • [7] Friedman C, 1999, J AM MED INFORM ASSN, P256
  • [8] Natural language processing and its future in medicine
    Friedman, C
    Hripcsak, G
    [J]. ACADEMIC MEDICINE, 1999, 74 (08) : 890 - 895
  • [9] A GENERAL NATURAL-LANGUAGE TEXT PROCESSOR FOR CLINICAL RADIOLOGY
    FRIEDMAN, C
    ALDERSON, PO
    AUSTIN, JHM
    CIMINO, JJ
    JOHNSON, SB
    [J]. JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1994, 1 (02) : 161 - 174
  • [10] HERSH W, 1996, INFORMATION RETRIEVA