Assertion modeling and its role in clinical phenotype identification

被引:15
作者
Bejan, Cosmin Adrian [1 ]
Vanderwende, Lucy [1 ,2 ]
Xia, Fei [1 ,3 ]
Yetisgen-Yildiz, Meliha [1 ,3 ]
机构
[1] Univ Washington, Seattle, WA 98195 USA
[2] Microsoft Corp, Microsoft Res, Redmond, WA 98052 USA
[3] Univ Washington, Dept Linguist, Seattle, WA 98195 USA
关键词
Natural language processing; Clinical information extraction; Assertion classification; Pneumonia identification; Statistical feature selection; NEGATION DETECTION; SYSTEM; UMLS;
D O I
10.1016/j.jbi.2012.09.001
中图分类号
TP39 [计算机的应用];
学科分类号
080201 [机械制造及其自动化];
摘要
This paper describes an approach to assertion classification and an empirical study on the impact this task has on phenotype identification, a real world application in the clinical domain. The task of assertion classification is to assign to each medical concept mentioned in a clinical report (e.g., pneumonia, chest pain) a specific assertion category (e.g., present, absent, and possible). To improve the classification of medical assertions, we propose several new features that capture the semantic properties of special cue words highly indicative of a specific assertion category. The results obtained outperform the current state-of-the-art results for this task. Furthermore, we confirm the intuition that assertion classification contributes in significantly improving the results of phenotype identification from free-text clinical records. (C) 2012 Elsevier Inc. All rights reserved.
引用
收藏
页码:68 / 74
页数:7
相关论文
共 30 条
[1]
[Anonymous], 2011, P 49 ANN M ASS COMPU
[2]
[Anonymous], 1997, ICML
[3]
Ad hoc classification of radiology reports [J].
Aronow, DB ;
Feng, FF ;
Croft, WB .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 1999, 6 (05) :393-411
[4]
Aronson AR, 2001, J AM MED INFORM ASSN, P17
[5]
Pneumonia identification using statistical feature selection [J].
Bejan, Cosmin Adrian ;
Xia, Fei ;
Vanderwende, Lucy ;
Wurfel, Mark M. ;
Yetisgen-Yildiz, Meliha .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2012, 19 (05) :817-823
[6]
The Unified Medical Language System (UMLS): integrating biomedical terminology [J].
Bodenreider, O .
NUCLEIC ACIDS RESEARCH, 2004, 32 :D267-D270
[7]
A simple algorithm for identifying negated findings and diseases in discharge summaries [J].
Chapman, WW ;
Bridewell, W ;
Hanbury, P ;
Cooper, GF ;
Buchanan, BG .
JOURNAL OF BIOMEDICAL INFORMATICS, 2001, 34 (05) :301-310
[8]
MITRE system for clinical assertion status classification [J].
Clark, Cheryl ;
Aberdeen, John ;
Coarr, Matt ;
Tresner-Kirsch, David ;
Wellner, Ben ;
Yeh, Alexander ;
Hirschman, Lynette .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (05) :563-567
[9]
Machine-learned solutions for three stages of clinical information extraction: the state of the art at i2b2 2010 [J].
de Bruijn, Berry ;
Cherry, Colin ;
Kiritchenko, Svetlana ;
Martin, Joel ;
Zhu, Xiaodan .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2011, 18 (05) :557-562
[10]
Elkin P.L., 2005, BMC MED INFORM DECIS, V5