Selecting information in electronic health records for knowledge acquisition

被引:32
作者
Wang, Xiaoyan [1 ]
Chase, Herbert [1 ]
Markatou, Marianthi [2 ]
Hripcsak, George [1 ]
Friedman, Carol [1 ]
机构
[1] Columbia Univ, Dept Biomed Informat, New York, NY 10032 USA
[2] Columbia Univ, Dept Biostat, New York, NY 10032 USA
基金
美国国家科学基金会;
关键词
Knowledge acquisition; Natural language processing (NLP); Text mining; Pharmacovigilance; Decision support; Electronic health record (EHR); CLINICAL DOCUMENTS; LANGUAGE; PHARMACOVIGILANCE;
D O I
10.1016/j.jbi.2010.03.011
中图分类号
TP39 [计算机的应用];
学科分类号
081203 ; 0835 ;
摘要
Knowledge acquisition of relations between biomedical entities is critical for many automated biomedical applications, including pharmacovigilance and decision support. Automated acquisition of statistical associations from biomedical and clinical documents has shown some promise. However, acquisition of clinically meaningful relations (i.e. specific associations) remains challenging because textual information is noisy and co-occurrence does not typically determine specific relations. In this work, we focus on acquisition of two types of relations from clinical reports: disease-manifestation related symptom (MRS) and drug-adverse drug event (ADE), and explore the use of filtering by sections of the reports to improve performance. Evaluation indicated that applying the filters improved recall (disease-MRS: from 0.85 to 0.90; drug-ADE: from 0.43 to 0.75) and precision (disease-MRS: from 0.82 to 0.92; drug-ADE: from 0.16 to 0.31). This preliminary study demonstrates that selecting information in narrative electronic reports based on the sections improves the detection of disease-MRS and drug-ADE types of relations. Further investigation of complementary methods, such as more sophisticated statistical methods, more complex temporal models and use of information from other knowledge sources, is needed. (C) 2010 Elsevier Inc. All rights reserved.
引用
收藏
页码:595 / 601
页数:7
相关论文
共 32 条
[1]  
Aronson AR, 2000, J AM MED INFORM ASSN, P17
[3]   A statistical methodology for analyzing co-occurrence data from a large sample [J].
Cao, Hui ;
Hripcsak, George ;
Markatou, Marianthi .
JOURNAL OF BIOMEDICAL INFORMATICS, 2007, 40 (03) :343-352
[4]  
Cao Hui, 2005, AMIA Annu Symp Proc, P106
[5]   Automated acquisition of disease-drug knowledge from biomedical and clinical documents: An initial study [J].
Chen, Elizabeth S. ;
Hripcsak, George ;
Xu, Hua ;
Markatou, Marianthi ;
Friedman, Carol .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2008, 15 (01) :87-98
[6]  
Chen LF, 2004, STUD HEALTH TECHNOL, V107, P758
[7]  
Christensen LM., 2002, P ACL 02 WORKSHOP NA, P29
[8]  
Denny Joshua C, 2008, AMIA Annu Symp Proc, P156
[9]   Evaluation of a Method to Identify and Categorize Section Headers in Clinical Documents [J].
Denny, Joshua C. ;
Spickard, Anderson, III ;
Johnson, Kevin B. ;
Peterson, Neeraja B. ;
Peterson, Josh F. ;
Miller, Randolph A. .
JOURNAL OF THE AMERICAN MEDICAL INFORMATICS ASSOCIATION, 2009, 16 (06) :806-815
[10]  
FERRI F, 2006, FERRIS DIFFERENTIAL