A cascade of classifiers for extracting medication information from discharge summaries

Cited by: 11
Authors
Halgrim S.R. [1]
Xia F. [1]
Solti I. [2]
Cadag E. [1]
Uzuner [3]
Affiliations
[1] University of Washington, PO Box 543450, Seattle, 98195, WA
[2] Cincinnati Children's Hospital Medical Center, Cincinnati, 45229-3039, OH
[3] University of Albany, SUNY, 135 Western Ave, Albany, 12222, NY
Funding
National Institutes of Health (United States)
Keywords
Regular Expression; Discharge Summary; Conditional Random Field; Narrative Text; Field Detection
DOI
10.1186/2041-1480-2-S3-S2
Abstract
Background: Extracting medication information from clinical records has many potential applications, and recently published research, systems, and competitions reflect an interest therein. Much of the early extraction work involved rules and lexicons, but more recently machine learning has been applied to the task.
Methods: We present a hybrid system consisting of two parts. The first part, field detection, uses a cascade of statistical classifiers to identify medication-related named entities. The second part uses simple heuristics to link those entities into medication events.
Results: The system achieved performance that is comparable to other approaches to the same task. This performance is further improved by adding features that reference external medication name lists.
Conclusions: This study demonstrates that our hybrid approach outperforms purely statistical or rule-based systems. The study also shows that a cascade of classifiers works better than a single classifier in extracting medication information. The system is available as is upon request from the first author.
© 2011 Halgrim et al; licensee BioMed Central Ltd.
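The two-part design described in the Methods can be illustrated with a minimal Python sketch. It is not the authors' system: the statistical classifiers are replaced here by a toy lexicon and regular expressions, and every name in it (`MED_NAMES`, `DOSAGE_RE`, `FREQ_RE`, `detect_names`, `detect_fields`, `link_events`, `extract`) is hypothetical. It only shows the shape of the pipeline: an earlier stage finds medication names, a later stage looks for other fields only where a name was found, and a simple heuristic links each field to the closest preceding name.

```python
import re
from dataclasses import dataclass

# Hypothetical toy resources; stand-ins for the trained classifiers in the paper.
MED_NAMES = {"aspirin", "lisinopril", "metformin"}
DOSAGE_RE = re.compile(r"\b\d+(?:\.\d+)?\s?(?:mg|mcg|g|ml)\b", re.IGNORECASE)
FREQ_RE = re.compile(r"\b(?:daily|bid|tid|qid|q\d+h)\b", re.IGNORECASE)


@dataclass
class Entity:
    label: str   # "name", "dosage", or "frequency"
    text: str    # surface string as it appears in the line
    start: int   # character offset within the line


def detect_names(line):
    """Stage 1: find medication names (lexicon lookup instead of a classifier)."""
    return [Entity("name", m.group(), m.start())
            for m in re.finditer(r"[A-Za-z]+", line)
            if m.group().lower() in MED_NAMES]


def detect_fields(line, names):
    """Stage 2: look for other fields only if stage 1 found a name."""
    if not names:
        return []
    fields = [Entity("dosage", m.group(), m.start())
              for m in DOSAGE_RE.finditer(line)]
    fields += [Entity("frequency", m.group(), m.start())
               for m in FREQ_RE.finditer(line)]
    return fields


def link_events(names, fields):
    """Heuristic linking: attach each field to the closest preceding name."""
    events = [{"name": n.text} for n in names]
    for f in fields:
        preceding = [i for i, n in enumerate(names) if n.start < f.start]
        if preceding:
            # Keep the first value seen for a label; ignore later duplicates.
            events[preceding[-1]].setdefault(f.label, f.text)
    return events


def extract(line):
    names = detect_names(line)
    return link_events(names, detect_fields(line, names))


if __name__ == "__main__":
    for event in extract("Aspirin 81 mg daily and lisinopril 10 mg bid"):
        print(event)
```

In a real system each stage would be a trained sequence model, and the linking rules would have to handle fields that precede the medication name or span several lines.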