A controlled trial of automated classification of negation from clinical notes

Cited by: 52
Authors
Elkin P.L. [1]
Brown S.H. [2,3]
Bauer B.A. [1]
Husser C.S. [1]
Carruth W. [4]
Bergstrom L.R. [1]
Wahner-Roedler D.L.
Affiliations
[1] Department of Medicine, Mayo Foundation, Rochester, MN
[2] Department of Biomedical Informatics, Vanderbilt University, Nashville, TN
[3] Department of Internal Medicine, Johns Hopkins School of Medicine, Baltimore, MD
Keywords
Positive Likelihood Ratio; Automate Assignment; Lymphangitis; Negative Concept; Compositional Expression
DOI
10.1186/1472-6947-5-13
Abstract
Background: Identifying negation in electronic health records is essential to understanding the computable meaning of the records. Our objective was to compare the accuracy of an automated mechanism for assigning negation to clinical concepts within a compositional expression against human-assigned negation. We also performed a failure analysis to identify the causes of poorly identified negation (i.e., missed conceptual representation, inaccurate conceptual representation, missed negation, and inaccurate identification of negation).

Methods: Forty-one clinical documents (medical evaluations; outside of Mayo these are sometimes referred to as History and Physical Examinations) were parsed using the Mayo Vocabulary Server Parsing Engine. SNOMED-CT™ was used to provide concept coverage for the clinical concepts in the records. Parsing identified concepts and textual clues to negation. The parsed records were reviewed by an independent medical terminologist, and the results were tallied in a spreadsheet. Where questions arose during the review, Internal Medicine faculty made the final determination.

Results: SNOMED-CT was used to provide concept coverage of the 14,792 concepts in 41 health records from Johns Hopkins University. Of these, 1,823 concepts were identified as negative by human review. The sensitivity (recall) of the assignment of negation was 97.2% (p < 0.001, Pearson chi-square test, compared with a coin flip). The specificity of the assignment of negation was 98.8%. The positive likelihood ratio was 81. The positive predictive value (precision) was 91.2%.

Conclusion: Automated assignment of negation to concepts identified in health records, based on review of the text, is feasible and practical. Lexical assignment of negation is a good test of true negativity, as judged by the high sensitivity, specificity, and positive likelihood ratio of the test. SNOMED-CT had overall coverage of 88.7% of the concepts being negated.

© 2005 Elkin et al; licensee BioMed Central Ltd.
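
The accuracy measures quoted in the Results follow from the standard definitions for a binary test. The sketch below is illustrative only and is not the authors' analysis code: the function names are hypothetical, and the counts in the last example are made up to show usage. The one figure checked against the abstract is the positive likelihood ratio, which is sensitivity divided by (1 - specificity).

```python
# Standard accuracy measures for a binary test (here: "concept is negated").
# Function names are hypothetical; they are not taken from the paper.

def sensitivity(tp: int, fn: int) -> float:
    """Recall: share of truly negated concepts that were flagged as negated."""
    return tp / (tp + fn)

def specificity(tn: int, fp: int) -> float:
    """Share of non-negated concepts that were correctly left unflagged."""
    return tn / (tn + fp)

def positive_predictive_value(tp: int, fp: int) -> float:
    """Precision: share of flagged concepts that were truly negated."""
    return tp / (tp + fp)

def positive_likelihood_ratio(sens: float, spec: float) -> float:
    """LR+ = sensitivity / (1 - specificity)."""
    return sens / (1.0 - spec)

# Check against the abstract's reported sensitivity and specificity.
lr_plus = positive_likelihood_ratio(0.972, 0.988)
print(round(lr_plus))  # 81, matching the reported positive likelihood ratio

# Usage with made-up counts (NOT the study's confusion matrix).
print(sensitivity(tp=90, fn=10))                 # 0.9
print(specificity(tn=95, fp=5))                  # 0.95
print(positive_predictive_value(tp=90, fp=5))
```

The abstract does not report the full confusion matrix, so only the ratio computed from the published percentages is reproduced here.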