Evaluating Report Text Variation and Informativeness: Natural Language Processing of CT Chest Imaging for Pulmonary Embolism

被引:21
作者
Huesch, Marco D. [1 ]
Cherian, Rekha [1 ]
Labib, Sam [1 ]
Mahraj, Rickhesvar [1 ]
机构
[1] Milton S Hershey Med Ctr, Dept Radiol, 500 Univ Dr,Mailcode H-066, Hershey, PA 17033 USA
关键词
Structured reporting; text analysis; pulmonary embolus; machine learning; variability; prediction; natural language processing; NLP; CONVOLUTIONAL NEURAL-NETWORKS; RADIOLOGY REPORTS; BIG DATA; CLASSIFICATION; ENHANCEMENT; MEDICINE; IMPACT;
D O I
10.1016/j.jacr.2017.12.017
中图分类号
R8 [特种医学]; R445 [影像诊断学];
学科分类号
1002 ; 100207 ; 1009 ;
摘要
Objective: The aim of this study was to quantify the variability of language in free text reports of pulmonary embolus (PE) studies and to gauge the informativeness of free text to predict PE diagnosis using machine learning as proxy for human understanding. Materials and Methods: All 1,133 consecutive chest CTs with contrast studies performed under a PE protocol and ordered in the emergency department in 2016 were selected from our departmental electronic workflow system. We used commercial text-mining and predictive analytics software to parse and describe all report text and to generate a suite of machine learning rules that sought to predict the "gold standard" radiological diagnosis of PE. Results: There was extensive variation in the length of Findings section and Impression section texts across the reports, only marginally associated with a positive PE diagnosis. A marked concentration of terms was found: for example, 20 words were used in the Findings section of 93% of the reports, and 896 of 2,296 distinct words were each used in only one report's Impression section. In the validation set, machine learning rules had perfect sensitivity but imperfect specificity, a low positive predictive value of 73%, and a misclassification rate of 3%. Conclusion: Use of free text reporting was associated with extensive variability in report length and report terms used. Interpretation of the free text was a difficult machine learning task and suggests potential difficulty for human recipients in fully understanding such reports. These results support the prospective assessment of the impact of a fully structured report template with at least some mandatory discrete fields on ease of use of reports and their understanding.
引用
收藏
页码:554 / 562
页数:9
相关论文
共 21 条
[1]   Structured Reporting of Multiphasic CT for Pancreatic Cancer: Potential Effect on Staging and Surgical Planning [J].
Brook, Olga R. ;
Brook, Alexander ;
Vollmer, Charles M. ;
Kent, Tara S. ;
Sanchez, Norberto ;
Pedrosa, Ivan .
RADIOLOGY, 2015, 274 (02) :464-472
[2]   Unintended Consequences of Machine Learning in Medicine [J].
Cabitza, Federico ;
Rasoini, Raffaele ;
Gensini, Gian Franco .
JAMA-JOURNAL OF THE AMERICAN MEDICAL ASSOCIATION, 2017, 318 (06) :517-518
[3]  
Chakraborty G, 12882014 SAS I
[4]  
Dang Pragya A, 2008, J Am Coll Radiol, V5, P197, DOI 10.1016/j.jacr.2007.09.003
[5]   Application of recently developed computer algorithm for automatic classification of unstructured radiology reports: Validation study [J].
Dreyer, KJ ;
Kalra, MK ;
Maher, MM ;
Hurier, AM ;
Asfaw, BA ;
Schultz, T ;
Halpern, EF ;
Thrall, JH .
RADIOLOGY, 2005, 234 (02) :323-329
[6]   Physician Documentation Deficiencies in Abdominal Ultrasound Reports: Frequency, Characteristics, and Financial Impact [J].
Duszak, Richard, Jr. ;
Nossal, Michael ;
Schofield, Lyle ;
Picus, Daniel .
JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2012, 9 (06) :403-408
[7]   Impact of a Structured Report Template on the Quality of CT and MRI Reports for Hepatocellular Carcinoma Diagnosis [J].
Flusberg, Milana ;
Ganeles, Jeremy ;
Ekinci, Tulay ;
Goldberg-Stein, Shlomit ;
Paroder, Viktoriya ;
Kobi, Mariya ;
Chernyak, Victoria .
JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2017, 14 (09) :1206-1211
[8]   Is Structured Reporting the Answer? [J].
Gunderman, Richard B. ;
McNeive, Logan R. .
RADIOLOGY, 2014, 273 (01) :7-9
[9]   Structured Feedback From Patients on Actual Radiology Reports: A Novel Approach to Improve Reporting Practices [J].
Gunn, Andrew J. ;
Gilcrease-Garcia, Brian ;
Mangano, Mark D. ;
Sahani, Dushyant V. ;
Boland, Giles W. ;
Choy, Garry .
AMERICAN JOURNAL OF ROENTGENOLOGY, 2017, 208 (06) :1262-1270
[10]   Big Data and Machine Learning-Strategies for Driving This Bus: A Summary of the 2016 Intersociety Summer Conference [J].
Kruskal, Jonathan B. ;
Berkowitz, Seth ;
Geis, J. Raymond ;
Kim, Woojin ;
Nagy, Paul ;
Dreyer, Keith .
JOURNAL OF THE AMERICAN COLLEGE OF RADIOLOGY, 2017, 14 (06) :811-817