Skip to main content
Top
Published in: Journal of Digital Imaging 4/2019

Open Access 01-08-2019

Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning

Authors: Jackson M. Steinkamp, Charles Chambers, Darco Lalevic, Hanna M. Zafar, Tessa S. Cook

Published in: Journal of Imaging Informatics in Medicine | Issue 4/2019

Login to get access

Abstract

Unstructured and semi-structured radiology reports represent an underutilized trove of information for machine learning (ML)-based clinical informatics applications, including abnormality tracking systems, research cohort identification, point-of-care summarization, semi-automated report writing, and as a source of weak data labels for training image processing systems. Clinical ML systems must be interpretable to ensure user trust. To create interpretable models applicable to all of these tasks, we can build general-purpose systems which extract all relevant human-level assertions or “facts” documented in reports; identifying these facts is an information extraction (IE) task. Previous IE work in radiology has focused on a limited set of information, and extracts isolated entities (i.e., single words such as “lesion” or “cyst”) rather than complete facts, which require the linking of multiple entities and modifiers. Here, we develop a prototype system to extract all useful information in abdominopelvic radiology reports (findings, recommendations, clinical history, procedures, imaging indications and limitations, etc.), in the form of complete, contextualized facts. We construct an information schema to capture the bulk of information in reports, develop real-time ML models to extract this information, and demonstrate the feasibility and performance of the system.
Appendix
Available only for authorised users
Literature
2.
go back to reference European Society of Radiology (ESR): ESR paper on structured reporting in radiology. Insights Imaging 9(1):1–7, 2018CrossRef European Society of Radiology (ESR): ESR paper on structured reporting in radiology. Insights Imaging 9(1):1–7, 2018CrossRef
3.
go back to reference Singh S: Natural Language Processing for Information Extraction. arXiv [cs.CL], 2018 Singh S: Natural Language Processing for Information Extraction. arXiv [cs.CL], 2018
4.
go back to reference Hassanpour S, Langlotz CP: Information extraction from multi-institutional radiology reports. Artif Intell Med 66:29–39, 2016CrossRefPubMed Hassanpour S, Langlotz CP: Information extraction from multi-institutional radiology reports. Artif Intell Med 66:29–39, 2016CrossRefPubMed
5.
go back to reference Nandhakumar N, Sherkat E, Milios EE, Gu H, Butler M: Clinically Significant Information Extraction from Radiology Reports. In: Proceedings of the 2017 ACM Symposium on Document Engineering - DocEng ’17. ACM Press, 2017, pp 153–162 Nandhakumar N, Sherkat E, Milios EE, Gu H, Butler M: Clinically Significant Information Extraction from Radiology Reports. In: Proceedings of the 2017 ACM Symposium on Document Engineering - DocEng ’17. ACM Press, 2017, pp 153–162
6.
go back to reference Esuli A, Marcheggiani D, Sebastiani F: An enhanced CRFs-based system for information extraction from radiology reports. J. Biomed. Inform. 46:425–435, 2013CrossRefPubMed Esuli A, Marcheggiani D, Sebastiani F: An enhanced CRFs-based system for information extraction from radiology reports. J. Biomed. Inform. 46:425–435, 2013CrossRefPubMed
7.
go back to reference Cornegruta S, Bakewell R, Withey S, Montana G: Modelling Radiological Language with Bidirectional Long Short-Term Memory Networks. arXiv [cs.CL], 2016 Cornegruta S, Bakewell R, Withey S, Montana G: Modelling Radiological Language with Bidirectional Long Short-Term Memory Networks. arXiv [cs.CL], 2016
8.
go back to reference Pawar S, Palshikar GK, Bhattacharyya P: Relation Extraction: A Survey. arXiv [cs.CL], 2017 Pawar S, Palshikar GK, Bhattacharyya P: Relation Extraction: A Survey. arXiv [cs.CL], 2017
9.
go back to reference Huang L, Sil A, Ji H, Florian R: Improving Slot Filling Performance with Attentive Neural Networks on Dependency Structures. arXiv [cs.CL], 2017 Huang L, Sil A, Ji H, Florian R: Improving Slot Filling Performance with Attentive Neural Networks on Dependency Structures. arXiv [cs.CL], 2017
10.
go back to reference Michael J, Stanovsky G, He L, Dagan I, Zettlemoyer L: Crowdsourcing Question-Answer Meaning Representations Michael J, Stanovsky G, He L, Dagan I, Zettlemoyer L: Crowdsourcing Question-Answer Meaning Representations
11.
go back to reference Rajpurkar P, Zhang J, Lopyrev K, Liang P: SQuAD: 100,000+ Questions for Machine Comprehension of Text. arXiv [cs.CL], 2016 Rajpurkar P, Zhang J, Lopyrev K, Liang P: SQuAD: 100,000+ Questions for Machine Comprehension of Text. arXiv [cs.CL], 2016
12.
go back to reference Devlin J, Chang M-W, Lee K, Toutanova K: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv [cs.CL], 2018 Devlin J, Chang M-W, Lee K, Toutanova K: BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding. arXiv [cs.CL], 2018
13.
go back to reference Vaswani A, et al: Attention Is All You Need. arXiv [cs.CL], 2017 Vaswani A, et al: Attention Is All You Need. arXiv [cs.CL], 2017
Metadata
Title
Toward Complete Structured Information Extraction from Radiology Reports Using Machine Learning
Authors
Jackson M. Steinkamp
Charles Chambers
Darco Lalevic
Hanna M. Zafar
Tessa S. Cook
Publication date
01-08-2019
Publisher
Springer International Publishing
Published in
Journal of Imaging Informatics in Medicine / Issue 4/2019
Print ISSN: 2948-2925
Electronic ISSN: 2948-2933
DOI
https://doi.org/10.1007/s10278-019-00234-y

Other articles of this Issue 4/2019

Journal of Digital Imaging 4/2019 Go to the issue