Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2019

Open Access 01-12-2019 | Magnetic Resonance Imaging | Research article

The implementation of natural language processing to extract index lesions from breast magnetic resonance imaging reports

Authors: Yi Liu, Qing Liu, Chao Han, Xiaodong Zhang, Xiaoying Wang

Published in: BMC Medical Informatics and Decision Making | Issue 1/2019

Login to get access

Abstract

Background

There are often multiple lesions in breast magnetic resonance imaging (MRI) reports and radiologists usually focus on describing the index lesion that is most crucial to clinicians in determining the management and prognosis of patients. Natural language processing (NLP) has been used for information extraction from mammography reports. However, few studies have investigated NLP in breast MRI data based on free-form text. The objective of the current study was to assess the validity of our NLP program to accurately extract index lesions and their corresponding imaging features from free-form text of breast MRI reports.

Methods

This cross-sectional study examined 1633 free-form text reports of breast MRIs from 2014 to 2017. First, the NLP system was used to extract 9 features from all the lesions in the reports according to the Breast Imaging Reporting and Data System (BI-RADS) descriptors. Second, the index lesion was defined as the lesion with the largest number of imaging features. Third, we extracted the values of each imaging feature and the BI-RADS category from each index lesion. To evaluate the accuracy of our system, 478 reports were manually reviewed by two individuals. The time taken to extract data by NLP was compared with that by reviewers.

Results

The NLP system extracted 889 lesions from 478 reports. The mean number of imaging features per lesion was 6.5 ± 2.1 (range: 3–9; 95% CI: 6.362–6.638). The mean number of imaging features per index lesion was 8.0 ± 1.1 (range: 5–9; 95% CI: 7.901–8.099). The NLP system demonstrated a recall of 100.0% and a precision of 99.6% for correct identification of the index lesion. The recall and precision of NLP to correctly extract the value of imaging features from the index lesions were 91.0 and 92.6%, respectively. The recall and precision for the correct identification of the BI-RADS categories were 96.6 and 94.8%, respectively. NLP generated the total results in less than 1 s, whereas the manual reviewers averaged 4.47 min and 4.56 min per report.

Conclusions

Our NLP method successfully extracted the index lesion and its corresponding information from free-form text.
Literature
8.
go back to reference Zhang Y, Fukatsu H, Naganawa S, Satake H, Sato Y, Ohiwa M, et al. The role of contrast-enhanced MR mammography for determining candidates for breast conservation surgery. Breast Cancer. 2002;9(3):231–9.CrossRefPubMed Zhang Y, Fukatsu H, Naganawa S, Satake H, Sato Y, Ohiwa M, et al. The role of contrast-enhanced MR mammography for determining candidates for breast conservation surgery. Breast Cancer. 2002;9(3):231–9.CrossRefPubMed
11.
go back to reference Meystre SM, Savova GK, Kipper-Schuler KC, Hurdle JF. Extracting information from textual documents in the electronic health record: a review of recent research. Yearb Med Inform. 2008;1:128–44. Meystre SM, Savova GK, Kipper-Schuler KC, Hurdle JF. Extracting information from textual documents in the electronic health record: a review of recent research. Yearb Med Inform. 2008;1:128–44.
16.
17.
go back to reference Jain NL, Friedman C. Identification of findings suspicious for breast cancer based on natural language processing of mammogram reports. Proc AMIA Annu Fall Symp. 1997:829–33. Jain NL, Friedman C. Identification of findings suspicious for breast cancer based on natural language processing of mammogram reports. Proc AMIA Annu Fall Symp. 1997:829–33.
19.
go back to reference Smitherman E, Hernandez A, Stavinoha PL, Huang R, Kernie SG, Diaz-Arrastia R, Miles DK. Predicting outcomes after pediatric traumatic brain injury by early magnetic resonance imaging lesion location and volume. J Neurotrauma. 2016 1;33(1):35–48.CrossRefPubMedPubMedCentral Smitherman E, Hernandez A, Stavinoha PL, Huang R, Kernie SG, Diaz-Arrastia R, Miles DK. Predicting outcomes after pediatric traumatic brain injury by early magnetic resonance imaging lesion location and volume. J Neurotrauma. 2016 1;33(1):35–48.CrossRefPubMedPubMedCentral
20.
go back to reference Liu D, Scalzo F, Starkman S, Rao NM, Hinman JD, Kim D, et al. DWI lesion patterns predict outcome in stroke patients with thrombolysis. Cerebrovasc Dis. 2015;40(5–6):279–85.CrossRefPubMed Liu D, Scalzo F, Starkman S, Rao NM, Hinman JD, Kim D, et al. DWI lesion patterns predict outcome in stroke patients with thrombolysis. Cerebrovasc Dis. 2015;40(5–6):279–85.CrossRefPubMed
21.
go back to reference Allemani C, Minicozzi P, Berrino F, Bastiaannet E, Gavin A, Galceran J, et al. Predictions of survival up to 10 years after diagnosis for European women with breast cancer in 2000-2002. Int J Cancer. 2013 May 15;132(10):2404–12.CrossRefPubMed Allemani C, Minicozzi P, Berrino F, Bastiaannet E, Gavin A, Galceran J, et al. Predictions of survival up to 10 years after diagnosis for European women with breast cancer in 2000-2002. Int J Cancer. 2013 May 15;132(10):2404–12.CrossRefPubMed
24.
go back to reference Chaudhry B, Wang J, Wu S, Maglione M, Mojica W, Roth E, et al. Systematic review: impact of health information technology on quality, efficiency, and costs of medical care. Ann Intern Med. 2006;144(10):742–52.CrossRefPubMed Chaudhry B, Wang J, Wu S, Maglione M, Mojica W, Roth E, et al. Systematic review: impact of health information technology on quality, efficiency, and costs of medical care. Ann Intern Med. 2006;144(10):742–52.CrossRefPubMed
Metadata
Title
The implementation of natural language processing to extract index lesions from breast magnetic resonance imaging reports
Authors
Yi Liu
Qing Liu
Chao Han
Xiaodong Zhang
Xiaoying Wang
Publication date
01-12-2019
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2019
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-019-0997-3

Other articles of this Issue 1/2019

BMC Medical Informatics and Decision Making 1/2019 Go to the issue