
Open Access 01-12-2022 | Tinnitus | Research

Automatic text classification of actionable radiology reports of tinnitus patients using bidirectional encoder representations from transformer (BERT) and in-domain pre-training (IDPT)

Authors: Jia Li, Yucong Lin, Pengfei Zhao, Wenjuan Liu, Linkun Cai, Jing Sun, Lei Zhao, Zhenghan Yang, Hong Song, Han Lv, Zhenchang Wang

Published in: BMC Medical Informatics and Decision Making | Issue 1/2022


Abstract

Background

Given the increasing number of people suffering from tinnitus, accurately categorizing patients with actionable radiology reports is valuable for supporting clinical decision making. However, this process requires experienced physicians and considerable manual effort. Natural language processing (NLP) has shown great potential in large-scale analysis of medical text, yet its application to domain-specific analysis of radiology reports remains limited.

Objective

The aim of this study is to propose a novel approach for classifying actionable radiology reports of tinnitus patients using models based on bidirectional encoder representations from transformers (BERT), and to evaluate the benefits of in-domain pre-training (IDPT) along with a sequence adaptation strategy.

Methods

A total of 5864 temporal bone computed tomography (CT) reports were labeled by two experienced radiologists into three categories: (1) normal findings without notable lesions; (2) notable lesions uncorrelated with tinnitus; and (3) at least one lesion considered a potential cause of tinnitus. We then constructed a framework consisting of deep learning (DL) neural networks and self-supervised BERT models. A tinnitus domain-specific corpus was used to pre-train the BERT model and further improve its embedding weights. In addition, we evaluated multiple max-sequence-length settings in BERT to reduce computational cost. Finally, we identified the most promising approach by comparing F1-scores and AUC values across all models.
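The abstract does not include implementation details; as a rough sketch of how such a pipeline could be assembled, the following (assuming the Hugging Face transformers and datasets libraries, and a hypothetical tinnitus_reports.txt corpus file) first continues masked-language-model pre-training of a Chinese BERT on the domain corpus (IDPT) and then fine-tunes the resulting checkpoint as a three-class classifier.

```python
# Minimal sketch (not the authors' code): in-domain pre-training (IDPT) of a
# Chinese BERT on a tinnitus report corpus, followed by fine-tuning for the
# three report categories. File names and hyperparameters are illustrative.
from datasets import load_dataset
from transformers import (BertForMaskedLM, BertForSequenceClassification,
                          BertTokenizerFast, DataCollatorForLanguageModeling,
                          Trainer, TrainingArguments)

tokenizer = BertTokenizerFast.from_pretrained("bert-base-chinese")

# Step 1: IDPT -- continue masked-language-model training on the domain corpus.
corpus = load_dataset("text", data_files={"train": "tinnitus_reports.txt"})
tokenized = corpus["train"].map(
    lambda batch: tokenizer(batch["text"], truncation=True, max_length=128),
    batched=True, remove_columns=["text"])
mlm_model = BertForMaskedLM.from_pretrained("bert-base-chinese")
collator = DataCollatorForLanguageModeling(tokenizer, mlm_probability=0.15)
Trainer(model=mlm_model,
        args=TrainingArguments(output_dir="idpt-bert", num_train_epochs=3),
        train_dataset=tokenized,
        data_collator=collator).train()
mlm_model.save_pretrained("idpt-bert")
tokenizer.save_pretrained("idpt-bert")

# Step 2: fine-tune the IDPT checkpoint as a 3-class classifier
# (0 = normal, 1 = lesion unrelated to tinnitus, 2 = potential cause of tinnitus).
classifier = BertForSequenceClassification.from_pretrained("idpt-bert",
                                                           num_labels=3)
# A labeled dataset of (report text, label) pairs would then be tokenized in
# the same way and trained with another Trainer instance.
```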

Results

In the first experiment, the fine-tuned BERT model achieved a more promising result (AUC 0.868, F1 0.760) than the Word2Vec-based models (AUC 0.767, F1 0.733) on the validation data. In the second experiment, the BERT in-domain pre-training model (AUC 0.948, F1 0.841) performed significantly better than the baseline BERT model (AUC 0.868, F1 0.760). Additionally, among the BERT fine-tuning variants, Mengzi achieved the highest AUC of 0.878 (F1 0.764). Finally, a BERT max sequence length of 128 tokens achieved an AUC of 0.866 (F1 0.736), nearly equal to that of 512 tokens (AUC 0.868, F1 0.760).
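For context on how the two max-sequence-length settings could be compared, a minimal evaluation sketch follows; it assumes macro-averaged, one-vs-rest metrics computed with scikit-learn, since the exact averaging scheme is not specified in the abstract.

```python
# Illustrative sketch (assumed macro averaging, not stated in the abstract):
# computing the AUC and F1 values used to compare max-sequence-length settings.
import numpy as np
from sklearn.metrics import f1_score, roc_auc_score

def report_metrics(y_true, y_prob):
    """y_true: class ids in {0, 1, 2}; y_prob: (n, 3) predicted probabilities."""
    auc = roc_auc_score(y_true, y_prob, multi_class="ovr", average="macro")
    f1 = f1_score(y_true, y_prob.argmax(axis=1), average="macro")
    return auc, f1

# Toy example; in practice y_prob would come from running the fine-tuned model
# on the validation set once with max_length=128 and once with max_length=512.
y_true = np.array([0, 1, 2, 2, 1, 0])
y_prob = np.array([[0.8, 0.1, 0.1], [0.2, 0.6, 0.2], [0.1, 0.2, 0.7],
                   [0.2, 0.3, 0.5], [0.3, 0.5, 0.2], [0.6, 0.3, 0.1]])
print(report_metrics(y_true, y_prob))
```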

Conclusion

We developed a reliable BERT-based framework for tinnitus diagnosis from Chinese radiology reports, along with a sequence adaptation strategy that reduces computational resources while maintaining accuracy. The findings could provide a reference for NLP development on Chinese radiology reports.
Metadata
Publication date
01-12-2022
Publisher
BioMed Central
Keyword
Tinnitus
Published in
BMC Medical Informatics and Decision Making / Issue 1/2022
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-022-01946-y
