Top

BMC Medical Informatics and Decision Making

Published in:

Open Access 01-12-2023 | Neck Pain | Research

Automatic literature screening using the PAJO deep-learning model for clinical practice guidelines

Authors: Yucong Lin, Jia Li, Huan Xiao, Lujie Zheng, Ying Xiao, Hong Song, Jingfan Fan, Deqiang Xiao, Danni Ai, Tianyu Fu, Feifei Wang, Han Lv, Jian Yang

Published in: BMC Medical Informatics and Decision Making | Issue 1/2023

Abstract

Background

Clinical practice guidelines (CPGs) are designed to assist doctors in clinical decision making. High-quality research articles are important for the development of good CPGs. Commonly used manual screening processes are time-consuming and labor-intensive. Artificial intelligence (AI)-based techniques have been widely used to analyze unstructured data, including texts and images. Currently, there are no effective/efficient AI-based systems for screening literature. Therefore, developing an effective method for automatic literature screening can provide significant advantages.

Methods

Using advanced AI techniques, we propose the Paper title, Abstract, and Journal (PAJO) model, which treats article screening as a classification problem. For training, articles appearing in the current CPGs are treated as positive samples. The others are treated as negative samples. Then, the features of the texts (e.g., titles and abstracts) and journal characteristics are fully utilized by the PAJO model using the pretrained bidirectional-encoder-representations-from-transformers (BERT) model. The resulting text and journal encoders, along with the attention mechanism, are integrated in the PAJO model to complete the task.

Results

We collected 89,940 articles from PubMed to construct a dataset related to neck pain. Extensive experiments show that the PAJO model surpasses the state-of-the-art baseline by 1.91% (F1 score) and 2.25% (area under the receiver operating characteristic curve). Its prediction performance was also evaluated with respect to subject-matter experts, proving that PAJO can successfully screen high-quality articles.

Conclusions

The PAJO model provides an effective solution for automatic literature screening. It can screen high-quality articles on neck pain and significantly improve the efficiency of CPG development. The methodology of PAJO can also be easily extended to other diseases for literature screening.

https://pubmed.ncbi.nlm.nih.gov/.

Chen Y, Yang K, Marušić A, Qaseem A, Meerpohl JJ, Flottorp S, et al. A reporting tool for practice guidelines in health care: the RIGHT statement. Ann Intern Med. 2017;166:128–32.CrossRefPubMed

Shekelle PG. Clinical practice guidelines: what’s Next? J Am Med Assoc. 2018;320:757–8.CrossRef

Fire M, Guestrin C. Over-optimization of academic publishing metrics: observing Goodhart’s Law in action. GigaScience. 2019;8(6):giz053.CrossRefPubMedPubMedCentral

Harmsen W, de Groot J, Harkema A, van Dusseldorp I, De Bruin J, Van den Brand S et al. Artificial intelligence supports literature screening in medical guideline development: Towards up-to-date medical guidelines. Medicine. 2021. https://doi.org/10.5281/ZENODO.5031907.

Feng Y, Liang S, Zhang Y, Chen S, Wang Q, Huang T, et al. Automated medical literature screening using artificial intelligence: a systematic review and meta-analysis. J Am Med Inform Assoc. 2022;29:1425–32.CrossRefPubMedPubMedCentral

Dessi D, Helaoui R, Kumar V et al. TF-IDF vs word embeddings for morbidity identification in clinical notes: an initial study. 2021;DOI https://doi.org/10.5281/zenodo.4777594.

Kumar V, Recupero DR, Riboni D, et al. Ensembling classical machine learning and deep learning approaches for morbidity identification from clinical notes. IEEE Access. 2020;9:7107–26.CrossRef

Wu S, Roberts K, Datta S, Du J, Ji Z, Si Y, et al. Deep learning in clinical natural language processing: a methodical review. J Am Med Inform Assoc. 2020;27:457–70.CrossRefPubMed

Mullenbach J, Wiegreffe S, Duke J, Sun J, Eisenstein J. Explainable prediction of medical codes from clinical text. 2018; DOI:https://doi.org/10.18653/v1/N18-1100.

10.

Prabhakar SK, Won DO. Medical text classification using hybrid deep learning models with multihead attention. Comput Intell Neurosci. 2021;2021:9425655.CrossRefPubMedPubMedCentral

11.

Zhang Y, Liang S, Feng Y, Wang Q, Sun F, Chen S, et al. Automation of literature screening using machine learning in medical evidence synthesis: a diagnostic test accuracy systematic review protocol. Syst Rev. 2022;11:11.CrossRefPubMedPubMedCentral

12.

Lan Z, Chen M, Goodman S, Gimpel K, Sharma P, Soricut R. ALBERT: A lite BERT for self-supervised learning of language representations. In International Conference on Learning Representations. 2020:1311–28.

13.

Beltagy I, Lo K, Cohan A. SciBERT: A Pretrained Language Model for Scientific Text. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing. 2019:3615–20.

14.

Lee J, Yoon W, Kim S, Kim D, Kim S, So CH, et al. BioBERT: a pre-trained biomedical language representation model for biomedical text mining. Bioinformatics. 2020;36:1234–40.CrossRefPubMed

15.

Moen H, Alhuwail D, Björne J, et al. Towards Automated Screening of Literature on Artificial Intelligence in Nursing. Stud Health Technol Inform. 2022;290:637–40.

16.

Kumar V, Recupero DR, Helaoui R, et al. K-LM: knowledge augmenting in Language Models within the Scholarly Domain. IEEE Access. 2022;10:91802–15.CrossRef

17.

Lin TY, Goyal P, Girshick R, He K, Dollar P. Focal loss for dense object detection. IEEE Trans Pattern Anal Mach Intell. 2020;42:318–27.CrossRefPubMed

18.

Gu Y, Tinn R, Cheng H, et al. Domain-specific language model pretraining for biomedical natural language processing. ACM Trans Comput Healthc. 2021;3(1):1–23.

19.

Garfield E. The history and meaning of the journal impact factor. J Am Med Assoc. 2006;295:90–3.CrossRef

20.

Van Noorden R. Impact factor gets heavyweight rival. J Cit Rep. 2016;30:20.

21.

Falagas ME, Kouranos VD, Arencibia-Jorge R, Karageorgopoulos DE. Comparison of SCImago journal rank indicator with journal impact factor. FASEB J. 2008;22:2623–8.CrossRefPubMed

22.

Leydesdorff L, Opthof T. Scopus’s source normalized impact per paper (SNIP) versus a journal impact factor based on fractional counting of citations. J Am Soc Inf Sci. 2010;61:2365–9.CrossRef

23.

Roldan-Valadez E, Salazar-Ruiz SY, Ibarra-Contreras R, Rios C. Current concepts on bibliometrics: a brief review about impact factor, eigenfactor score, CiteScore, SCImago journal rank, source-normalised impact per paper, H-index, and alternative metrics. Ir J Med Sci. 2019;188:939–51.CrossRefPubMed

24.

Devlin J, Chang MW, Lee K, Toutanova K, Bert. Pre-training of deep bidirectional transformers for language understanding. In Annual Conference of the North American Chapter of the Association for Computational Linguistics. 2019:4171–86.

25.

Sun Y, Li Y, Zeng Q, et al. Application research of text classification based on random forest algorithm. In 2020 3rd International Conference on Advanced Electronic Materials, Computers and Software Engineering (AEMCSE). 2020:370–4.

26.

Aseervatham S, Antoniadis A, Gaussier E, Burlet M, Denneulin Y. A sparse version of the ridge logistic regression for large-scale text categorization. Pattern Recognit Lett. 2011;32:101–6.CrossRef

27.

Qing L, Linhong W, Xuehai D. A novel neural network-based method for medical text classification. Future Internet. 2019;11:255.CrossRef

28.

Deng J, Cheng L, Wang Z. Attention-based BiLSTM fused CNN with gating mechanism model for chinese long text classification. Comput Speech Lang. 2021;68:101182.CrossRef

29.

Kim Y. Convolutional neural networks for sentence classification. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP). 2014:1746–51.

30.

Lai S, Xu L, Liu K, et al. Recurrent convolutional neural networks for text classification. In The 29th AAAI Conference on Artificial Intelligence. 2015:2267–73.

31.

Pan L, Lim WH, Gan Y. A method of Sustainable Development for three Chinese short-text datasets based on BERT-CAM. Electronics. 2023;12(7):1531.CrossRef

32.

Mingyu J, Jiawei Z, Ning W. AFR-BERT: attention-based mechanism feature relevance fusion multimodal sentiment analysis model. PLoS ONE. 2022;17(9):e0273936.CrossRefPubMedPubMedCentral

Title: Automatic literature screening using the PAJO deep-learning model for clinical practice guidelines
Authors: Yucong Lin
Jia Li
Huan Xiao
Lujie Zheng
Ying Xiao
Hong Song
Jingfan Fan
Deqiang Xiao
Danni Ai
Tianyu Fu
Feifei Wang
Han Lv
Jian Yang
Publication date: 01-12-2023
Publisher: BioMed Central
Keyword: Neck Pain
Published in: BMC Medical Informatics and Decision Making / Issue 1/2023
Electronic ISSN: 1472-6947
DOI: https://doi.org/10.1186/s12911-023-02328-8

Keynote webinar | Spotlight on medication adherence

Springer Medicine

Automatic literature screening using the PAJO deep-learning model for clinical practice guidelines

Abstract

Background

Methods

Results

Conclusions

Keynote webinar | Spotlight on medication adherence

Springer Medicine

Abstract

Background

Methods

Results

Conclusions

Please log in to get access to this content

Other articles of this Issue 1/2023

Development and usability evaluation of a mHealth application for albinism self-management

Correction to: Adaptation and validation of a coding algorithm for the Charlson Comorbidity Index in administrative claims data using the SNOMED CT standardized vocabulary

Resource use and cost associated with computerized decision support system and usual care in managing patients with atrial fibrillation: analysis of IMPACT-AF randomized trial data

Interpreting deep learning models for glioma survival classification using visualization and textual explanations

Forecasting the daily demand for emergency medical ambulances in England and Wales: a benchmark model and external validation

Evaluating machine learning algorithms to Predict 30-day Unplanned REadmission (PURE) in Urology patients