Open Access 01-12-2024 | Research

Hybrid architecture based intelligent diagnosis assistant for GP

Authors: Ruibin Wang, Kavisha Jayathunge, Rupert Page, Hailing Li, Jian Jun Zhang, Xiaosong Yang

Published in: BMC Medical Informatics and Decision Making | Issue 1/2024

Abstract

As the first point of contact for patients, General Practitioners (GPs) play a crucial role in the National Health Service (NHS). An accurate primary diagnosis from the GP can alleviate the burden on specialists and reduce the time spent re-confirming the patient’s condition, allowing further examinations to proceed more efficiently. However, GPs have broad but less specialized knowledge, which limits the accuracy of their diagnoses. It is therefore imperative to introduce an intelligent system to assist GPs in making decisions. This paper introduces two data augmentation methods, the Complaint Symptoms Integration Method and the Symptom Dot Separating Method, to integrate essential information into the Integration dataset. It also proposes a hybrid architecture that fuses word features from different representation spaces. Experiments demonstrate that, compared with commonly used pre-trained attention-based models, our hybrid architecture delivers the best classification performance for four common neurological diseases on the enhanced Integration dataset. For example, the BERT+CNN hybrid architecture reaches a classification accuracy of 0.897, an improvement of 5.1 percentage points over standalone BERT and standalone CNN, both of which achieve 0.846. Finally, this paper develops an AI diagnosis assistant web application that leverages this architecture to help GPs complete primary diagnoses efficiently and accurately.
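
The abstract does not specify how the two representation spaces are fused. As a rough illustration only, the PyTorch sketch below shows one plausible reading of a BERT+CNN hybrid for four-class diagnosis: the pooled BERT output (global context) is concatenated with max-pooled 1-D convolutional features computed over the BERT token states (local n-gram patterns). The class name, kernel sizes, dropout rate, and fusion-by-concatenation are assumptions for illustration, not the authors’ exact design.

```python
# Hypothetical BERT+CNN hybrid classifier; a minimal sketch, not the
# paper's published implementation.
import torch
import torch.nn as nn
from transformers import BertModel

class HybridBertCnnClassifier(nn.Module):
    def __init__(self, num_classes=4, kernel_sizes=(3, 4, 5), num_filters=100):
        super().__init__()
        self.bert = BertModel.from_pretrained("bert-base-uncased")
        hidden = self.bert.config.hidden_size  # 768 for bert-base
        # 1-D convolutions over the token sequence give a second,
        # locality-focused representation space alongside BERT's.
        self.convs = nn.ModuleList(
            [nn.Conv1d(hidden, num_filters, k) for k in kernel_sizes]
        )
        self.dropout = nn.Dropout(0.1)
        # Fused feature vector: pooled [CLS] output + max-pooled CNN maps.
        self.classifier = nn.Linear(
            hidden + num_filters * len(kernel_sizes), num_classes
        )

    def forward(self, input_ids, attention_mask):
        out = self.bert(input_ids=input_ids, attention_mask=attention_mask)
        tokens = out.last_hidden_state          # (B, T, H) contextual tokens
        cls = out.pooler_output                 # (B, H) global summary
        x = tokens.transpose(1, 2)              # (B, H, T) for Conv1d
        cnn_feats = [torch.relu(c(x)).max(dim=2).values for c in self.convs]
        fused = torch.cat([cls] + cnn_feats, dim=1)  # fuse both spaces
        return self.classifier(self.dropout(fused))  # (B, num_classes)
```

Trained end to end with a standard cross-entropy loss over the four disease labels, a model of this shape would let the CNN branch compensate for local symptom phrases that the pooled BERT vector alone can underweight, which is one way a concatenation-style fusion could outperform either component on its own.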
Metadata
Publisher
BioMed Central
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-023-02398-8
