Skip to main content
Top
Published in:

Open Access 01-12-2024 | Thyroid Disease | Research

Enhanced interpretable thyroid disease diagnosis by leveraging synthetic oversampling and machine learning models

Authors: Ali Raza, Fatma Eid, Elisabeth Caro Montero, Irene Delgado Noya, Imran Ashraf

Published in: BMC Medical Informatics and Decision Making | Issue 1/2024

Login to get access

Abstract

Thyroid illness encompasses a range of disorders affecting the thyroid gland, leading to either hyperthyroidism or hypothyroidism, which can significantly impact metabolism and overall health. Hypothyroidism can cause a slowdown in bodily processes, leading to symptoms such as fatigue, weight gain, depression, and cold sensitivity. Hyperthyroidism can lead to increased metabolism, causing symptoms like rapid weight loss, anxiety, irritability, and heart palpitations. Prompt diagnosis and appropriate treatment are crucial in managing thyroid disorders and improving patients’ quality of life. Thyroid illness affects millions worldwide and can significantly impact their quality of life if left untreated. This research aims to propose an effective artificial intelligence-based approach for the early diagnosis of thyroid illness. An open-access thyroid disease dataset based on 3,772 male and female patient observations is used for this research experiment. This study uses the nominal continuous synthetic minority oversampling technique (SMOTE-NC) for data balancing and a fine-tuned light gradient booster machine (LGBM) technique to diagnose thyroid illness and handle class imbalance problems. The proposed SNL (SMOTE-NC-LGBM) approach outperformed the state-of-the-art approach with high-accuracy performance scores of 0.96. We have also applied advanced machine learning and deep learning methods for comparison to evaluate performance. Hyperparameter optimizations are also conducted to enhance thyroid diagnosis performance. In addition, we have applied the explainable Artificial Intelligence (XAI) mechanism based on Shapley Additive exPlanations (SHAP) to enhance the transparency and interpretability of the proposed method by analyzing the decision-making processes. The proposed research revolutionizes the diagnosis of thyroid disorders efficiently and helps specialties overcome thyroid disorders early.
Literature
1.
go back to reference Economidou F, Douka E, Tzanela M, Nanas S, Kotanidou A. Thyroid function during critical illness. Hormones. 2011;10(2):117–24.CrossRefPubMed Economidou F, Douka E, Tzanela M, Nanas S, Kotanidou A. Thyroid function during critical illness. Hormones. 2011;10(2):117–24.CrossRefPubMed
2.
go back to reference De Luca R, Davis PJ, Lin HY, Gionfra F, Percario ZA, Affabris E, et al. Thyroid hormones interaction with immune response, inflammation and non-thyroidal illness syndrome. Front Cell Dev Biol. 2021;8:614030.CrossRefPubMedPubMedCentral De Luca R, Davis PJ, Lin HY, Gionfra F, Percario ZA, Affabris E, et al. Thyroid hormones interaction with immune response, inflammation and non-thyroidal illness syndrome. Front Cell Dev Biol. 2021;8:614030.CrossRefPubMedPubMedCentral
3.
go back to reference Sinkó R, Mohácsik P, Kővári D, Penksza V, Wittmann G, Mácsai L, et al. Different hypothalamic mechanisms control decreased circulating thyroid hormone levels in infection and fasting-induced Non-Thyroidal Illness Syndrome in male Thyroid Hormone Action Indicator Mice. Thyroid. 2023;33(1):109–18.CrossRefPubMedPubMedCentral Sinkó R, Mohácsik P, Kővári D, Penksza V, Wittmann G, Mácsai L, et al. Different hypothalamic mechanisms control decreased circulating thyroid hormone levels in infection and fasting-induced Non-Thyroidal Illness Syndrome in male Thyroid Hormone Action Indicator Mice. Thyroid. 2023;33(1):109–18.CrossRefPubMedPubMedCentral
4.
go back to reference Sipos JA, Ringel MD. Molecular testing in thyroid cancer diagnosis and management. Best Pract Res Clin Endocrinol Metab. 2023;37(1):101680.CrossRefPubMed Sipos JA, Ringel MD. Molecular testing in thyroid cancer diagnosis and management. Best Pract Res Clin Endocrinol Metab. 2023;37(1):101680.CrossRefPubMed
5.
6.
go back to reference Riis J, Kragholm K, Torp-Pedersen C, Andersen S. Association between thyroid function, nursing home admission and mortality in community-dwelling adults over 80 years. Arch Gerontol Geriatr. 2023;104:104806.CrossRefPubMed Riis J, Kragholm K, Torp-Pedersen C, Andersen S. Association between thyroid function, nursing home admission and mortality in community-dwelling adults over 80 years. Arch Gerontol Geriatr. 2023;104:104806.CrossRefPubMed
7.
go back to reference Purohit J, Barjatya R, Kataria SK. Evaluation of Hyperprolactinemia and Thyroid Disorder among Women with Dysfunctional Uterine Bleeding at Tertiary Care Hospital of western Rajasthan. Sch Int J Anat Physiol. 2023;6(5):61–3. Purohit J, Barjatya R, Kataria SK. Evaluation of Hyperprolactinemia and Thyroid Disorder among Women with Dysfunctional Uterine Bleeding at Tertiary Care Hospital of western Rajasthan. Sch Int J Anat Physiol. 2023;6(5):61–3.
8.
go back to reference Zhang X, Lee VC, Rong J, Liu F, Kong H. Multi-channel convolutional neural network architectures for thyroid cancer detection. PLoS ONE. 2022;17(1):e0262128.CrossRefPubMedPubMedCentral Zhang X, Lee VC, Rong J, Liu F, Kong H. Multi-channel convolutional neural network architectures for thyroid cancer detection. PLoS ONE. 2022;17(1):e0262128.CrossRefPubMedPubMedCentral
9.
go back to reference Fiorentino V, Pizzimenti C, Franchina M, Micali MG, Russotto F, Pepe L, et al. The minefield of indeterminate thyroid nodules: could artificial intelligence be a suitable diagnostic tool? Diagn Histopathology. USA: Elsevier; 2023. Fiorentino V, Pizzimenti C, Franchina M, Micali MG, Russotto F, Pepe L, et al. The minefield of indeterminate thyroid nodules: could artificial intelligence be a suitable diagnostic tool? Diagn Histopathology. USA: Elsevier; 2023.
10.
go back to reference Aversano L, Bernardi ML, Cimitile M, Maiellaro A, Pecori R. A systematic review on artificial intelligence techniques for detecting thyroid diseases. PeerJ Comput Sci. 2023;9:e1394.CrossRefPubMedPubMedCentral Aversano L, Bernardi ML, Cimitile M, Maiellaro A, Pecori R. A systematic review on artificial intelligence techniques for detecting thyroid diseases. PeerJ Comput Sci. 2023;9:e1394.CrossRefPubMedPubMedCentral
11.
go back to reference Imans D, Abuhmed T, Alharbi M, El-Sappagh S. Explainable Multi-Layer Dynamic Ensemble Framework Optimized for Depression Detection and Severity Assessment. Diagnostics. 2024;14(21):2385.CrossRefPubMedPubMedCentral Imans D, Abuhmed T, Alharbi M, El-Sappagh S. Explainable Multi-Layer Dynamic Ensemble Framework Optimized for Depression Detection and Severity Assessment. Diagnostics. 2024;14(21):2385.CrossRefPubMedPubMedCentral
12.
go back to reference Saleh H, El-Rashidy N, Abuhmed T, El-Sappagh SLSTM, deep learning model for Alzheimer’s disease prediction based on cost-effective time series cognitive scores. In: 2023 5th Novel Intelligent and Leading Emerging Sciences Conference (NILES). IEEE; 2023. pp. 1–6. Saleh H, El-Rashidy N, Abuhmed T, El-Sappagh SLSTM, deep learning model for Alzheimer’s disease prediction based on cost-effective time series cognitive scores. In: 2023 5th Novel Intelligent and Leading Emerging Sciences Conference (NILES). IEEE; 2023. pp. 1–6.
13.
go back to reference Rahim N, El-Sappagh S, Rizk H, El-serafy OA, Abuhmed T. Information fusion-based Bayesian optimized heterogeneous deep ensemble model based on longitudinal neuroimaging data. Appl Soft Comput. 2024;162:111749.CrossRef Rahim N, El-Sappagh S, Rizk H, El-serafy OA, Abuhmed T. Information fusion-based Bayesian optimized heterogeneous deep ensemble model based on longitudinal neuroimaging data. Appl Soft Comput. 2024;162:111749.CrossRef
16.
go back to reference Nandi-Munshi D, Taplin CE. Thyroid-related neurological disorders and complications in children. Pediatr Neurol. 2015;52(4):373–82.CrossRefPubMed Nandi-Munshi D, Taplin CE. Thyroid-related neurological disorders and complications in children. Pediatr Neurol. 2015;52(4):373–82.CrossRefPubMed
17.
go back to reference Hossain MB, Shama A, Adhikary A, Raha AD, Uddin KA, Hossain MA, et al. An Explainable Artificial Intelligence Framework for the Predictive Analysis of Hypo and Hyper Thyroidism Using Machine Learning Algorithms. Hum-Centric Intell Syst. 2023;3:1–21. Hossain MB, Shama A, Adhikary A, Raha AD, Uddin KA, Hossain MA, et al. An Explainable Artificial Intelligence Framework for the Predictive Analysis of Hypo and Hyper Thyroidism Using Machine Learning Algorithms. Hum-Centric Intell Syst. 2023;3:1–21.
19.
go back to reference Islam SS, Haque MS, Miah MSU, Sarwar TB, Nugraha R. Application of machine learning algorithms to predict the thyroid disease risk: an experimental comparative study. PeerJ Comput Sci. 2022;8:e898.CrossRefPubMedPubMedCentral Islam SS, Haque MS, Miah MSU, Sarwar TB, Nugraha R. Application of machine learning algorithms to predict the thyroid disease risk: an experimental comparative study. PeerJ Comput Sci. 2022;8:e898.CrossRefPubMedPubMedCentral
21.
go back to reference Gök EC, Olgun MO. SMOTE-NC and gradient boosting imputation based random forest classifier for predicting severity level of covid-19 patients with blood samples. Neural Comput & Applic. 2021;33(22):15693–707.CrossRef Gök EC, Olgun MO. SMOTE-NC and gradient boosting imputation based random forest classifier for predicting severity level of covid-19 patients with blood samples. Neural Comput & Applic. 2021;33(22):15693–707.CrossRef
23.
go back to reference Wibowo W, Muhaimin A, Abdul-Rahman S. Predicting Internet Usage for Digital Finance Services: Multitarget Classification Using Vector Generalized Additive Model with SMOTE-NC. In: The International Conference on Data Science and Emerging Technologies. Springer; 2022. pp. 494–504. Wibowo W, Muhaimin A, Abdul-Rahman S. Predicting Internet Usage for Digital Finance Services: Multitarget Classification Using Vector Generalized Additive Model with SMOTE-NC. In: The International Conference on Data Science and Emerging Technologies. Springer; 2022. pp. 494–504.
24.
go back to reference Chen Jh, Zhang YQ, Zhu Tt, Zhang Q, Zhao Ax, Huang Y. Applying machine-learning models to differentiate benign and malignant thyroid nodules classified as C-TIRADS 4 based on 2D-ultrasound combined with five contrast-enhanced ultrasound key frames. Front Endocrinol. 2024;15:1299686. Chen Jh, Zhang YQ, Zhu Tt, Zhang Q, Zhao Ax, Huang Y. Applying machine-learning models to differentiate benign and malignant thyroid nodules classified as C-TIRADS 4 based on 2D-ultrasound combined with five contrast-enhanced ultrasound key frames. Front Endocrinol. 2024;15:1299686.
25.
go back to reference Brindha V, Muthukumaravel A. Efficient Method for the prediction of Thyroid Disease Classification Using Support Vector Machine and Logistic Regression. In: Computational Intelligence for Clinical Diagnosis. Springer; 2023. pp. 37–45. Brindha V, Muthukumaravel A. Efficient Method for the prediction of Thyroid Disease Classification Using Support Vector Machine and Logistic Regression. In: Computational Intelligence for Clinical Diagnosis. Springer; 2023. pp. 37–45.
26.
go back to reference Jakkulla PK, Ganesh KM, Jayapal PK, Malla SJ, Chandanapalli SB, Sandhya E. Selection of Features Using Adaptive Tunicate Swarm Algorithm with Optimized Deep Learning Model for Thyroid Disease Classification. Ingenierie Systemes Inf. 2023;28(2):299. Jakkulla PK, Ganesh KM, Jayapal PK, Malla SJ, Chandanapalli SB, Sandhya E. Selection of Features Using Adaptive Tunicate Swarm Algorithm with Optimized Deep Learning Model for Thyroid Disease Classification. Ingenierie Systemes Inf. 2023;28(2):299.
28.
go back to reference Alyas T, Hamid M, Alissa K, Faiz T, Tabassum N, Ahmad A. Empirical method for thyroid disease classification using a machine learning approach. BioMed Res Int. 2022;2022:932–80. Alyas T, Hamid M, Alissa K, Faiz T, Tabassum N, Ahmad A. Empirical method for thyroid disease classification using a machine learning approach. BioMed Res Int. 2022;2022:932–80.
31.
go back to reference Junaid M, Ali S, Eid F, El-Sappagh S, Abuhmed T. Explainable machine learning models based on multimodal time-series data for the early detection of Parkinson’s disease. Comput Methods Prog Biomed. 2023;234:107495.CrossRef Junaid M, Ali S, Eid F, El-Sappagh S, Abuhmed T. Explainable machine learning models based on multimodal time-series data for the early detection of Parkinson’s disease. Comput Methods Prog Biomed. 2023;234:107495.CrossRef
32.
go back to reference Bini F, Pica A, Azzimonti L, Giusti A, Ruinelli L, Marinozzi F, et al. Artificial intelligence in thyroid field–a comprehensive review. Cancers. 2021;13(19):4740.CrossRefPubMedPubMedCentral Bini F, Pica A, Azzimonti L, Giusti A, Ruinelli L, Marinozzi F, et al. Artificial intelligence in thyroid field–a comprehensive review. Cancers. 2021;13(19):4740.CrossRefPubMedPubMedCentral
33.
go back to reference Raza A, Munir K, Almutairi M. A novel deep learning approach for deepfake image detection. Appl Sci. 2022;12(19):9820.CrossRef Raza A, Munir K, Almutairi M. A novel deep learning approach for deepfake image detection. Appl Sci. 2022;12(19):9820.CrossRef
36.
go back to reference Ishtiaq A, Munir K, Raza A, Samee NA, Jamjoom MM, Ullah Z. Product Helpfulness Detection With Novel Transformer Based BERT Embedding and Class Probability Features. IEEE Access. 2024;12:55905–17. Ishtiaq A, Munir K, Raza A, Samee NA, Jamjoom MM, Ullah Z. Product Helpfulness Detection With Novel Transformer Based BERT Embedding and Class Probability Features. IEEE Access. 2024;12:55905–17.
37.
go back to reference Khalid M, Raza A, Younas F, Rustam F, Villar MG, Ashraf I, et al. Novel Sentiment Majority Voting Classifier and Transfer Learning-based Feature Engineering for Sentiment Analysis of Deepfake Tweets. IEEE Access. 2024;12:67117–29. Khalid M, Raza A, Younas F, Rustam F, Villar MG, Ashraf I, et al. Novel Sentiment Majority Voting Classifier and Transfer Learning-based Feature Engineering for Sentiment Analysis of Deepfake Tweets. IEEE Access. 2024;12:67117–29.
38.
go back to reference Younas F, Raza A, Thalji N, Abualigah L, Zitar RA, Jia H. An efficient artificial intelligence approach for early detection of cross-site scripting attacks. Decis Anal J. 2024;11:100466.CrossRef Younas F, Raza A, Thalji N, Abualigah L, Zitar RA, Jia H. An efficient artificial intelligence approach for early detection of cross-site scripting attacks. Decis Anal J. 2024;11:100466.CrossRef
39.
go back to reference Darawsheh SR, Al-Shaar AS, Haziemeh FA, Alshurideh MT. Classification Thyroid Disease Using Multinomial Logistic Regressions (LR). In: The Effect of Information Technology on Business and Marketing Intelligence Systems. Springer; 2023. pp. 645–659. Darawsheh SR, Al-Shaar AS, Haziemeh FA, Alshurideh MT. Classification Thyroid Disease Using Multinomial Logistic Regressions (LR). In: The Effect of Information Technology on Business and Marketing Intelligence Systems. Springer; 2023. pp. 645–659.
40.
go back to reference Raza A, Siddiqui HUR, Munir K, Almutairi M, Rustam F, Ashraf I. Ensemble learning-based feature engineering to analyze maternal health during pregnancy and health risk prediction. PLoS ONE. 2022;17(11):e0276525.CrossRefPubMedPubMedCentral Raza A, Siddiqui HUR, Munir K, Almutairi M, Rustam F, Ashraf I. Ensemble learning-based feature engineering to analyze maternal health during pregnancy and health risk prediction. PLoS ONE. 2022;17(11):e0276525.CrossRefPubMedPubMedCentral
41.
go back to reference Chen Z, Ying TC, Chen J, Wang Y, Wu C, Su Z. Assessment of Renal Fibrosis in Patients With Chronic Kidney Disease Using Shear Wave Elastography and Clinical Features: A Random Forest Approach. Ultrasound Med Biol. 2023;49(7):1665–71.CrossRefPubMed Chen Z, Ying TC, Chen J, Wang Y, Wu C, Su Z. Assessment of Renal Fibrosis in Patients With Chronic Kidney Disease Using Shear Wave Elastography and Clinical Features: A Random Forest Approach. Ultrasound Med Biol. 2023;49(7):1665–71.CrossRefPubMed
42.
go back to reference Mohi Uddin KM, Biswas N, Rikta ST, Dey SK, Qazi A. XML-LightGBMDroid: A self-driven interactive mobile application utilizing explainable machine learning for breast cancer diagnosis. Eng Rep. 2023;11:e12666. Mohi Uddin KM, Biswas N, Rikta ST, Dey SK, Qazi A. XML-LightGBMDroid: A self-driven interactive mobile application utilizing explainable machine learning for breast cancer diagnosis. Eng Rep. 2023;11:e12666.
43.
go back to reference Merkelbach K, Schaper S, Diedrich C, Fritsch SJ, Schuppert A. Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups. Sci Rep. 2023;13(1):4053.CrossRefPubMedPubMedCentral Merkelbach K, Schaper S, Diedrich C, Fritsch SJ, Schuppert A. Novel architecture for gated recurrent unit autoencoder trained on time series from electronic health records enables detection of ICU patient subgroups. Sci Rep. 2023;13(1):4053.CrossRefPubMedPubMedCentral
44.
go back to reference Wu X, Wang HY, Shi P, Sun R, Wang X, Luo Z, et al. Long short-term memory model-a deep learning approach for medical data with irregularity in cancer predication with tumor markers. Comput Biol Med. 2022;144:105362.CrossRefPubMed Wu X, Wang HY, Shi P, Sun R, Wang X, Luo Z, et al. Long short-term memory model-a deep learning approach for medical data with irregularity in cancer predication with tumor markers. Comput Biol Med. 2022;144:105362.CrossRefPubMed
46.
go back to reference Van der Velden BH, Kuijf HJ, Gilhuijs KG, Viergever MA. Explainable artificial intelligence (XAI) in deep learning-based medical image analysis. Med Image Anal. 2022;79:102470.CrossRefPubMed Van der Velden BH, Kuijf HJ, Gilhuijs KG, Viergever MA. Explainable artificial intelligence (XAI) in deep learning-based medical image analysis. Med Image Anal. 2022;79:102470.CrossRefPubMed
47.
go back to reference Ali S, Abuhmed T, El-Sappagh S, Muhammad K, Alonso-Moral JM, Confalonieri R, et al. Explainable Artificial Intelligence (XAI): what we know and what is left to attain Trustworthy Artificial Intelligence. Inf Fusion. 2023;99:101805.CrossRef Ali S, Abuhmed T, El-Sappagh S, Muhammad K, Alonso-Moral JM, Confalonieri R, et al. Explainable Artificial Intelligence (XAI): what we know and what is left to attain Trustworthy Artificial Intelligence. Inf Fusion. 2023;99:101805.CrossRef
48.
go back to reference Javed H, El-Sappagh S, Abuhmed T. Robustness in deep learning models for medical diagnostics: security and adversarial challenges towards robust AI applications. Artif Intell Rev. 2024;58(1):12.CrossRef Javed H, El-Sappagh S, Abuhmed T. Robustness in deep learning models for medical diagnostics: security and adversarial challenges towards robust AI applications. Artif Intell Rev. 2024;58(1):12.CrossRef
Metadata
Title
Enhanced interpretable thyroid disease diagnosis by leveraging synthetic oversampling and machine learning models
Authors
Ali Raza
Fatma Eid
Elisabeth Caro Montero
Irene Delgado Noya
Imran Ashraf
Publication date
01-12-2024
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2024
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-024-02780-0