Skip to main content
Top
Published in: BMC Public Health 1/2023

Open Access 01-12-2023 | COVID-19 | Research

Online COVID-19 diagnosis prediction using complete blood count: an innovative tool for public health

Authors: Xiaojing Teng, Zhiyi Wang

Published in: BMC Public Health | Issue 1/2023

Login to get access

Abstract

Background

COVID-19, caused by SARS-CoV-2, presents distinct diagnostic challenges due to its wide range of clinical manifestations and the overlapping symptoms with other common respiratory diseases. This study focuses on addressing these difficulties by employing machine learning (ML) methodologies, particularly the XGBoost algorithm, to utilize Complete Blood Count (CBC) parameters for predictive analysis.

Methods

We performed a retrospective study involving 2114 COVID-19 patients treated between December 2022 and January 2023 at our healthcare facility. These patients were classified into fever (1057 patients) and pneumonia groups (1057 patients), based on their clinical symptoms. The CBC data were utilized to create predictive models, with model performance evaluated through metrics like Area Under the Receiver Operating Characteristics Curve (AUC), accuracy, sensitivity, specificity, and precision. We selected the top 10 predictive variables based on their significance in disease prediction. The data were then split into a training set (70% of patients) and a validation set (30% of patients) for model validation.

Results

We identified 31 indicators with significant disparities. The XGBoost model outperformed others, with an AUC of 0.920 and high precision, sensitivity, specificity, and accuracy. The top 10 features (Age, Monocyte%, Mean Platelet Volume, Lymphocyte%, SIRI, Eosinophil count, Platelet count, Hemoglobin, Platelet Distribution Width, and Neutrophil count.) were crucial in constructing a more precise predictive model. The model demonstrated strong performance on both training (AUC = 0.977) and validation (AUC = 0.912) datasets, validated by decision curve analysis and calibration curve.

Conclusion

ML models that incorporate CBC parameters offer an innovative and effective tool for data analysis in COVID-19. They potentially enhance diagnostic accuracy and the efficacy of therapeutic interventions, ultimately contributing to a reduction in the mortality rate of this infectious disease.
Appendix
Available only for authorised users
Literature
2.
go back to reference Strategy, Policy Working Group for Ncip Epidemic Response CCfDC, Prevention. Interim guidelines for prevention and control of COVID-19 for oversea returnees. Zhonghua Liu Xing Bing Xue Za Zhi. 2020;41(8):1197–8. Strategy, Policy Working Group for Ncip Epidemic Response CCfDC, Prevention. Interim guidelines for prevention and control of COVID-19 for oversea returnees. Zhonghua Liu Xing Bing Xue Za Zhi. 2020;41(8):1197–8.
3.
go back to reference Zahorec R. Neutrophil-to-lymphocyte ratio, past, present and future perspectives. Bratisl Lek Listy. 2021;122(7):474–88.PubMed Zahorec R. Neutrophil-to-lymphocyte ratio, past, present and future perspectives. Bratisl Lek Listy. 2021;122(7):474–88.PubMed
4.
go back to reference Buonacera A, Stancanelli B, Colaci M, Malatino L. Neutrophil to lymphocyte ratio: an emerging marker of the relationships between the immune system and diseases. Int J Mol Sci. 2022;23(7):3636.CrossRefPubMedPubMedCentral Buonacera A, Stancanelli B, Colaci M, Malatino L. Neutrophil to lymphocyte ratio: an emerging marker of the relationships between the immune system and diseases. Int J Mol Sci. 2022;23(7):3636.CrossRefPubMedPubMedCentral
5.
go back to reference Dey D, Slomka PJ, Leeson P, Comaniciu D, Shrestha S, Sengupta PP, Marwick TH. Artificial intelligence in cardiovascular imaging: JACC State-of-the-art review. J Am Coll Cardiol. 2019;73(11):1317–35.CrossRefPubMedPubMedCentral Dey D, Slomka PJ, Leeson P, Comaniciu D, Shrestha S, Sengupta PP, Marwick TH. Artificial intelligence in cardiovascular imaging: JACC State-of-the-art review. J Am Coll Cardiol. 2019;73(11):1317–35.CrossRefPubMedPubMedCentral
6.
go back to reference Zhang M, Liu Y, Yao J, Wang K, Tu J, Hu Z, Jin Y, Du Y, Sun X, Chen L, Wang Z. Value of machine learning-based transrectal multimodal ultrasound combined with PSA-related indicators in the diagnosis of clinically significant Prostate cancer. Front Endocrinol (Lausanne). 2023;14:1137322.CrossRefPubMed Zhang M, Liu Y, Yao J, Wang K, Tu J, Hu Z, Jin Y, Du Y, Sun X, Chen L, Wang Z. Value of machine learning-based transrectal multimodal ultrasound combined with PSA-related indicators in the diagnosis of clinically significant Prostate cancer. Front Endocrinol (Lausanne). 2023;14:1137322.CrossRefPubMed
7.
go back to reference Ahmadzadeh M, Christie GJ, Cosco TD, Moreno S. Neuroimaging and analytical methods for studying the pathways from mild cognitive impairment to Alzheimer’s disease: protocol for a rapid systematic review. Syst Rev. 2020;9(1):71.CrossRefPubMedPubMedCentral Ahmadzadeh M, Christie GJ, Cosco TD, Moreno S. Neuroimaging and analytical methods for studying the pathways from mild cognitive impairment to Alzheimer’s disease: protocol for a rapid systematic review. Syst Rev. 2020;9(1):71.CrossRefPubMedPubMedCentral
8.
go back to reference Mahdavi M, Choubdar H, Zabeh E, Rieder M, Safavi-Naeini S, Jobbagy Z, Ghorbani A, Abedini A, Kiani A, Khanlarzadeh V, et al. A machine learning based exploration of COVID-19 mortality risk. PLoS One. 2021;16(7):e0252384.CrossRefPubMedPubMedCentral Mahdavi M, Choubdar H, Zabeh E, Rieder M, Safavi-Naeini S, Jobbagy Z, Ghorbani A, Abedini A, Kiani A, Khanlarzadeh V, et al. A machine learning based exploration of COVID-19 mortality risk. PLoS One. 2021;16(7):e0252384.CrossRefPubMedPubMedCentral
9.
go back to reference Zhou H, Wang X, Zhu R. Feature selection based on mutual information with correlation coefficient. Appl Intell. 2022;52(5):5457–74.CrossRef Zhou H, Wang X, Zhu R. Feature selection based on mutual information with correlation coefficient. Appl Intell. 2022;52(5):5457–74.CrossRef
10.
go back to reference Brodin P. Immune determinants of COVID-19 disease presentation and severity. Nat Med. 2021;27(1):28–33.CrossRefPubMed Brodin P. Immune determinants of COVID-19 disease presentation and severity. Nat Med. 2021;27(1):28–33.CrossRefPubMed
11.
go back to reference Bertsimas D, Mingardi L, Stellato B. Machine learning for real-time heart disease prediction. IEEE J Biomed Health Inform. 2021;25(9):3627–37.CrossRefPubMed Bertsimas D, Mingardi L, Stellato B. Machine learning for real-time heart disease prediction. IEEE J Biomed Health Inform. 2021;25(9):3627–37.CrossRefPubMed
12.
go back to reference Li Q, Yang H, Wang P, Liu X, Lv K, Ye M. XGBoost-based and tumor-immune characterized gene signature for the prediction of metastatic status in breast cancer. J Transl Med. 2022;20(1):177.CrossRefPubMedPubMedCentral Li Q, Yang H, Wang P, Liu X, Lv K, Ye M. XGBoost-based and tumor-immune characterized gene signature for the prediction of metastatic status in breast cancer. J Transl Med. 2022;20(1):177.CrossRefPubMedPubMedCentral
13.
go back to reference Chen D, Liu J, Zang L, Xiao T, Zhang X, Li Z, Zhu H, Gao W, Yu X. Integrated machine learning and bioinformatic analyses constructed a novel stemness-related classifier to predict prognosis and immunotherapy responses for hepatocellular carcinoma patients. Int J Biol Sci. 2022;18(1):360–73.CrossRefPubMedPubMedCentral Chen D, Liu J, Zang L, Xiao T, Zhang X, Li Z, Zhu H, Gao W, Yu X. Integrated machine learning and bioinformatic analyses constructed a novel stemness-related classifier to predict prognosis and immunotherapy responses for hepatocellular carcinoma patients. Int J Biol Sci. 2022;18(1):360–73.CrossRefPubMedPubMedCentral
14.
go back to reference Dudley J, Christian M, Andrews A, Andrews N, Baker J, Boyle S, Convery M, Gamston F, Garcia M, Haq S, et al. Clinical practice guidelines standardisation of immunosuppressive and anti-infective drug regimens in UK paediatric renal transplantation: the harmonisation programme. BMC Nephrol. 2021;22(1):312.CrossRefPubMedPubMedCentral Dudley J, Christian M, Andrews A, Andrews N, Baker J, Boyle S, Convery M, Gamston F, Garcia M, Haq S, et al. Clinical practice guidelines standardisation of immunosuppressive and anti-infective drug regimens in UK paediatric renal transplantation: the harmonisation programme. BMC Nephrol. 2021;22(1):312.CrossRefPubMedPubMedCentral
15.
go back to reference Zhang Y, Zhang Z, Wei L, Wei S. Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer. Front Public Health. 2022;10:1008137.CrossRefPubMedPubMedCentral Zhang Y, Zhang Z, Wei L, Wei S. Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer. Front Public Health. 2022;10:1008137.CrossRefPubMedPubMedCentral
16.
go back to reference Lin X, Chen L, Zhang D, Luo S, Sheng Y, Liu X, Liu Q, Li J, Shi B, Peng G, et al. Prediction of surgical approach in mitral valve disease by XGBoost algorithm based on echocardiographic features. J Clin Med. 2023;12(3):1193.CrossRefPubMedPubMedCentral Lin X, Chen L, Zhang D, Luo S, Sheng Y, Liu X, Liu Q, Li J, Shi B, Peng G, et al. Prediction of surgical approach in mitral valve disease by XGBoost algorithm based on echocardiographic features. J Clin Med. 2023;12(3):1193.CrossRefPubMedPubMedCentral
17.
go back to reference Mulenga C, Kaonga P, Hamoonga R, Mazaba ML, Chabala F, Musonda P. Predicting mortality in hospitalized COVID-19 patients in Zambia: an application of machine learning. Glob Health Epidemiol Genom. 2023;2023:8921220.PubMedPubMedCentral Mulenga C, Kaonga P, Hamoonga R, Mazaba ML, Chabala F, Musonda P. Predicting mortality in hospitalized COVID-19 patients in Zambia: an application of machine learning. Glob Health Epidemiol Genom. 2023;2023:8921220.PubMedPubMedCentral
18.
go back to reference Battineni G, Sagaro GG, Chinatalapudi N, Amenta F. Applications of machine learning predictive models in the chronic disease diagnosis. J Pers Med. 2020;10(2):21.CrossRefPubMedPubMedCentral Battineni G, Sagaro GG, Chinatalapudi N, Amenta F. Applications of machine learning predictive models in the chronic disease diagnosis. J Pers Med. 2020;10(2):21.CrossRefPubMedPubMedCentral
20.
go back to reference Liu JQ, Xu JW, Sun CY, Wang JN, Wang XT, Chen X, Gao SL. Age-stratified analysis of SARS-CoV-2 Infection and case fatality rate in China, Italy, and South Korea. Eur Rev Med Pharmacol Sci. 2020;24(23):12575–8.PubMed Liu JQ, Xu JW, Sun CY, Wang JN, Wang XT, Chen X, Gao SL. Age-stratified analysis of SARS-CoV-2 Infection and case fatality rate in China, Italy, and South Korea. Eur Rev Med Pharmacol Sci. 2020;24(23):12575–8.PubMed
21.
go back to reference Bost P, Giladi A, Liu Y, Bendjelal Y, Xu G, David E, Blecher-Gonen R, Cohen M, Medaglia C, Li H, et al. Host-viral Infection maps reveal signatures of severe COVID-19 patients. Cell. 2020;181(7):1475-1488 e1412.CrossRefPubMedPubMedCentral Bost P, Giladi A, Liu Y, Bendjelal Y, Xu G, David E, Blecher-Gonen R, Cohen M, Medaglia C, Li H, et al. Host-viral Infection maps reveal signatures of severe COVID-19 patients. Cell. 2020;181(7):1475-1488 e1412.CrossRefPubMedPubMedCentral
22.
go back to reference von Meijenfeldt FA, Havervall S, Adelmeijer J, Lundstrom A, Rudberg AS, Magnusson M, Mackman N, Thalin C, Lisman T. Prothrombotic changes in patients with COVID-19 are associated with disease severity and mortality. Res Pract Thromb Haemost. 2021;5(1):132–41.CrossRef von Meijenfeldt FA, Havervall S, Adelmeijer J, Lundstrom A, Rudberg AS, Magnusson M, Mackman N, Thalin C, Lisman T. Prothrombotic changes in patients with COVID-19 are associated with disease severity and mortality. Res Pract Thromb Haemost. 2021;5(1):132–41.CrossRef
23.
go back to reference Li H, Liu SM, Yu XH, Tang SL, Tang CK. Coronavirus Disease 2019 (COVID-19): current status and future perspectives. Int J Antimicrob Agents. 2020;55(5):105951.CrossRefPubMedPubMedCentral Li H, Liu SM, Yu XH, Tang SL, Tang CK. Coronavirus Disease 2019 (COVID-19): current status and future perspectives. Int J Antimicrob Agents. 2020;55(5):105951.CrossRefPubMedPubMedCentral
24.
go back to reference Liu H, Chen J, Yang Q, Lei F, Zhang C, Qin JJ, Chen Z, Zhu L, Song X, Bai L, et al. Development and validation of a risk score using complete blood count to predict in-hospital mortality in COVID-19 patients. Med. 2021;2(4):435-447 e434.CrossRefPubMed Liu H, Chen J, Yang Q, Lei F, Zhang C, Qin JJ, Chen Z, Zhu L, Song X, Bai L, et al. Development and validation of a risk score using complete blood count to predict in-hospital mortality in COVID-19 patients. Med. 2021;2(4):435-447 e434.CrossRefPubMed
25.
go back to reference Yang L, Xie X, Tu Z, Fu J, Xu D, Zhou Y. Correction: the signal pathways and treatment of cytokine storm in COVID-19. Signal Transduct Target Ther. 2021;6(1):326.CrossRefPubMedPubMedCentral Yang L, Xie X, Tu Z, Fu J, Xu D, Zhou Y. Correction: the signal pathways and treatment of cytokine storm in COVID-19. Signal Transduct Target Ther. 2021;6(1):326.CrossRefPubMedPubMedCentral
26.
go back to reference Cavezzi A, Menicagli R, Troiani E, Corrao S. COVID-19, cation dysmetabolism, sialic Acid, CD147, ACE2, viroporins, hepcidin and ferroptosis: a possible unifying hypothesis. F1000Res. 2022;11:102.CrossRefPubMedPubMedCentral Cavezzi A, Menicagli R, Troiani E, Corrao S. COVID-19, cation dysmetabolism, sialic Acid, CD147, ACE2, viroporins, hepcidin and ferroptosis: a possible unifying hypothesis. F1000Res. 2022;11:102.CrossRefPubMedPubMedCentral
27.
go back to reference Bergamaschi G, Borrelli de Andreis F, Aronico N, Lenti MV, Barteselli C, Merli S, Pellegrino I, Coppola L, Cremonte EM, Croce G, et al. Anemia in patients with Covid-19: pathogenesis and clinical significance. Clin Exp Med. 2021;21(2):239–46.CrossRefPubMedPubMedCentral Bergamaschi G, Borrelli de Andreis F, Aronico N, Lenti MV, Barteselli C, Merli S, Pellegrino I, Coppola L, Cremonte EM, Croce G, et al. Anemia in patients with Covid-19: pathogenesis and clinical significance. Clin Exp Med. 2021;21(2):239–46.CrossRefPubMedPubMedCentral
28.
go back to reference Stephenson E, Reynolds G, Botting RA, Calero-Nieto FJ, Morgan MD, Tuong ZK, Bach K, Sungnak W, Worlock KB, Yoshida M, et al. Single-cell multi-omics analysis of the immune response in COVID-19. Nat Med. 2021;27(5):904–16.CrossRefPubMedPubMedCentral Stephenson E, Reynolds G, Botting RA, Calero-Nieto FJ, Morgan MD, Tuong ZK, Bach K, Sungnak W, Worlock KB, Yoshida M, et al. Single-cell multi-omics analysis of the immune response in COVID-19. Nat Med. 2021;27(5):904–16.CrossRefPubMedPubMedCentral
29.
go back to reference Ackermann M, Anders HJ, Bilyy R, Bowlin GL, Daniel C, De Lorenzo R, Egeblad M, Henneck T, Hidalgo A, Hoffmann M, et al. Patients with COVID-19: in the dark-NETs of neutrophils. Cell Death Differ. 2021;28(11):3125–39.CrossRefPubMedPubMedCentral Ackermann M, Anders HJ, Bilyy R, Bowlin GL, Daniel C, De Lorenzo R, Egeblad M, Henneck T, Hidalgo A, Hoffmann M, et al. Patients with COVID-19: in the dark-NETs of neutrophils. Cell Death Differ. 2021;28(11):3125–39.CrossRefPubMedPubMedCentral
30.
go back to reference Meng Z, Wang M, Zhao Z, Zhou Y, Wu Y, Guo S, Li M, Zhou Y, Yang S, Li W, Ying B. Development and validation of a predictive model for severe COVID-19: a case-control study in China. Front Med (Lausanne). 2021;8:663145.CrossRefPubMed Meng Z, Wang M, Zhao Z, Zhou Y, Wu Y, Guo S, Li M, Zhou Y, Yang S, Li W, Ying B. Development and validation of a predictive model for severe COVID-19: a case-control study in China. Front Med (Lausanne). 2021;8:663145.CrossRefPubMed
Metadata
Title
Online COVID-19 diagnosis prediction using complete blood count: an innovative tool for public health
Authors
Xiaojing Teng
Zhiyi Wang
Publication date
01-12-2023
Publisher
BioMed Central
Keyword
COVID-19
Published in
BMC Public Health / Issue 1/2023
Electronic ISSN: 1471-2458
DOI
https://doi.org/10.1186/s12889-023-17477-8

Other articles of this Issue 1/2023

BMC Public Health 1/2023 Go to the issue