Published in: BMC Infectious Diseases 1/2023

Open Access 01-12-2023 | Research

Machine learning pipeline for blood culture outcome prediction using Sysmex XN-2000 blood sample results in Western Australia

Authors: Benjamin R. McFadden, Timothy J. J. Inglis, Mark Reynolds

Abstract

Background

Bloodstream infections (BSIs) are a significant burden on the global population and represent a key area of focus in the hospital environment. Blood culture (BC) testing is the standard diagnostic test used to confirm the presence of a BSI. However, current BC testing practices result in low positive yields and overuse of the test. Diagnostic stewardship research on BC testing is increasing and is becoming more important for reducing unnecessary resource expenditure and antimicrobial use, especially as antimicrobial resistance continues to rise. This study aims to establish a machine learning (ML) pipeline for BC outcome prediction using data obtained from routinely analysed blood samples, including complete blood count (CBC), white blood cell differential (DIFF), and cell population data (CPD) produced by Sysmex XN-2000 analysers.

Methods

ML models were trained using retrospective data produced between 2018 and 2019 from patients at Sir Charles Gairdner Hospital, Nedlands, Western Australia, and processed at PathWest Laboratory Medicine, Nedlands. Trained ML models were evaluated using stratified 10-fold cross-validation.
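
As an illustration of the evaluation scheme named above, the following is a minimal, standard-library-only sketch of stratified k-fold splitting. It is not the authors' code: the function name stratified_k_fold, the label vector, and the 10% positive rate are hypothetical and chosen only for illustration. In practice a library routine such as scikit-learn's StratifiedKFold would typically be used instead.

```python
import random
from collections import defaultdict


def stratified_k_fold(labels, k=10, seed=0):
    """Yield (train_idx, test_idx) pairs whose test folds keep the class ratio roughly equal."""
    rng = random.Random(seed)

    # Group sample indices by class label.
    by_class = defaultdict(list)
    for idx, label in enumerate(labels):
        by_class[label].append(idx)

    # Shuffle within each class, then deal indices round-robin across the folds
    # so that every fold receives a near-equal share of each class.
    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        rng.shuffle(indices)
        for position, idx in enumerate(indices):
            folds[position % k].append(idx)

    # Each fold serves once as the test set; the remaining folds form the training set.
    for i in range(k):
        test_idx = sorted(folds[i])
        train_idx = sorted(
            idx for j, fold in enumerate(folds) if j != i for idx in fold
        )
        yield train_idx, test_idx


# Hypothetical labels: 1 = positive blood culture, 0 = negative (assumed 10% positive).
labels = [1] * 100 + [0] * 900
splits = list(stratified_k_fold(labels, k=10))
positives_per_fold = [sum(labels[i] for i in test) for _, test in splits]
```

With these assumed labels, every test fold holds 100 samples with the same share of positives as the full dataset. That is the point of stratification when positive cultures are rare: a plain random split could leave some folds with very few positives and make per-fold scores unstable.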

Results

Two ML models, an XGBoost model using CBC/DIFF/CPD features with Boruta feature selection (BFS) and a random forest model trained using CBC/DIFF features with BFS, were selected for further validation after obtaining AUC scores of \(0.76 \pm 0.04\) and \(0.75 \pm 0.04\) respectively using stratified 10-fold cross-validation. The XGBoost model obtained an AUC score of 0.76 on an internal validation set. The random forest model obtained AUC scores of 0.82 and 0.76 on internal and external validation datasets respectively.
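
To make the reported metric concrete, here is a small sketch of how an AUC score can be computed and how per-fold scores can be summarised in "mean ± spread" form. It assumes the ± denotes the standard deviation across folds, which the abstract does not state. The function names and all numbers below are made up for illustration and are not the study's data.

```python
from statistics import mean, stdev


def roc_auc(y_true, y_score):
    """AUC as the probability that a random positive outranks a random negative.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(y_true, y_score) if y == 1]
    neg = [s for y, s in zip(y_true, y_score) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative sample")
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def summarise(fold_aucs):
    """Return (mean, sample standard deviation) of per-fold AUC values."""
    return mean(fold_aucs), stdev(fold_aucs)


# Toy example: 3 of the 4 positive/negative pairs are ranked correctly.
y_true = [0, 0, 1, 1]
y_score = [0.1, 0.4, 0.35, 0.8]
auc = roc_auc(y_true, y_score)  # 0.75

# Made-up per-fold AUCs (not taken from the paper).
fold_aucs = [0.70, 0.72, 0.74, 0.76, 0.78]
m, s = summarise(fold_aucs)
summary = f"{m:.2f} ± {s:.2f}"
```

The pairwise formulation is quadratic in the number of samples, so it suits small examples only. Library implementations such as scikit-learn's roc_auc_score compute the same quantity more efficiently.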

Conclusions

We have demonstrated the utility of an ML pipeline combined with CBC/DIFF and CBC/DIFF/CPD feature spaces for BC outcome prediction. This builds on the growing body of research in BC outcome prediction and provides an opportunity for further research.
Metadata
Title
Machine learning pipeline for blood culture outcome prediction using Sysmex XN-2000 blood sample results in Western Australia
Authors
Benjamin R. McFadden
Timothy J. J. Inglis
Mark Reynolds
Publication date
01-12-2023
Publisher
BioMed Central
Published in
BMC Infectious Diseases / Issue 1/2023
Electronic ISSN: 1471-2334
DOI
https://doi.org/10.1186/s12879-023-08535-y
