Skip to main content
Top
Published in: Journal of Translational Medicine 1/2020

Open Access 01-12-2020 | Septicemia | Research

Predicting 30-days mortality for MIMIC-III patients with sepsis-3: a machine learning approach using XGboost

Authors: Nianzong Hou, Mingzhe Li, Lu He, Bing Xie, Lin Wang, Rumin Zhang, Yong Yu, Xiaodong Sun, Zhengsheng Pan, Kai Wang

Published in: Journal of Translational Medicine | Issue 1/2020

Login to get access

Abstract

Background

Sepsis is a significant cause of mortality in-hospital, especially in ICU patients. Early prediction of sepsis is essential, as prompt and appropriate treatment can improve survival outcomes. Machine learning methods are flexible prediction algorithms with potential advantages over conventional regression and scoring system. The aims of this study were to develop a machine learning approach using XGboost to predict the 30-days mortality for MIMIC-III Patients with sepsis-3 and to determine whether such model performs better than traditional prediction models.

Methods

Using the MIMIC-III v1.4, we identified patients with sepsis-3. The data was split into two groups based on death or survival within 30 days and variables, selected based on clinical significance and availability by stepwise analysis, were displayed and compared between groups. Three predictive models including conventional logistic regression model, SAPS-II score prediction model and XGBoost algorithm model were constructed by R software. Then, the performances of the three models were tested and compared by AUCs of the receiver operating characteristic curves and decision curve analysis. At last, nomogram and clinical impact curve were used to validate the model.

Results

A total of 4559 sepsis-3 patients are included in the study, in which, 889 patients were death and 3670 survival within 30 days, respectively. According to the results of AUCs (0.819 [95% CI 0.800–0.838], 0.797 [95% CI 0.781–0.813] and 0.857 [95% CI 0.839–0.876]) and decision curve analysis for the three models, the XGboost model performs best. The risk nomogram and clinical impact curve verify that the XGboost model possesses significant predictive value.

Conclusions

Using machine learning technique by XGboost, more significant prediction model can be built. This XGboost model may prove clinically useful and assist clinicians in tailoring precise management and therapy for the patients with sepsis-3.
Appendix
Available only for authorised users
Literature
1.
go back to reference Song JY, Park DW, Moon SW, et al. Diagnostic and prognostic value of interleukin-6, pentraxin 3, and procalcitonin levels among sepsis and septic shock patients: a prospective controlled study according to the sepsis-3 definitions. BMC Infect Dis. 2019a;19(1):968.PubMedPubMedCentralCrossRef Song JY, Park DW, Moon SW, et al. Diagnostic and prognostic value of interleukin-6, pentraxin 3, and procalcitonin levels among sepsis and septic shock patients: a prospective controlled study according to the sepsis-3 definitions. BMC Infect Dis. 2019a;19(1):968.PubMedPubMedCentralCrossRef
2.
go back to reference Singer M, Deutschman CS, Seymour CW, et al. The third international consensus definitions for sepsis and septic shock (sepsis-3). JAMA. 2016;315(8):801–10.PubMedPubMedCentralCrossRef Singer M, Deutschman CS, Seymour CW, et al. The third international consensus definitions for sepsis and septic shock (sepsis-3). JAMA. 2016;315(8):801–10.PubMedPubMedCentralCrossRef
3.
go back to reference Liu V, Escobar GJ, Greene JD, et al. Hospital deaths in patients with sepsis from 2 independent cohorts. JAMA. 2014;312(1):90–2.PubMedCrossRef Liu V, Escobar GJ, Greene JD, et al. Hospital deaths in patients with sepsis from 2 independent cohorts. JAMA. 2014;312(1):90–2.PubMedCrossRef
5.
go back to reference Song J, Moon SW, Park DW, et al. Biomarker combination and SOFA score for the prediction of mortality in sepsis and septic shock. Medicine. 2020;99(22):e20495.PubMedCrossRef Song J, Moon SW, Park DW, et al. Biomarker combination and SOFA score for the prediction of mortality in sepsis and septic shock. Medicine. 2020;99(22):e20495.PubMedCrossRef
6.
go back to reference Majdan M, Brazinova A, Rusnak M, et al. Outcome prediction after traumatic brain injury: comparison of the performance of routinely used severity scores and multivariable prognostic models. J Neurosci Rural Pract. 2017;8(1):20.PubMedPubMedCentralCrossRef Majdan M, Brazinova A, Rusnak M, et al. Outcome prediction after traumatic brain injury: comparison of the performance of routinely used severity scores and multivariable prognostic models. J Neurosci Rural Pract. 2017;8(1):20.PubMedPubMedCentralCrossRef
7.
go back to reference Song JY, Park DW, Moon SW, et al. Validation of APACHE II and SAPS II scales at the intensive care unit along with assessment of SOFA scale at the admission as an isolated risk of death predictor. Anaesthesiol Intensive Ther. 2019b;51(2):107–11.CrossRef Song JY, Park DW, Moon SW, et al. Validation of APACHE II and SAPS II scales at the intensive care unit along with assessment of SOFA scale at the admission as an isolated risk of death predictor. Anaesthesiol Intensive Ther. 2019b;51(2):107–11.CrossRef
8.
go back to reference Godinjak A, Iglica A, Rama A, et al. Predictive value of SAPS II and APACHE II scoring systems for patient outcome in a medical intensive care unit. Acta Med Acad. 2016;45(2):97–103.PubMedCrossRef Godinjak A, Iglica A, Rama A, et al. Predictive value of SAPS II and APACHE II scoring systems for patient outcome in a medical intensive care unit. Acta Med Acad. 2016;45(2):97–103.PubMedCrossRef
9.
go back to reference Yuan KC, Tsai LW, Lee KH, et al. The development an artificial intelligence algorithm for early sepsis diagnosis in the intensive care unit. Int J Med Inform. 2020;141:104176.PubMedCrossRef Yuan KC, Tsai LW, Lee KH, et al. The development an artificial intelligence algorithm for early sepsis diagnosis in the intensive care unit. Int J Med Inform. 2020;141:104176.PubMedCrossRef
10.
go back to reference Chen T, Guestrin C. XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining-KDD 2016, San Francisco, CA, USA; 2016. p. 785–94. Chen T, Guestrin C. XGBoost: a scalable tree boosting system. In: Proceedings of the 22nd ACM SIGKDD international conference on knowledge discovery and data mining-KDD 2016, San Francisco, CA, USA; 2016. p. 785–94.
12.
go back to reference Oweira H, Schmidt J, Mehrabi A, et al. Comparison of three prognostic models for predicting cancer-specific survival among patients with gastrointestinal stromal tumors. Future Oncol. 2018;14(4):379–89.PubMedCrossRef Oweira H, Schmidt J, Mehrabi A, et al. Comparison of three prognostic models for predicting cancer-specific survival among patients with gastrointestinal stromal tumors. Future Oncol. 2018;14(4):379–89.PubMedCrossRef
13.
go back to reference Goldberger A, Amaral L, Glass L, et al. PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals. Circulation. 2000;101(23):e215–20.PubMedCrossRef Goldberger A, Amaral L, Glass L, et al. PhysioBank, PhysioToolkit, and PhysioNet: components of a new research resource for complex physiologic signals. Circulation. 2000;101(23):e215–20.PubMedCrossRef
14.
go back to reference Templ M, Alfons A, Filzmoser P. Exploring incomplete data using visualization techniques. Adv Data Anal Classif. 2012;6:29–47.CrossRef Templ M, Alfons A, Filzmoser P. Exploring incomplete data using visualization techniques. Adv Data Anal Classif. 2012;6:29–47.CrossRef
15.
go back to reference Van Buuren S, Groothuis-Oudshoorn CG. mice: multivariate imputation by chained equations in R. J Stat Softw. 2011;45:67.CrossRef Van Buuren S, Groothuis-Oudshoorn CG. mice: multivariate imputation by chained equations in R. J Stat Softw. 2011;45:67.CrossRef
16.
go back to reference Gall JRL, Lemeshow S, Saulnier F, et al. A new simplified acute physiology score (SAPS II) based on a European/North American multicenter study. JAMA. 1993;270:2957–63.CrossRefPubMed Gall JRL, Lemeshow S, Saulnier F, et al. A new simplified acute physiology score (SAPS II) based on a European/North American multicenter study. JAMA. 1993;270:2957–63.CrossRefPubMed
17.
go back to reference Zhang Z, Ho KM, Hong Y. Machine learning for the prediction of volume responsiveness in patients with oliguric acute kidney injury in critical care. Crit Care. 2019;23(1):112.PubMedPubMedCentralCrossRef Zhang Z, Ho KM, Hong Y. Machine learning for the prediction of volume responsiveness in patients with oliguric acute kidney injury in critical care. Crit Care. 2019;23(1):112.PubMedPubMedCentralCrossRef
18.
go back to reference Livne M, Boldsen JK, Mikkelsen IK, et al. Boosted tree model reforms multimodal magnetic resonance imaging infarct prediction in acute stroke. Stroke. 2018;49:912–8.PubMedCrossRef Livne M, Boldsen JK, Mikkelsen IK, et al. Boosted tree model reforms multimodal magnetic resonance imaging infarct prediction in acute stroke. Stroke. 2018;49:912–8.PubMedCrossRef
19.
go back to reference Collins GS, Reitsma JB, Altman DG, et al. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD). Circulation. 2015;131(2):211–9.PubMedPubMedCentralCrossRef Collins GS, Reitsma JB, Altman DG, et al. Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD). Circulation. 2015;131(2):211–9.PubMedPubMedCentralCrossRef
20.
go back to reference Fleischmann-Struzek C, Mellhammar L, Reinhart K, et al. Incidence and mortality of hospital and ICU-treated sepsis: results from an updated and expanded systematic review and meta-analysis. Intensive Care Med. 2020;46(8):1552–62.PubMedPubMedCentralCrossRef Fleischmann-Struzek C, Mellhammar L, Reinhart K, et al. Incidence and mortality of hospital and ICU-treated sepsis: results from an updated and expanded systematic review and meta-analysis. Intensive Care Med. 2020;46(8):1552–62.PubMedPubMedCentralCrossRef
21.
go back to reference Torio CM, Moore BJ. National inpatient hospital costs: the most expensive conditions by payer, 2013: statistical brief #204. Healthcare Cost and Utilization Project (HCUP) Statistical Briefs, Rockville (MD); 2006. Torio CM, Moore BJ. National inpatient hospital costs: the most expensive conditions by payer, 2013: statistical brief #204. Healthcare Cost and Utilization Project (HCUP) Statistical Briefs, Rockville (MD); 2006.
22.
go back to reference Martin GS, Mannino DM, Eaton S, et al. The epidemiology of sepsis in the united states from 1979 through 2000. N Engl J Med. 2003;348:1546–54.PubMedCrossRef Martin GS, Mannino DM, Eaton S, et al. The epidemiology of sepsis in the united states from 1979 through 2000. N Engl J Med. 2003;348:1546–54.PubMedCrossRef
24.
go back to reference Jouffroy R, Saade A, Vivien B, et al. Pre-hospital mechanical ventilation in septic shock patients. Am J Emerg Med. 2019;37(10):1860–3.PubMedCrossRef Jouffroy R, Saade A, Vivien B, et al. Pre-hospital mechanical ventilation in septic shock patients. Am J Emerg Med. 2019;37(10):1860–3.PubMedCrossRef
25.
go back to reference Wu L, Feng Q, Ai ML, et al. The dynamic change of serum S100B levels from day 1 to day 3 is more associated with sepsis associated encephalopathy. Sci Rep. 2020;10(1):7718.PubMedPubMedCentralCrossRef Wu L, Feng Q, Ai ML, et al. The dynamic change of serum S100B levels from day 1 to day 3 is more associated with sepsis associated encephalopathy. Sci Rep. 2020;10(1):7718.PubMedPubMedCentralCrossRef
26.
go back to reference Oud L. Epidemiology and outcomes of sepsis among hospitalizations with systemic lupus erythematosus admitted to the ICU: a population-based cohort study. J Intensive Care. 2020;8:3.PubMedPubMedCentralCrossRef Oud L. Epidemiology and outcomes of sepsis among hospitalizations with systemic lupus erythematosus admitted to the ICU: a population-based cohort study. J Intensive Care. 2020;8:3.PubMedPubMedCentralCrossRef
27.
go back to reference Xiao J, Ding RF, Xu XL, et al. Comparison and development of machine learning tools in the prediction of chronic kidney disease progression. J Transl Med. 2019;17(1):119.PubMedPubMedCentralCrossRef Xiao J, Ding RF, Xu XL, et al. Comparison and development of machine learning tools in the prediction of chronic kidney disease progression. J Transl Med. 2019;17(1):119.PubMedPubMedCentralCrossRef
28.
go back to reference Li YM, Li ZL, Chen F, et al. A LASSO-derived risk model for long-term mortality in Chinese patients with acute coronary syndrome. J Transl Med. 2020;18(1):157.PubMedPubMedCentralCrossRef Li YM, Li ZL, Chen F, et al. A LASSO-derived risk model for long-term mortality in Chinese patients with acute coronary syndrome. J Transl Med. 2020;18(1):157.PubMedPubMedCentralCrossRef
29.
go back to reference Nemati S, Holder A, Razmi F, et al. An interpretable machine learning model for accurate prediction of sepsis in the ICU. Crit Care Med. 2018;46(4):547–53.PubMedPubMedCentralCrossRef Nemati S, Holder A, Razmi F, et al. An interpretable machine learning model for accurate prediction of sepsis in the ICU. Crit Care Med. 2018;46(4):547–53.PubMedPubMedCentralCrossRef
30.
go back to reference Seymour CW, Kennedy JN, Wang S, et al. Derivation, validation, and potential treatment implications of novel clinical phenotypes for sepsis. JAMA. 2019;321(20):2003–17.PubMedPubMedCentralCrossRef Seymour CW, Kennedy JN, Wang S, et al. Derivation, validation, and potential treatment implications of novel clinical phenotypes for sepsis. JAMA. 2019;321(20):2003–17.PubMedPubMedCentralCrossRef
31.
go back to reference Dhungana P, Serafim LP, Ruiz AL, et al. Machine learning in data abstraction: a computable phenotype for sepsis and septic shock diagnosis in the intensive care unit. World J Crit Care Med. 2019;8(7):120–6.PubMedPubMedCentralCrossRef Dhungana P, Serafim LP, Ruiz AL, et al. Machine learning in data abstraction: a computable phenotype for sepsis and septic shock diagnosis in the intensive care unit. World J Crit Care Med. 2019;8(7):120–6.PubMedPubMedCentralCrossRef
32.
go back to reference Liu R, Greenstein JL, Granite SJ, et al. Data-driven discovery of a novel sepsis pre-shock state predicts impending septic shock in the ICU. Sci Rep. 2019;9(1):6145.PubMedPubMedCentralCrossRef Liu R, Greenstein JL, Granite SJ, et al. Data-driven discovery of a novel sepsis pre-shock state predicts impending septic shock in the ICU. Sci Rep. 2019;9(1):6145.PubMedPubMedCentralCrossRef
34.
go back to reference Franzosi OS, Nunes DSL, Klanovicz TM, et al. Hemodynamic and skin perfusion is associated with successful enteral nutrition therapy in septic shock patients. Clin Nutr. 2020;S0261–5614(20):30151–5. Franzosi OS, Nunes DSL, Klanovicz TM, et al. Hemodynamic and skin perfusion is associated with successful enteral nutrition therapy in septic shock patients. Clin Nutr. 2020;S0261–5614(20):30151–5.
35.
go back to reference Pinheiro KHE, Azêdo FA, Areco KCN, et al. Risk factors and mortality in patients with sepsis, septic and non septic acute kidney injury in ICU. J Bras Nefrol. 2019;41(4):462–71.PubMedPubMedCentralCrossRef Pinheiro KHE, Azêdo FA, Areco KCN, et al. Risk factors and mortality in patients with sepsis, septic and non septic acute kidney injury in ICU. J Bras Nefrol. 2019;41(4):462–71.PubMedPubMedCentralCrossRef
36.
go back to reference Lin PC, Huang HC, Komorowski M, et al. A machine learning approach for predicting urine output after fluid administration. Comput Methods Programs Biomed. 2019;177:155–9.PubMedCrossRef Lin PC, Huang HC, Komorowski M, et al. A machine learning approach for predicting urine output after fluid administration. Comput Methods Programs Biomed. 2019;177:155–9.PubMedCrossRef
38.
go back to reference Shirazy M, Omar I, Abduljabbar D, et al. Prevalence and prognostic impact of hypernatremia in sepsis and septic shock patients in the intensive care unit: a single centre experience. J Crit Care Med. 2020;6(1):52–8.CrossRef Shirazy M, Omar I, Abduljabbar D, et al. Prevalence and prognostic impact of hypernatremia in sepsis and septic shock patients in the intensive care unit: a single centre experience. J Crit Care Med. 2020;6(1):52–8.CrossRef
39.
go back to reference Zhang K, Lv D, Deng Y, et al. STAPLAg: a convenient early warning score for use in infected patients in the intensive care unit. Medicine. 2020;99(22):e20274.PubMedCrossRef Zhang K, Lv D, Deng Y, et al. STAPLAg: a convenient early warning score for use in infected patients in the intensive care unit. Medicine. 2020;99(22):e20274.PubMedCrossRef
40.
go back to reference Ding XF, Yang ZY, Xu ZT, et al. Early goal-directed and lactate-guided therapy in adult patients with severe sepsis and septic shock: a meta-analysis of randomized controlled trials. J Transl Med. 2018;16(1):331.PubMedPubMedCentralCrossRef Ding XF, Yang ZY, Xu ZT, et al. Early goal-directed and lactate-guided therapy in adult patients with severe sepsis and septic shock: a meta-analysis of randomized controlled trials. J Transl Med. 2018;16(1):331.PubMedPubMedCentralCrossRef
41.
go back to reference Liu YL, Zheng J, Zhang DS, et al. Neutrophil-lymphocyte ratio and plasma lactate predict 28-day mortality in patients with sepsis. J Clin Lab Anal. 2019;33(7):e22942.PubMedPubMedCentral Liu YL, Zheng J, Zhang DS, et al. Neutrophil-lymphocyte ratio and plasma lactate predict 28-day mortality in patients with sepsis. J Clin Lab Anal. 2019;33(7):e22942.PubMedPubMedCentral
42.
go back to reference Velissaris D, Karamouzos V, Pantzaris ND, et al. Relation between central venous, peripheral venous and arterial lactate levels in patients with sepsis in the emergency department. J Clin Med Res. 2019;11(9):629–34.PubMedPubMedCentralCrossRef Velissaris D, Karamouzos V, Pantzaris ND, et al. Relation between central venous, peripheral venous and arterial lactate levels in patients with sepsis in the emergency department. J Clin Med Res. 2019;11(9):629–34.PubMedPubMedCentralCrossRef
43.
go back to reference Lyons PG, Micek ST, Hampton N, et al. Sepsis-associated coagulopathy severity predicts hospital mortality. Crit Care Med. 2018;46(5):736–42.PubMedCrossRef Lyons PG, Micek ST, Hampton N, et al. Sepsis-associated coagulopathy severity predicts hospital mortality. Crit Care Med. 2018;46(5):736–42.PubMedCrossRef
44.
go back to reference Casado-Méndez M, Fernandez-Pacheco J, Arellano-Orden V, et al. Relationship of thromboelastography and conventional clotting test values with severe bleeding in critically ill patients with coagulopathy: a prospective study. Int J Lab Hematol. 2019;41(5):671–8.PubMedCrossRef Casado-Méndez M, Fernandez-Pacheco J, Arellano-Orden V, et al. Relationship of thromboelastography and conventional clotting test values with severe bleeding in critically ill patients with coagulopathy: a prospective study. Int J Lab Hematol. 2019;41(5):671–8.PubMedCrossRef
45.
go back to reference Lemiale V, Pons S, Mirouse A, et al. Sepsis and septic shock in patients with malignancies: a Groupe de Recherche Respiratoire en Réanimation Onco-Hématologique study. Crit Care Med. 2020;48(6):822–9.PubMedCrossRef Lemiale V, Pons S, Mirouse A, et al. Sepsis and septic shock in patients with malignancies: a Groupe de Recherche Respiratoire en Réanimation Onco-Hématologique study. Crit Care Med. 2020;48(6):822–9.PubMedCrossRef
Metadata
Title
Predicting 30-days mortality for MIMIC-III patients with sepsis-3: a machine learning approach using XGboost
Authors
Nianzong Hou
Mingzhe Li
Lu He
Bing Xie
Lin Wang
Rumin Zhang
Yong Yu
Xiaodong Sun
Zhengsheng Pan
Kai Wang
Publication date
01-12-2020
Publisher
BioMed Central
Published in
Journal of Translational Medicine / Issue 1/2020
Electronic ISSN: 1479-5876
DOI
https://doi.org/10.1186/s12967-020-02620-5

Other articles of this Issue 1/2020

Journal of Translational Medicine 1/2020 Go to the issue
Live Webinar | 27-06-2024 | 18:00 (CEST)

Keynote webinar | Spotlight on medication adherence

Live: Thursday 27th June 2024, 18:00-19:30 (CEST)

WHO estimates that half of all patients worldwide are non-adherent to their prescribed medication. The consequences of poor adherence can be catastrophic, on both the individual and population level.

Join our expert panel to discover why you need to understand the drivers of non-adherence in your patients, and how you can optimize medication adherence in your clinics to drastically improve patient outcomes.

Prof. Kevin Dolgin
Prof. Florian Limbourg
Prof. Anoop Chauhan
Developed by: Springer Medicine
Obesity Clinical Trial Summary

At a glance: The STEP trials

A round-up of the STEP phase 3 clinical trials evaluating semaglutide for weight loss in people with overweight or obesity.

Developed by: Springer Medicine

Highlights from the ACC 2024 Congress

Year in Review: Pediatric cardiology

Watch Dr. Anne Marie Valente present the last year's highlights in pediatric and congenital heart disease in the official ACC.24 Year in Review session.

Year in Review: Pulmonary vascular disease

The last year's highlights in pulmonary vascular disease are presented by Dr. Jane Leopold in this official video from ACC.24.

Year in Review: Valvular heart disease

Watch Prof. William Zoghbi present the last year's highlights in valvular heart disease from the official ACC.24 Year in Review session.

Year in Review: Heart failure and cardiomyopathies

Watch this official video from ACC.24. Dr. Biykem Bozkurt discusses last year's major advances in heart failure and cardiomyopathies.