Top

Published in:

01-08-2012 | Editorial

Calibration strategies to validate predictive models: is new always better?

Author: Nicolás Serrano

Published in: Intensive Care Medicine | Issue 8/2012

Excerpt

Calibration along with discrimination is an important measure of accuracy to validate predictive logistic regression models. Most predictive models in intensive care such as Simplified Acute Physiology Score (SAPS) II [1] and SAPS 3 [2, 3] consider the binary outcome whether a patient will be alive or dead at hospital discharge. Discrimination measures how well the model can distinguish between patients who die and those who survive. Discrimination is usually assessed by the area under the receiver operating characteristic curve (AU-ROC) [4]. This statistic evaluates each pair of observations that have different outcomes and calculates the proportion of times when the patient who died had a higher predicted mortality than did the survivor. The AU-ROC ranges from 0.50 (no discrimination: complete binary random of 50 % similar to flipping a coin) to 1.00 (100 % correct discrimination of the model) [4]. …

Le Gall JR, Lemeshow S, Saulnier F (1993) A new Simplified Acute Physiology Score (SAPS II) based on a European/North American multicenter study. JAMA 270:2957–2963PubMedCrossRef

Metnitz PG, Moreno RP, Almeida E, Jordan B, Bauer P, Campos RA, Iapichino G, Edbrooke D, Capuzzo M, Le Gall JR (2005) SAPS 3—from evaluation of the patient to evaluation of the intensive care unit. Part 1: objectives, methods and cohort description. Intensive Care Med 31:1336–1344PubMedCrossRef

Moreno RP, Metnitz PG, Almeida E, Jordan B, Bauer P, Campos RA, Iapichino G, Edbrooke D, Capuzzo M, Le Gall JR (2005) SAPS 3—from evaluation of the patient to evaluation of the intensive care unit. Part 2: development of a prognostic model for hospital mortality at ICU admission. Intensive Care Med 31:1345–1355PubMedCrossRef

Hanley JA, McNeil BJ (1982) The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143:29–36PubMed

Hosmer DW, Lemeshow S (2000) Applied logistic regression, 2nd edn. Wiley, New YorkCrossRef

Kramer AA, Zimmerman JE (2007) Assessing the calibration of mortality benchmarks in critical care: the Hosmer–Lemeshow test revisited. Crit Care Med 35:2052–2056PubMedCrossRef

Hosmer DW, Hosmer T, Le CS, Lemeshow S (1997) A comparison of goodness-of-fit tests for the logistic regression model. Stat Med 16:965–980PubMedCrossRef

Lemeshow S, Teres D, Klar J, Avrunin JS, Gehlbach SH, Rapoport J (1993) Mortality probability models (MPM II) based on an international cohort of intensive care unit patients. JAMA 270:2478–2486PubMedCrossRef

Finazzi S, Poole D, Luciani D, Cogo PE, Bertolini G (2011) Calibration belt for quality-of-care assessment based on dichotomous outcomes. PLoS One 6:e16110PubMedCrossRef

10.

Poole D, Rossi C, Latronico N, Rossi G, Finazzi S, Bertolini G (2012) Comparison between SAPS II and SAPS 3 in predicting hospital mortality in a cohort of 103 Italian ICUs. Is new always better? Intensive Care Med. doi:10.1007/s00134-012-2578-0 PubMed

11.

Nassar AP Jr, Mocelin AO, Nunes AL, Giannini FP, Brauer L, Andrade FM, Dias CA (2011) Caution when using prognostic models: A prospective comparison of 3 recent prognostic models. J Crit Care. doi:10.1016/j.jcrc.2011.08.016

12.

Khwannimit B, Bhurayanontachai R (2010) The performance and customization of SAPS 3 admission score in a Thai medical intensive care unit. Intensive Care Med 36:342–346PubMedCrossRef

13.

Poole D, Rossi C, Anghileri A, Giardino M, Latronico N, Radrizzani D, Langer M, Bertolini G (2009) External validation of the Simplified Acute Physiology Score (SAPS) 3 in a cohort of 28,357 patients from 147 Italian intensive care units. Intensive Care Med 35:1916–1924PubMedCrossRef

14.

Khwannimit B, Bhurayanontachai R (2011) A comparison of the performance of Simplified Acute Physiology Score 3 with old standard severity scores and customized scores in a mixed medical-coronary care unit. Minerva Anestesiol 77:305–312PubMed

15.

Capuzzo M, Scaramuzza A, Vaccarini B, Gilli G, Zannoli S, Farabegoli L, Felisatti G, Davanzo E, Alvisi R (2009) Validation of SAPS 3 admission score and comparison with SAPS II. Acta Anaesthesiol Scand 53:589–594PubMedCrossRef

16.

Sakr Y, Krauss C, Amaral AC, Rea-Neto A, Specht M, Reinhart K, Marx G (2008) Comparison of the performance of SAPS II, SAPS 3, APACHE II, and their customized prognostic models in a surgical intensive care unit. Br J Anaesth 101:798–803PubMedCrossRef

17.

Ledoux D, Canivet JL, Preiser JC, Lefrancq J, Damas P (2008) SAPS 3 admission score: an external validation in a general intensive care population. Intensive Care Med 34:1873–1877PubMedCrossRef

18.

Silva Junior JM, Malbouisson LM, Nuevo HL, Barbosa LG, Marubayashi LY, Teixeira IC, Nassar Junior AP, Carmona MJ, Silva IF, Auler Junior JO, Rezende E (2010) Applicability of the Simplified Acute Physiology Score (SAPS 3) in Brazilian hospitals. Rev Bras Anestesiol 60:20–31PubMedCrossRef

19.

Mbongo CL, Monedero P, Guillen-Grima F, Yepes MJ, Vives M, Echarri G (2009) Performance of SAPS 3, compared with APACHE II and SOFA, to predict hospital mortality in a general ICU in Southern Europe. Eur J Anaesthesiol 26:940–945PubMedCrossRef

20.

Costa e Silva VT, de Castro I, Liano F, Muriel A, Rodriguez-Palomares JR, Yu L (2011) Performance of the third-generation models of severity scoring systems (APACHE IV, SAPS 3 and MPM-III) in acute kidney injury critically ill patients. Nephrol Dial Transplant 26:3894–3901

21.

Lim SY, Ham CR, Park SY, Kim S, Park MR, Jeon K, Um SW, Chung MP, Kim H, Kwon OJ, Suh GY (2011) Validation of the Simplified Acute Physiology Score 3 scoring system in a Korean intensive care unit. Yonsei Med J 52:59–64PubMedCrossRef

22.

Soares M, Silva UV, Teles JM, Silva E, Caruso P, Lobo SM, Dal PF, Azevedo LP, de Carvalho FB, Salluh JI (2010) Validation of four prognostic scores in patients with cancer admitted to Brazilian intensive care units: results from a prospective multicenter study. Intensive Care Med 36:1188–1195PubMedCrossRef

23.

Maccariello E, Valente C, Nogueira L, Bonomo H, Ismael M, Machado JE, Baldotto F, Godinho M, Valenca R, Rocha E, Soares M (2010) SAPS 3 scores at the start of renal replacement therapy predict mortality in critically ill patients with acute kidney injury. Kidney Int 77:51–56PubMedCrossRef

24.

Metnitz B, Schaden E, Moreno R, Le Gall JR, Bauer P, Metnitz PG (2009) Austrian validation and customization of the SAPS 3 admission score. Intensive Care Med 35:616–622PubMedCrossRef

25.

Tsai CW, Lin YF, Wu VC, Chu TS, Chen YM, Hu FC, Wu KD, Ko WJ (2008) SAPS 3 at dialysis commencement is predictive of hospital mortality in patients supported by extracorporeal membrane oxygenation and acute dialysis. Eur J Cardiothorac Surg 34:1158–1164PubMedCrossRef

26.

Soares M, Salluh JI (2006) Validation of the SAPS 3 admission prognostic model in patients with cancer in need of intensive care. Intensive Care Med 32:1839–1844PubMedCrossRef

27.

Strand K, Soreide E, Aardal S, Flaatten H (2009) A comparison of SAPS II and SAPS 3 in a Norwegian intensive care unit population. Acta Anaesthesiol Scand 53:595–600PubMedCrossRef

28.

Zajac K, Andres J, Zajac M (2009) A comparison of SAPS 2 and SAPS 3. Acta Anaesthesiol Scand 53:1230–1231PubMedCrossRef

29.

Strand K, Strand LI, Flaatten H (2010) The interrater reliability of SAPS II and SAPS 3. Intensive Care Med 36:850–853PubMedCrossRef

Title: Calibration strategies to validate predictive models: is new always better?
Author: Nicolás Serrano
Publication date: 01-08-2012
Publisher: Springer-Verlag
Published in: Intensive Care Medicine / Issue 8/2012
Print ISSN: 0342-4642
Electronic ISSN: 1432-1238
DOI: https://doi.org/10.1007/s00134-012-2579-z

At a glance: The STEP trials

Springer Medicine

Calibration strategies to validate predictive models: is new always better?

Excerpt

At a glance: The STEP trials

Springer Medicine

Excerpt

Please log in to get access to this content

Other articles of this Issue 8/2012

Impact of volume guarantee on synchronized ventilation in preterm infants: a randomized controlled trial

Candida colonization in ventilated ICU patients: no longer a bystander!

Persistent hypocoagulability in patients with septic shock predicts greater hospital mortality: impact of impaired thrombin generation

Standardised drug labelling in intensive care: results of an international survey among ESICM members

Point of care ultrasound for sepsis management in resource-limited settings: time for a new paradigm for global health care

Point of care ultrasound for sepsis management in resource-limited settings: response to Via et al.