Published in: Intensive Care Medicine 8/2012

01-08-2012 | Editorial

Calibration strategies to validate predictive models: is new always better?

Author: Nicolás Serrano

Excerpt

Calibration, along with discrimination, is an important measure of accuracy when validating predictive logistic regression models. Most predictive models in intensive care, such as the Simplified Acute Physiology Score (SAPS) II [1] and SAPS 3 [2, 3], predict a binary outcome: whether a patient will be alive or dead at hospital discharge. Discrimination measures how well the model distinguishes between patients who die and those who survive, and it is usually assessed by the area under the receiver operating characteristic curve (AU-ROC) [4]. This statistic evaluates each pair of observations with different outcomes and calculates the proportion of pairs in which the patient who died had a higher predicted mortality than the survivor. The AU-ROC ranges from 0.50 (no discrimination, equivalent to flipping a coin) to 1.00 (perfect discrimination) [4]. …
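The pairwise definition of the AU-ROC given above can be illustrated with a minimal Python sketch. The function name `pairwise_auc`, the sample data, and the choice to count tied predictions as half a correct pair are assumptions made for illustration; they are not taken from the editorial.

```python
from itertools import product


def pairwise_auc(predicted, died):
    """AU-ROC as the share of (non-survivor, survivor) pairs ranked correctly.

    Ties in predicted mortality count as half a correct pair; this is a
    common convention and an assumption here, not something the text states.
    """
    dead = [p for p, d in zip(predicted, died) if d]
    alive = [p for p, d in zip(predicted, died) if not d]
    if not dead or not alive:
        raise ValueError("need at least one death and one survivor")

    score = 0.0
    # Compare every patient who died with every patient who survived.
    for p_dead, p_alive in product(dead, alive):
        if p_dead > p_alive:
            score += 1.0
        elif p_dead == p_alive:
            score += 0.5
    return score / (len(dead) * len(alive))


# Hypothetical predicted mortalities and outcomes (1 = died, 0 = survived).
predicted = [0.9, 0.7, 0.4, 0.3, 0.2, 0.1]
died = [1, 1, 0, 1, 0, 0]
auc = pairwise_auc(predicted, died)
print(f"AU-ROC = {auc:.3f}")
```

In this made-up sample, 8 of the 9 pairs are ordered correctly; the one discordant pair is the non-survivor with a predicted mortality of 0.3 and the survivor with 0.4. The double loop is quadratic in the number of patients, so for large cohorts a rank-based computation would normally be preferred.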
Literature
1. Le Gall JR, Lemeshow S, Saulnier F (1993) A new Simplified Acute Physiology Score (SAPS II) based on a European/North American multicenter study. JAMA 270:2957–2963
2. Metnitz PG, Moreno RP, Almeida E, Jordan B, Bauer P, Campos RA, Iapichino G, Edbrooke D, Capuzzo M, Le Gall JR (2005) SAPS 3—from evaluation of the patient to evaluation of the intensive care unit. Part 1: objectives, methods and cohort description. Intensive Care Med 31:1336–1344
3. Moreno RP, Metnitz PG, Almeida E, Jordan B, Bauer P, Campos RA, Iapichino G, Edbrooke D, Capuzzo M, Le Gall JR (2005) SAPS 3—from evaluation of the patient to evaluation of the intensive care unit. Part 2: development of a prognostic model for hospital mortality at ICU admission. Intensive Care Med 31:1345–1355
4. Hanley JA, McNeil BJ (1982) The meaning and use of the area under a receiver operating characteristic (ROC) curve. Radiology 143:29–36
5. Hosmer DW, Lemeshow S (2000) Applied logistic regression, 2nd edn. Wiley, New York
6. Kramer AA, Zimmerman JE (2007) Assessing the calibration of mortality benchmarks in critical care: the Hosmer–Lemeshow test revisited. Crit Care Med 35:2052–2056
7. Hosmer DW, Hosmer T, Le CS, Lemeshow S (1997) A comparison of goodness-of-fit tests for the logistic regression model. Stat Med 16:965–980
8. Lemeshow S, Teres D, Klar J, Avrunin JS, Gehlbach SH, Rapoport J (1993) Mortality probability models (MPM II) based on an international cohort of intensive care unit patients. JAMA 270:2478–2486
9. Finazzi S, Poole D, Luciani D, Cogo PE, Bertolini G (2011) Calibration belt for quality-of-care assessment based on dichotomous outcomes. PLoS One 6:e16110
10. Poole D, Rossi C, Latronico N, Rossi G, Finazzi S, Bertolini G (2012) Comparison between SAPS II and SAPS 3 in predicting hospital mortality in a cohort of 103 Italian ICUs. Is new always better? Intensive Care Med. doi:10.1007/s00134-012-2578-0
11. Nassar AP Jr, Mocelin AO, Nunes AL, Giannini FP, Brauer L, Andrade FM, Dias CA (2011) Caution when using prognostic models: A prospective comparison of 3 recent prognostic models. J Crit Care. doi:10.1016/j.jcrc.2011.08.016
12. Khwannimit B, Bhurayanontachai R (2010) The performance and customization of SAPS 3 admission score in a Thai medical intensive care unit. Intensive Care Med 36:342–346
13. Poole D, Rossi C, Anghileri A, Giardino M, Latronico N, Radrizzani D, Langer M, Bertolini G (2009) External validation of the Simplified Acute Physiology Score (SAPS) 3 in a cohort of 28,357 patients from 147 Italian intensive care units. Intensive Care Med 35:1916–1924
14. Khwannimit B, Bhurayanontachai R (2011) A comparison of the performance of Simplified Acute Physiology Score 3 with old standard severity scores and customized scores in a mixed medical-coronary care unit. Minerva Anestesiol 77:305–312
15. Capuzzo M, Scaramuzza A, Vaccarini B, Gilli G, Zannoli S, Farabegoli L, Felisatti G, Davanzo E, Alvisi R (2009) Validation of SAPS 3 admission score and comparison with SAPS II. Acta Anaesthesiol Scand 53:589–594
16. Sakr Y, Krauss C, Amaral AC, Rea-Neto A, Specht M, Reinhart K, Marx G (2008) Comparison of the performance of SAPS II, SAPS 3, APACHE II, and their customized prognostic models in a surgical intensive care unit. Br J Anaesth 101:798–803
17. Ledoux D, Canivet JL, Preiser JC, Lefrancq J, Damas P (2008) SAPS 3 admission score: an external validation in a general intensive care population. Intensive Care Med 34:1873–1877
18. Silva Junior JM, Malbouisson LM, Nuevo HL, Barbosa LG, Marubayashi LY, Teixeira IC, Nassar Junior AP, Carmona MJ, Silva IF, Auler Junior JO, Rezende E (2010) Applicability of the Simplified Acute Physiology Score (SAPS 3) in Brazilian hospitals. Rev Bras Anestesiol 60:20–31
19. Mbongo CL, Monedero P, Guillen-Grima F, Yepes MJ, Vives M, Echarri G (2009) Performance of SAPS 3, compared with APACHE II and SOFA, to predict hospital mortality in a general ICU in Southern Europe. Eur J Anaesthesiol 26:940–945
20. Costa e Silva VT, de Castro I, Liano F, Muriel A, Rodriguez-Palomares JR, Yu L (2011) Performance of the third-generation models of severity scoring systems (APACHE IV, SAPS 3 and MPM-III) in acute kidney injury critically ill patients. Nephrol Dial Transplant 26:3894–3901
21. Lim SY, Ham CR, Park SY, Kim S, Park MR, Jeon K, Um SW, Chung MP, Kim H, Kwon OJ, Suh GY (2011) Validation of the Simplified Acute Physiology Score 3 scoring system in a Korean intensive care unit. Yonsei Med J 52:59–64
22. Soares M, Silva UV, Teles JM, Silva E, Caruso P, Lobo SM, Dal PF, Azevedo LP, de Carvalho FB, Salluh JI (2010) Validation of four prognostic scores in patients with cancer admitted to Brazilian intensive care units: results from a prospective multicenter study. Intensive Care Med 36:1188–1195
23. Maccariello E, Valente C, Nogueira L, Bonomo H, Ismael M, Machado JE, Baldotto F, Godinho M, Valenca R, Rocha E, Soares M (2010) SAPS 3 scores at the start of renal replacement therapy predict mortality in critically ill patients with acute kidney injury. Kidney Int 77:51–56
24. Metnitz B, Schaden E, Moreno R, Le Gall JR, Bauer P, Metnitz PG (2009) Austrian validation and customization of the SAPS 3 admission score. Intensive Care Med 35:616–622
25. Tsai CW, Lin YF, Wu VC, Chu TS, Chen YM, Hu FC, Wu KD, Ko WJ (2008) SAPS 3 at dialysis commencement is predictive of hospital mortality in patients supported by extracorporeal membrane oxygenation and acute dialysis. Eur J Cardiothorac Surg 34:1158–1164
26. Soares M, Salluh JI (2006) Validation of the SAPS 3 admission prognostic model in patients with cancer in need of intensive care. Intensive Care Med 32:1839–1844
27. Strand K, Soreide E, Aardal S, Flaatten H (2009) A comparison of SAPS II and SAPS 3 in a Norwegian intensive care unit population. Acta Anaesthesiol Scand 53:595–600
28. Zajac K, Andres J, Zajac M (2009) A comparison of SAPS 2 and SAPS 3. Acta Anaesthesiol Scand 53:1230–1231
29. Strand K, Strand LI, Flaatten H (2010) The interrater reliability of SAPS II and SAPS 3. Intensive Care Med 36:850–853
Metadata
Title
Calibration strategies to validate predictive models: is new always better?
Author
Nicolás Serrano
Publication date
01-08-2012
Publisher
Springer-Verlag
Published in
Intensive Care Medicine / Issue 8/2012
Print ISSN: 0342-4642
Electronic ISSN: 1432-1238
DOI
https://doi.org/10.1007/s00134-012-2579-z
