Skip to main content
Top
Published in: Critical Care 5/2005

Open Access 01-10-2005 | Research

Application of a population-based severity scoring system to individual patients results in frequent misclassification

Authors: Frank V Booth, Mary Short, Andrew F Shorr, Nancy Arkins, Becky Bates, Rebecca L Qualy, Howard Levy

Published in: Critical Care | Issue 5/2005

Login to get access

Abstract

Introduction

APACHE II (AP2) was developed to allow a systematic examination of intensive care unit outcomes in a risk adjusted manner. AP2 has been widely adopted in clinical trials to assure broad consistency amongst different groups. Although errors in calculating the true AP2 score may not be reducible below 15%, the self-canceling effect of random errors reduces the importance of such errors when applied to large populations. It has been suggested that a threshold AP2 score be used in clinical decision making for individual patients. This study reports the AP2 scoring errors of researchers involved in a large sepsis trial and models the consequences of such an error rate for individual severe sepsis patients.

Methods

Fifty-six researchers with explicit training in data abstraction and completion of the AP2 score received scenarios consisting of composites of real patient histories. Descriptive statistics were calculated for each scenario. The standard deviations were calculated compared with an adjudicated score. Intraclass correlations for inter-observer reliability were performed using Shrout-Fleiss methodology. Theoretical distribution curves were calculated for a broad range of AP2 scores using standard deviations of 6, 9 and 12. For each curve, the misclassification rate was determined using an AP2 score cut-off of ≥25. The percentage of misclassifications for each true AP2 score was then applied to the corresponding AP2 score obtained from the PROGRESS severe sepsis registry.

Results

The error rate for the total AP2 score was 86% (individual variables were in the range 10% to 87%). Intraclass correlation for the inter-observer reliability was 0.51. Of the patients from the PROGRESS registry. 50% had AP2 scores in the range 17 to 28. Within this interquartile range, 70% to 85% of all misclassified patients would reside.

Conclusion

It is more likely that an individual patient will be scored incorrectly than correctly. The data obtained from the scenarios indicated that as the true AP2 score approached an arbitrary cut-off point of 25, the observed misclassification rate increased. Integrating our study of AP2 score errors with the published literature leads us to conclude that the AP2 is an inappropriate sole tool for resource allocation decisions for individual patients.
Appendix
Available only for authorised users
Literature
1.
go back to reference Knaus WA, Draper EA, Wagner DP, Zimmerman JE: APACHE II: a severity of disease classification system. Crit Care Med 1985, 13: 818-829.CrossRefPubMed Knaus WA, Draper EA, Wagner DP, Zimmerman JE: APACHE II: a severity of disease classification system. Crit Care Med 1985, 13: 818-829.CrossRefPubMed
2.
go back to reference Shrout PE, Fleiss JL: Intraclass correlations: uses in assessing rater reliability. Psychological Bulletin 1979, 86: 420-429. 10.1037//0033-2909.86.2.420CrossRefPubMed Shrout PE, Fleiss JL: Intraclass correlations: uses in assessing rater reliability. Psychological Bulletin 1979, 86: 420-429. 10.1037//0033-2909.86.2.420CrossRefPubMed
3.
go back to reference Beale R, Reinhart K, Silva E, Dobb G, Sarwat S, Garg R, Vincent JL: Comparison of PROGRESS Severe Sepsis Registry patients to INDEPTH Integrated Severe Sepsis Clinical Trial Database placebo patients. Chest Meeting Abstracts; Chest 2004, 126: 864S. Abstract #496 Beale R, Reinhart K, Silva E, Dobb G, Sarwat S, Garg R, Vincent JL: Comparison of PROGRESS Severe Sepsis Registry patients to INDEPTH Integrated Severe Sepsis Clinical Trial Database placebo patients. Chest Meeting Abstracts; Chest 2004, 126: 864S. Abstract #496
4.
go back to reference Polderman KH, Thijs LG, Girbes AR: Interobserver variability in the use of APACHE II scores. Lancet 1999, 353: 380. 10.1016/S0140-6736(05)74953-9CrossRefPubMed Polderman KH, Thijs LG, Girbes AR: Interobserver variability in the use of APACHE II scores. Lancet 1999, 353: 380. 10.1016/S0140-6736(05)74953-9CrossRefPubMed
5.
go back to reference Chen LM, Martin CM, Morrison TL, Sibbald WJ: Interobserver variability in data collection of the APACHE II score in teaching and community hospitals. Crit Care Med 1999, 27: 1999-2004. 10.1097/00003246-199909000-00046CrossRefPubMed Chen LM, Martin CM, Morrison TL, Sibbald WJ: Interobserver variability in data collection of the APACHE II score in teaching and community hospitals. Crit Care Med 1999, 27: 1999-2004. 10.1097/00003246-199909000-00046CrossRefPubMed
6.
go back to reference Rowley G, Fielding K: Reliability and accuracy of the Glasgow Coma Scale with experienced and inexperienced users. Lancet 1991, 337: 535-538. 10.1016/0140-6736(91)91309-ICrossRefPubMed Rowley G, Fielding K: Reliability and accuracy of the Glasgow Coma Scale with experienced and inexperienced users. Lancet 1991, 337: 535-538. 10.1016/0140-6736(91)91309-ICrossRefPubMed
7.
go back to reference Cerra FB, Negro F, Abrams J: APACHE II score does not predict multiple organ failure or mortality in postoperative surgical patients. Arch Surg 1990, 125: 519-522.CrossRefPubMed Cerra FB, Negro F, Abrams J: APACHE II score does not predict multiple organ failure or mortality in postoperative surgical patients. Arch Surg 1990, 125: 519-522.CrossRefPubMed
8.
go back to reference Bernard GR, Vincent J-L, Laterre P-F, LaRosa SP, Dhainaut J-F, Lopez-Rodriguez A, Steingrub JS, Garber GE, Helterbrand JD, Ely EW, et al.: Efficacy and safety of recombinant human activated protein C for severe sepsis. N Engl J Med 2001, 344: 699-709. 10.1056/NEJM200103083441001CrossRefPubMed Bernard GR, Vincent J-L, Laterre P-F, LaRosa SP, Dhainaut J-F, Lopez-Rodriguez A, Steingrub JS, Garber GE, Helterbrand JD, Ely EW, et al.: Efficacy and safety of recombinant human activated protein C for severe sepsis. N Engl J Med 2001, 344: 699-709. 10.1056/NEJM200103083441001CrossRefPubMed
Metadata
Title
Application of a population-based severity scoring system to individual patients results in frequent misclassification
Authors
Frank V Booth
Mary Short
Andrew F Shorr
Nancy Arkins
Becky Bates
Rebecca L Qualy
Howard Levy
Publication date
01-10-2005
Publisher
BioMed Central
Published in
Critical Care / Issue 5/2005
Electronic ISSN: 1364-8535
DOI
https://doi.org/10.1186/cc3790

Other articles of this Issue 5/2005

Critical Care 5/2005 Go to the issue