Skip to main content
Top
Published in: BMC Health Services Research 1/2023

Open Access 01-12-2023 | Research

Outlier identification and monitoring of institutional or clinician performance: an overview of statistical methods and application to national audit data

Authors: Menelaos Pavlou, Gareth Ambler, Rumana Z. Omar, Andrew T. Goodwin, Uday Trivedi, Peter Ludman, Mark de Belder

Published in: BMC Health Services Research | Issue 1/2023

Login to get access

Abstract

Background

Institutions or clinicians (units) are often compared according to a performance indicator such as in-hospital mortality. Several approaches have been proposed for the detection of outlying units, whose performance deviates from the overall performance.

Methods

We provide an overview of three approaches commonly used to monitor institutional performances for outlier detection. These are the common-mean model, the ‘Normal-Poisson’ random effects model and the ‘Logistic’ random effects model. For the latter we also propose a visualisation technique. The common-mean model assumes that the underlying true performance of all units is equal and that any observed variation between units is due to chance. Even after applying case-mix adjustment, this assumption is often violated due to overdispersion and a post-hoc correction may need to be applied. The random effects models relax this assumption and explicitly allow the true performance to differ between units, thus offering a more flexible approach. We discuss the strengths and weaknesses of each approach and illustrate their application using audit data from England and Wales on Adult Cardiac Surgery (ACS) and Percutaneous Coronary Intervention (PCI).

Results

In general, the overdispersion-corrected common-mean model and the random effects approaches produced similar p-values for the detection of outliers. For the ACS dataset (41 hospitals) three outliers were identified in total but only one was identified by all methods above. For the PCI dataset (88 hospitals), seven outliers were identified in total but only two were identified by all methods. The common-mean model uncorrected for overdispersion produced several more outliers. The reason for observing similar p-values for all three approaches could be attributed to the fact that the between-hospital variance was relatively small in both datasets, resulting only in a mild violation of the common-mean assumption; in this situation, the overdispersion correction worked well.

Conclusion

If the common-mean assumption is likely to hold, all three methods are appropriate to use for outlier detection and their results should be similar. Random effect methods may be the preferred approach when the common-mean assumption is likely to be violated.
Appendix
Available only for authorised users
Literature
1.
go back to reference Spiegelhalter DJ. Funnel plots for comparing institutional performance. Stat Med. 2005;24(8):1185–202.CrossRef Spiegelhalter DJ. Funnel plots for comparing institutional performance. Stat Med. 2005;24(8):1185–202.CrossRef
2.
go back to reference Jones HE, Spiegelhalter DJ. The identification of “unusual” health-care providers from a hierarchical model. Am Stat. 2011;65(3):154–63.CrossRef Jones HE, Spiegelhalter DJ. The identification of “unusual” health-care providers from a hierarchical model. Am Stat. 2011;65(3):154–63.CrossRef
3.
go back to reference Nashef SA, Roques F, Michel P, Gauducheau E, Lemeshow S, Salamon R. European system for cardiac operative risk evaluation (EuroSCORE). Eur J Cardiothorac Surg. 1999;16(1):9–13.CrossRef Nashef SA, Roques F, Michel P, Gauducheau E, Lemeshow S, Salamon R. European system for cardiac operative risk evaluation (EuroSCORE). Eur J Cardiothorac Surg. 1999;16(1):9–13.CrossRef
4.
go back to reference Roques F, Michel P, Goldstone AR, Nashef SA. The logistic EuroSCORE. Eur Heart J. 2003;24(9):881–2.CrossRef Roques F, Michel P, Goldstone AR, Nashef SA. The logistic EuroSCORE. Eur Heart J. 2003;24(9):881–2.CrossRef
5.
go back to reference Spiegelhalter DJ. Handling over-dispersion of performance indicators. Qual Saf Health Care. 2005;14(5):347–51.CrossRef Spiegelhalter DJ. Handling over-dispersion of performance indicators. Qual Saf Health Care. 2005;14(5):347–51.CrossRef
6.
go back to reference Skrondal A, Rabe-Hesketh S. Prediction in multilevel generalized linear models. J R Stat Soc Ser A Stat Soc. 2009;172(3):659–87.CrossRef Skrondal A, Rabe-Hesketh S. Prediction in multilevel generalized linear models. J R Stat Soc Ser A Stat Soc. 2009;172(3):659–87.CrossRef
7.
go back to reference Spiegelhalter D, Sherlaw-Johnson C, Bardsley M, Blunt I, Wood C, Grigg O. Statistical methods for healthcare regulation: rating, screening and surveillance. J R Stat Soc Ser A Stat Soc. 2012;175(1):1–47.CrossRef Spiegelhalter D, Sherlaw-Johnson C, Bardsley M, Blunt I, Wood C, Grigg O. Statistical methods for healthcare regulation: rating, screening and surveillance. J R Stat Soc Ser A Stat Soc. 2012;175(1):1–47.CrossRef
8.
go back to reference Breslow NE, Day NE. Statistical methods in cancer research. Volume II--The design and analysis of cohort studies. IARC Sci Publ. 1987;82:1–406. Breslow NE, Day NE. Statistical methods in cancer research. Volume II--The design and analysis of cohort studies. IARC Sci Publ. 1987;82:1–406.
9.
go back to reference Eldridge SM, Ukoumunne OC, Carlin JB. The intra-cluster correlation coefficient in cluster randomized trials: a review of definitions. Int Stat Rev. 2009;77(3):378–94.CrossRef Eldridge SM, Ukoumunne OC, Carlin JB. The intra-cluster correlation coefficient in cluster randomized trials: a review of definitions. Int Stat Rev. 2009;77(3):378–94.CrossRef
10.
go back to reference Efron B, Morris C. Stein's paradox in statistics. Scientific American - SCI AMER. 1977;236:119–27.CrossRef Efron B, Morris C. Stein's paradox in statistics. Scientific American - SCI AMER. 1977;236:119–27.CrossRef
11.
go back to reference MacKenzie TA, Grunkemeier GL, Grunwald GK, O’Malley AJ, Bohn C, Wu Y, et al. A primer on using shrinkage to compare in-hospital mortality between centers. Ann Thorac Surg. 2015;99(3):757–61.CrossRef MacKenzie TA, Grunkemeier GL, Grunwald GK, O’Malley AJ, Bohn C, Wu Y, et al. A primer on using shrinkage to compare in-hospital mortality between centers. Ann Thorac Surg. 2015;99(3):757–61.CrossRef
12.
go back to reference McAllister KS, Ludman PF, Hulme W, de Belder MA, Stables R, Chowdhary S, et al. A contemporary risk model for predicting 30-day mortality following percutaneous coronary intervention in England and Wales. Int J Cardiol. 2016;210:125–32.CrossRef McAllister KS, Ludman PF, Hulme W, de Belder MA, Stables R, Chowdhary S, et al. A contemporary risk model for predicting 30-day mortality following percutaneous coronary intervention in England and Wales. Int J Cardiol. 2016;210:125–32.CrossRef
13.
go back to reference Mohammed MA, Manktelow BN, Hofer TP. Comparison of four methods for deriving hospital standardised mortality ratios from a single hierarchical logistic regression model. Stat Methods Med Res. 2016;25(2):706–15.CrossRef Mohammed MA, Manktelow BN, Hofer TP. Comparison of four methods for deriving hospital standardised mortality ratios from a single hierarchical logistic regression model. Stat Methods Med Res. 2016;25(2):706–15.CrossRef
14.
go back to reference DeLong ER, Peterson ED, DeLong DM, Muhlbaier LH, Hackett S, Mark DB. Comparing risk-adjustment methods for provider profiling. Stat Med. 1997;16(23):2645–64.CrossRef DeLong ER, Peterson ED, DeLong DM, Muhlbaier LH, Hackett S, Mark DB. Comparing risk-adjustment methods for provider profiling. Stat Med. 1997;16(23):2645–64.CrossRef
15.
go back to reference Ohlssen DI, Sharples LD, Spiegelhalter DJ. A hierarchical modelling framework for identifying unusual performance in health care providers. J R Stat Soc Ser A Stat Soc. 2007;170(4):865–90.CrossRef Ohlssen DI, Sharples LD, Spiegelhalter DJ. A hierarchical modelling framework for identifying unusual performance in health care providers. J R Stat Soc Ser A Stat Soc. 2007;170(4):865–90.CrossRef
16.
go back to reference Racz MJ, Sedransk J. Bayesian and frequentist methods for provider profiling using risk-adjusted assessments of medical outcomes. J Am Stat Assoc. 2010;105(489):48–58.CrossRef Racz MJ, Sedransk J. Bayesian and frequentist methods for provider profiling using risk-adjusted assessments of medical outcomes. J Am Stat Assoc. 2010;105(489):48–58.CrossRef
17.
go back to reference Austin PC, Alter DA, Tu JV. The use of fixed- and random-effects models for classifying hospitals as mortality outliers: a Monte Carlo assessment. Med Decis Making. 2003;23(6):526–39.CrossRef Austin PC, Alter DA, Tu JV. The use of fixed- and random-effects models for classifying hospitals as mortality outliers: a Monte Carlo assessment. Med Decis Making. 2003;23(6):526–39.CrossRef
18.
go back to reference Jones HE, Ohlssen DI, Spiegelhalter DJ. Use of the false discovery rate when comparing multiple health care providers. J Clin Epidemiol. 2008;61(3):232–40.CrossRef Jones HE, Ohlssen DI, Spiegelhalter DJ. Use of the false discovery rate when comparing multiple health care providers. J Clin Epidemiol. 2008;61(3):232–40.CrossRef
19.
go back to reference Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Methodol. 1995;57(1):289–300. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B Methodol. 1995;57(1):289–300.
Metadata
Title
Outlier identification and monitoring of institutional or clinician performance: an overview of statistical methods and application to national audit data
Authors
Menelaos Pavlou
Gareth Ambler
Rumana Z. Omar
Andrew T. Goodwin
Uday Trivedi
Peter Ludman
Mark de Belder
Publication date
01-12-2023
Publisher
BioMed Central
Published in
BMC Health Services Research / Issue 1/2023
Electronic ISSN: 1472-6963
DOI
https://doi.org/10.1186/s12913-022-08995-z

Other articles of this Issue 1/2023

BMC Health Services Research 1/2023 Go to the issue