Skip to main content
Top
Published in:

Open Access 03-01-2024 | Original Paper

Risk adjustment for regional healthcare funding allocations with ensemble methods: an empirical study and interpretation

Authors: Tuukka Holster, Shaoxiong Ji, Pekka Marttinen

Published in: The European Journal of Health Economics | Issue 7/2024

Login to get access

Abstract

We experiment with recent ensemble machine learning methods in estimating healthcare costs, utilizing Finnish data containing rich individual-level information on healthcare costs, socioeconomic status and diagnostic data from multiple registries. Our data are a random 10% sample (553,675 observations) from the Finnish population in 2017. Using annual healthcare cost in 2017 as a response variable, we compare the performance of Random forest, Gradient Boosting Machine (GBM) and eXtreme Gradient Boosting (XGBoost) to linear regression. As machine learning methods are often seen as unsuitable in risk adjustment applications because of their relative opaqueness, we also introduce visualizations from the machine learning literature to help interpret the contribution of individual variables to the prediction. Our results show that ensemble machine learning methods can improve predictive performance, with all of them significantly outperforming linear regression, and that a certain level of interpretation can be provided for them. We also find individual-level socioeconomic variables to improve prediction accuracy and that their effect is larger for machine learning methods. However, we find that the predictions used for funding allocations are sensitive to model selection, highlighting the need for comprehensive robustness testing when estimating risk adjustment models used in applications.
Literature
3.
go back to reference van Kleef, R. C., Schut, F. T., van de Ven, W. P.: Premium regulation, risk equalization, risk sharing, and subsidies: Effects on affordability and efficiency. In: McGuire. T. G., van Kleef, R. C. (eds.) Risk adjustment, risk sharing and premium regulation in health insurance markets, pp. 21– 54. Academic Press (2018). https://doi.org/10.1016/B978-0-12-811325-7.00002-6 van Kleef, R. C., Schut, F. T., van de Ven, W. P.: Premium regulation, risk equalization, risk sharing, and subsidies: Effects on affordability and efficiency. In: McGuire. T. G., van Kleef, R. C. (eds.) Risk adjustment, risk sharing and premium regulation in health insurance markets, pp. 21– 54. Academic Press (2018). https://​doi.​org/​10.​1016/​B978-0-12-811325-7.​00002-6
4.
go back to reference Chaplin, M., Beatson, S., Yiu-Shing, L., Davies, C., Smyth, C., Burrows, J., Weir, R., Tatarek-Gintowt, R.: Refreshing the Formulae for CCG Allocations. For allocations to Clinical Commissioning Groups from 2016–2017. Report on the methods and modelling. ANHS England, Analytical Services (Finance) (2016) Chaplin, M., Beatson, S., Yiu-Shing, L., Davies, C., Smyth, C., Burrows, J., Weir, R., Tatarek-Gintowt, R.: Refreshing the Formulae for CCG Allocations. For allocations to Clinical Commissioning Groups from 2016–2017. Report on the methods and modelling. ANHS England, Analytical Services (Finance) (2016)
6.
go back to reference Smith, P. C.: Formula funding of public services. Taylor & Francis (2007) Smith, P. C.: Formula funding of public services. Taylor & Francis (2007)
8.
go back to reference Keskimäki, I., Tynkkynen, L.K., Reissell, E., Koivusalo, M., Syrjä, V., Vuorenkoski, L., Rechel, B., Karanikolos, M.: Finland: health system review. Health Syst. Transit. 21(2), 1–166 (2019)PubMed Keskimäki, I., Tynkkynen, L.K., Reissell, E., Koivusalo, M., Syrjä, V., Vuorenkoski, L., Rechel, B., Karanikolos, M.: Finland: health system review. Health Syst. Transit. 21(2), 1–166 (2019)PubMed
9.
go back to reference Häkkinen, U., Holster, T., Haula, T., Kapiainen, S., Kokko, P., Korajoki, M., Mäklin, S., Nguyen, L., Puroharju, T., Peltola, M.: Need adjustment of funding of social and health services [Sote-rahoituksen tarvevakiointi]. THL-raportti 6/2020 (2020) Häkkinen, U., Holster, T., Haula, T., Kapiainen, S., Kokko, P., Korajoki, M., Mäklin, S., Nguyen, L., Puroharju, T., Peltola, M.: Need adjustment of funding of social and health services [Sote-rahoituksen tarvevakiointi]. THL-raportti 6/2020 (2020)
10.
go back to reference Holster, T., Haula, T., Korajoki, M.: Need adjustment of funding of social and health services: 2022 update [Sote-rahoituksen tarvevakiointi: Päivitys 2022]. THL-työpaperi 26/2022 (2022) Holster, T., Haula, T., Korajoki, M.: Need adjustment of funding of social and health services: 2022 update [Sote-rahoituksen tarvevakiointi: Päivitys 2022]. THL-työpaperi 26/2022 (2022)
17.
go back to reference van de Ven, W., Hamstra, G., van Kleef, R.: The goal of risk equalization in regulated competitive health insurance markets. Eur. J. Health Econ. 24, 111–123 (2023)CrossRefPubMed van de Ven, W., Hamstra, G., van Kleef, R.: The goal of risk equalization in regulated competitive health insurance markets. Eur. J. Health Econ. 24, 111–123 (2023)CrossRefPubMed
18.
go back to reference Rose, S.: A machine learning framework for plan payment risk adjustment. Health Serv. Res. 51(6), 2358–2374. Rose, S.: A machine learning framework for plan payment risk adjustment. Health Serv. Res. 51(6), 2358–2374.
20.
go back to reference Irvin, J.A., Kondrich, A.A., Ko, M., Rajpurkar, P., Haghgoo, B., Landon, B.E., Phillips, R.L., Pettersson, S., Ng, A.Y., Basu, S.: Incorporating machine learning and social determinants of health indicators into prospective risk adjustment for health plan payments. BMC Public Health 20, 1–10 (2020). https://doi.org/10.1186/s12889-020-08735-0CrossRef Irvin, J.A., Kondrich, A.A., Ko, M., Rajpurkar, P., Haghgoo, B., Landon, B.E., Phillips, R.L., Pettersson, S., Ng, A.Y., Basu, S.: Incorporating machine learning and social determinants of health indicators into prospective risk adjustment for health plan payments. BMC Public Health 20, 1–10 (2020). https://​doi.​org/​10.​1186/​s12889-020-08735-0CrossRef
26.
go back to reference Kuhn, M. Johnson, K.: Applied Predictive Modeling. Springer (2013) Kuhn, M. Johnson, K.: Applied Predictive Modeling. Springer (2013)
27.
go back to reference Kuhn, M. Johnson, K.: Feature Engineering and Selection: A Practical Approach for Predictive Modeling. CRC Press (2013) Kuhn, M. Johnson, K.: Feature Engineering and Selection: A Practical Approach for Predictive Modeling. CRC Press (2013)
28.
go back to reference Jones, A.M., Lomas, J., Moore, P.T., Rice, N.: A quasi-Monte-Carlo comparison of parametric and semiparametric regression methods for heavy-tailed and non-normal data: an application to healthcare costs. J. R. Stat. Soc. A. 179(4), 951–974 (2016). https://doi.org/10.1111/rssa.12141CrossRef Jones, A.M., Lomas, J., Moore, P.T., Rice, N.: A quasi-Monte-Carlo comparison of parametric and semiparametric regression methods for heavy-tailed and non-normal data: an application to healthcare costs. J. R. Stat. Soc. A. 179(4), 951–974 (2016). https://​doi.​org/​10.​1111/​rssa.​12141CrossRef
30.
go back to reference Mäklin, S. Kokko, P.: Unit costs of health care and social services in Finland in 2017. [Terveyden- ja sosiaalihuollon yksikkökustannukset suomessa vuonna 2017]. THL-työpaperi 21/2020 (2021) Mäklin, S. Kokko, P.: Unit costs of health care and social services in Finland in 2017. [Terveyden- ja sosiaalihuollon yksikkökustannukset suomessa vuonna 2017]. THL-työpaperi 21/2020 (2021)
31.
go back to reference UNESCO-UIS: International Standard Classification of Education ISCED 2011. UNESCO Institute for Statistics (UIS) (2012) UNESCO-UIS: International Standard Classification of Education ISCED 2011. UNESCO Institute for Statistics (UIS) (2012)
32.
go back to reference Dudley, R. A., Medlin, C. A., Hammann, L. B., Cisternas, M. G., Brand, R., Rennie, D. J., Luft, H. S.: The best of both worlds? Potential of hybrid prospective/concurrent risk adjustment. Medical Care 56–69 (2003) Dudley, R. A., Medlin, C. A., Hammann, L. B., Cisternas, M. G., Brand, R., Rennie, D. J., Luft, H. S.: The best of both worlds? Potential of hybrid prospective/concurrent risk adjustment. Medical Care 56–69 (2003)
33.
go back to reference Breiman, L., Friedman, J., Olshen, R., Stone, C.: Classification and Regression Trees. Chapman and Hall (1984) Breiman, L., Friedman, J., Olshen, R., Stone, C.: Classification and Regression Trees. Chapman and Hall (1984)
34.
go back to reference Wright, M. N., Wager, S., Probst, P.: ranger: A Fast Implementation of Random Forests. R package version 0.12.1 (2020) Wright, M. N., Wager, S., Probst, P.: ranger: A Fast Implementation of Random Forests. R package version 0.12.1 (2020)
35.
go back to reference Greenwell, B., Boehmke, B., Cunningham, J.: gbm: Generalized Boosted Regression Models. R package version 2.1.8 (2020a) Greenwell, B., Boehmke, B., Cunningham, J.: gbm: Generalized Boosted Regression Models. R package version 2.1.8 (2020a)
36.
go back to reference Chen, T. He, T., Benesty, M., Khotilovich, V., Tang, Y., Cho, H., Chen, K., Mitchell, R., Cano, I., Zhou, T., Li, M., Xie, J., Lin, M., Geng, Y., Li, Y.: xgboost: Extreme Gradient Boosting. R package version 1.4.1.1 (2021) Chen, T. He, T., Benesty, M., Khotilovich, V., Tang, Y., Cho, H., Chen, K., Mitchell, R., Cano, I., Zhou, T., Li, M., Xie, J., Lin, M., Geng, Y., Li, Y.: xgboost: Extreme Gradient Boosting. R package version 1.4.1.1 (2021)
38.
go back to reference Friedman, J.H.: Greedy function approximation: a gradient boosting machine. Ann. Stat. 29(5), 1189–1232 (2001)CrossRef Friedman, J.H.: Greedy function approximation: a gradient boosting machine. Ann. Stat. 29(5), 1189–1232 (2001)CrossRef
40.
go back to reference Chen, T., Guestrin, C.: XGBoost: A scalable tree boosting system. In Proceedings of ACM SIGKDD, 785–794. Chen, T., Guestrin, C.: XGBoost: A scalable tree boosting system. In Proceedings of ACM SIGKDD, 785–794.
42.
go back to reference Greenwell, B., Boehmke, B., Gray, B.: vip: Variable Importance Plots. R package version 0.3.2 (2020b) Greenwell, B., Boehmke, B., Gray, B.: vip: Variable Importance Plots. R package version 0.3.2 (2020b)
43.
go back to reference Greenwell, B.M.: pdp: an R package for constructing partial dependence plots. R J 9(1), 421–436 (2017)CrossRef Greenwell, B.M.: pdp: an R package for constructing partial dependence plots. R J 9(1), 421–436 (2017)CrossRef
44.
go back to reference Ribeiro, M. T., Singh, S., Guestrin, C.: “Why should I trust you?” Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1135–1144 (2016) Ribeiro, M. T., Singh, S., Guestrin, C.: “Why should I trust you?” Explaining the predictions of any classifier. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 1135–1144 (2016)
46.
go back to reference Pedersen, T. L., Benesty, M.: lime: Local Interpretable Model-Agnostic Explanations. R package version 0.5.2 (2021) Pedersen, T. L., Benesty, M.: lime: Local Interpretable Model-Agnostic Explanations. R package version 0.5.2 (2021)
Metadata
Title
Risk adjustment for regional healthcare funding allocations with ensemble methods: an empirical study and interpretation
Authors
Tuukka Holster
Shaoxiong Ji
Pekka Marttinen
Publication date
03-01-2024
Publisher
Springer Berlin Heidelberg
Published in
The European Journal of Health Economics / Issue 7/2024
Print ISSN: 1618-7598
Electronic ISSN: 1618-7601
DOI
https://doi.org/10.1007/s10198-023-01656-w