Skip to main content
Top
Published in: BMC Medical Research Methodology 1/2017

Open Access 01-12-2017 | Research article

Tweedie distributions for fitting semicontinuous health care utilization cost data

Author: Christoph F. Kurz

Published in: BMC Medical Research Methodology | Issue 1/2017

Login to get access

Abstract

Background

The statistical analysis of health care cost data is often problematic because these data are usually non-negative, right-skewed and have excess zeros for non-users. This prevents the use of linear models based on the Gaussian or Gamma distribution. A common way to counter this is the use of Two-part or Tobit models, which makes interpretation of the results more difficult. In this study, I explore a statistical distribution from the Tweedie family of distributions that can simultaneously model the probability of zero outcome, i.e. of being a non-user of health care utilization and continuous costs for users.

Methods

I assess the usefulness of the Tweedie model in a Monte Carlo simulation study that addresses two common situations of low and high correlation of the users and the non-users of health care utilization. Furthermore, I compare the Tweedie model with several other models using a real data set from the RAND health insurance experiment.

Results

I show that the Tweedie distribution fits cost data very well and provides better fit, especially when the number of non-users is low and the correlation between users and non-users is high.

Conclusion

The Tweedie distribution provides an interesting solution to many statistical problems in health economic analyses.
Literature
1.
go back to reference Min Y, Agresti A. Modeling nonnegative data with clumping at zero: a survey. J Iranian Stat Soc. 2002; 1(1):7–33. Min Y, Agresti A. Modeling nonnegative data with clumping at zero: a survey. J Iranian Stat Soc. 2002; 1(1):7–33.
2.
go back to reference Duan N, Manning WG, Morris CN, Newhouse JP. A comparison of alternative models for the demand for medical care. J Bus Econ Stat. 1983; 1(2):115–26. Duan N, Manning WG, Morris CN, Newhouse JP. A comparison of alternative models for the demand for medical care. J Bus Econ Stat. 1983; 1(2):115–26.
3.
go back to reference Winkelmann R. Health care reform and the number of doctor visits–an econometric analysis. J Appl Econ. 2004; 19(4):455–72.CrossRef Winkelmann R. Health care reform and the number of doctor visits–an econometric analysis. J Appl Econ. 2004; 19(4):455–72.CrossRef
4.
go back to reference Van Ophem H. The frequency of visiting a doctor: is the decision to go independent of the frequency?J Appl Econ. 2011; 26(5):872–9.CrossRef Van Ophem H. The frequency of visiting a doctor: is the decision to go independent of the frequency?J Appl Econ. 2011; 26(5):872–9.CrossRef
5.
go back to reference Tobin J. Estimation of relationships for limited dependent variables. Econometrica J Econometric Soc. 1958; 26(1):24–36.CrossRef Tobin J. Estimation of relationships for limited dependent variables. Econometrica J Econometric Soc. 1958; 26(1):24–36.CrossRef
6.
go back to reference Buntin MB, Zaslavsky AM. Too much ado about two-part models and transformation?: Comparing methods of modeling medicare expenditures. J Health Economics. 2004; 23(3):525–42.CrossRef Buntin MB, Zaslavsky AM. Too much ado about two-part models and transformation?: Comparing methods of modeling medicare expenditures. J Health Economics. 2004; 23(3):525–42.CrossRef
7.
go back to reference Manning WG, Basu A, Mullahy J. Generalized modeling approaches to risk adjustment of skewed outcomes data. J Health Economics. 2005; 24(3):465–88.CrossRef Manning WG, Basu A, Mullahy J. Generalized modeling approaches to risk adjustment of skewed outcomes data. J Health Economics. 2005; 24(3):465–88.CrossRef
8.
go back to reference Jones AM, Lomas J, Moore PT, Rice N. A quasi-monte-carlo comparison of parametric and semiparametric regression methods for heavy-tailed and non-normal data: an application to healthcare costs. J Royal Stat Soc Ser A (Stat Soc). 2016; 179(4):951–74. doi:10.1111/rssa.12141.CrossRef Jones AM, Lomas J, Moore PT, Rice N. A quasi-monte-carlo comparison of parametric and semiparametric regression methods for heavy-tailed and non-normal data: an application to healthcare costs. J Royal Stat Soc Ser A (Stat Soc). 2016; 179(4):951–74. doi:10.​1111/​rssa.​12141.CrossRef
9.
go back to reference Basu A, Arondekar BV, Rathouz PJ. Scale of interest versus scale of estimation: comparing alternative estimators for the incremental costs of a comorbidity. Health Econ. 2006; 15(10):1091–1107. doi:10.1002/hec.1099.CrossRefPubMed Basu A, Arondekar BV, Rathouz PJ. Scale of interest versus scale of estimation: comparing alternative estimators for the incremental costs of a comorbidity. Health Econ. 2006; 15(10):1091–1107. doi:10.​1002/​hec.​1099.CrossRefPubMed
10.
go back to reference Hill SC, Miller GE. Health expenditure estimation and functional form: applications of the generalized gamma and extended estimating equations models. Health Econ. 2010; 19(5):608–27. doi:10.1002/hec.1498.PubMed Hill SC, Miller GE. Health expenditure estimation and functional form: applications of the generalized gamma and extended estimating equations models. Health Econ. 2010; 19(5):608–27. doi:10.​1002/​hec.​1498.PubMed
11.
12.
go back to reference Jorgensen B. The Theory of Dispersion Models, 1 edition: Chapman and Hall/CRC; 1997. Jorgensen B. The Theory of Dispersion Models, 1 edition: Chapman and Hall/CRC; 1997.
13.
go back to reference Dunn PK. Occurrence and quantity of precipitation can be modelled simultaneously. Int J Climatol. 2004; 24(10):1231–1239.CrossRef Dunn PK. Occurrence and quantity of precipitation can be modelled simultaneously. Int J Climatol. 2004; 24(10):1231–1239.CrossRef
14.
go back to reference Smyth GK, Jørgensen B. Fitting tweedie’s compound poisson model to insurance claims data: dispersion modelling. Astin Bulletin. 2002; 32(01):143–57.CrossRef Smyth GK, Jørgensen B. Fitting tweedie’s compound poisson model to insurance claims data: dispersion modelling. Astin Bulletin. 2002; 32(01):143–57.CrossRef
15.
go back to reference Mihaylova B, Briggs A, O’Hagan A, Thompson SG. Review of statistical methods for analysing healthcare resources and costs. Health Econ. 2011; 20(8):897–916.CrossRefPubMed Mihaylova B, Briggs A, O’Hagan A, Thompson SG. Review of statistical methods for analysing healthcare resources and costs. Health Econ. 2011; 20(8):897–916.CrossRefPubMed
16.
go back to reference Zhang Y. Likelihood-based and bayesian methods for tweedie compound poisson linear mixed models. Stat Comput. 2013; 23(6):743–57.CrossRef Zhang Y. Likelihood-based and bayesian methods for tweedie compound poisson linear mixed models. Stat Comput. 2013; 23(6):743–57.CrossRef
17.
go back to reference Dunn PK, Smyth GK. Evaluation of tweedie exponential dispersion model densities by fourier inversion. Stat Comput. 2008; 18(1):73–86.CrossRef Dunn PK, Smyth GK. Evaluation of tweedie exponential dispersion model densities by fourier inversion. Stat Comput. 2008; 18(1):73–86.CrossRef
18.
go back to reference McCullagh P, Nelder JA. Generalized Linear Models. Vol. 37: CRC press; 1989. McCullagh P, Nelder JA. Generalized Linear Models. Vol. 37: CRC press; 1989.
19.
go back to reference Deb P, Trivedi PK. The structure of demand for health care: latent class versus two-part models. J Health Econ. 2002; 21(4):601–25.CrossRefPubMed Deb P, Trivedi PK. The structure of demand for health care: latent class versus two-part models. J Health Econ. 2002; 21(4):601–25.CrossRefPubMed
20.
go back to reference Basu A, Rathouz PJ. Estimating marginal and incremental effects on health outcomes using flexible link and variance function models. Biostatistics. 2005; 6(1):93–109.CrossRefPubMed Basu A, Rathouz PJ. Estimating marginal and incremental effects on health outcomes using flexible link and variance function models. Biostatistics. 2005; 6(1):93–109.CrossRefPubMed
21.
22.
go back to reference Deb P, Burgess JF. A quasi-experimental comparison of econometric models for health care expenditures. Hunter College Department of Economics Working Papers. 2003;212. Deb P, Burgess JF. A quasi-experimental comparison of econometric models for health care expenditures. Hunter College Department of Economics Working Papers. 2003;212.
23.
go back to reference Dunn PK. Tweedie: Tweedie Exponential Family Models. 2014:1–32. R package version 2.2.1. Dunn PK. Tweedie: Tweedie Exponential Family Models. 2014:1–32. R package version 2.2.1.
24.
go back to reference Swallow B, Buckland ST, King R, Toms MP. Bayesian hierarchical modelling of continuous non-negative longitudinal data with a spike at zero: An application to a study of birds visiting gardens in winter. Biometrical J. 2016; 58(2):357–71.CrossRef Swallow B, Buckland ST, King R, Toms MP. Bayesian hierarchical modelling of continuous non-negative longitudinal data with a spike at zero: An application to a study of birds visiting gardens in winter. Biometrical J. 2016; 58(2):357–71.CrossRef
Metadata
Title
Tweedie distributions for fitting semicontinuous health care utilization cost data
Author
Christoph F. Kurz
Publication date
01-12-2017
Publisher
BioMed Central
Published in
BMC Medical Research Methodology / Issue 1/2017
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/s12874-017-0445-y

Other articles of this Issue 1/2017

BMC Medical Research Methodology 1/2017 Go to the issue