Skip to main content
Top
Published in: BMC Medical Research Methodology 1/2011

Open Access 01-12-2011 | Research article

Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes

Authors: Baoyue Li, Hester F Lingsma, Ewout W Steyerberg, Emmanuel Lesaffre

Published in: BMC Medical Research Methodology | Issue 1/2011

Login to get access

Abstract

Background

Logistic random effects models are a popular tool to analyze multilevel also called hierarchical data with a binary or ordinal outcome. Here, we aim to compare different statistical software implementations of these models.

Methods

We used individual patient data from 8509 patients in 231 centers with moderate and severe Traumatic Brain Injury (TBI) enrolled in eight Randomized Controlled Trials (RCTs) and three observational studies. We fitted logistic random effects regression models with the 5-point Glasgow Outcome Scale (GOS) as outcome, both dichotomized as well as ordinal, with center and/or trial as random effects, and as covariates age, motor score, pupil reactivity or trial. We then compared the implementations of frequentist and Bayesian methods to estimate the fixed and random effects. Frequentist approaches included R (lme4), Stata (GLLAMM), SAS (GLIMMIX and NLMIXED), MLwiN ([R]IGLS) and MIXOR, Bayesian approaches included WinBUGS, MLwiN (MCMC), R package MCMCglmm and SAS experimental procedure MCMC.
Three data sets (the full data set and two sub-datasets) were analysed using basically two logistic random effects models with either one random effect for the center or two random effects for center and trial. For the ordinal outcome in the full data set also a proportional odds model with a random center effect was fitted.

Results

The packages gave similar parameter estimates for both the fixed and random effects and for the binary (and ordinal) models for the main study and when based on a relatively large number of level-1 (patient level) data compared to the number of level-2 (hospital level) data. However, when based on relatively sparse data set, i.e. when the numbers of level-1 and level-2 data units were about the same, the frequentist and Bayesian approaches showed somewhat different results. The software implementations differ considerably in flexibility, computation time, and usability. There are also differences in the availability of additional tools for model evaluation, such as diagnostic plots. The experimental SAS (version 9.2) procedure MCMC appeared to be inefficient.

Conclusions

On relatively large data sets, the different software implementations of logistic random effects regression models produced similar results. Thus, for a large data set there seems to be no explicit preference (of course if there is no preference from a philosophical point of view) for either a frequentist or Bayesian approach (if based on vague priors). The choice for a particular implementation may largely depend on the desired flexibility, and the usability of the package. For small data sets the random effects variances are difficult to estimate. In the frequentist approaches the MLE of this variance was often estimated zero with a standard error that is either zero or could not be determined, while for Bayesian methods the estimates could depend on the chosen "non-informative" prior of the variance parameter. The starting value for the variance parameter may be also critical for the convergence of the Markov chain.
Appendix
Available only for authorised users
Literature
2.
go back to reference Molenberghs G, Verbeke G: Models for Discrete Longitudinal Data. 2005, Berlin, Springer Molenberghs G, Verbeke G: Models for Discrete Longitudinal Data. 2005, Berlin, Springer
3.
go back to reference Peter C, Jack V, David A: Comparing hierarchical modeling with traditional logistic regression analysis among patients hospitalized with acute myocardial infarction: should we be analyzing cardiovascular outcomes data differently?. American Heart Journal. 2003, 145: 27-35. 10.1067/mhj.2003.23.CrossRef Peter C, Jack V, David A: Comparing hierarchical modeling with traditional logistic regression analysis among patients hospitalized with acute myocardial infarction: should we be analyzing cardiovascular outcomes data differently?. American Heart Journal. 2003, 145: 27-35. 10.1067/mhj.2003.23.CrossRef
4.
go back to reference Zhou X, Perkins AJ, Hui SL: Comparisons of software packages for generalized linear multilevel models. The American Statistician. 1999, 53: 282-290. 10.2307/2686112. Zhou X, Perkins AJ, Hui SL: Comparisons of software packages for generalized linear multilevel models. The American Statistician. 1999, 53: 282-290. 10.2307/2686112.
5.
go back to reference Guo G, Zhao H: Multilevel modeling for binary data. Annual Review of Sociology. 2000, 26: 441-462. 10.1146/annurev.soc.26.1.441.CrossRef Guo G, Zhao H: Multilevel modeling for binary data. Annual Review of Sociology. 2000, 26: 441-462. 10.1146/annurev.soc.26.1.441.CrossRef
7.
go back to reference Marmarou A, Lu J, Butcher I, McHugh GS, Mushkudiani NA, Murray GD, Steyerberg EW: IMPACT Database of Traumatic Brain Injury: Design and Description. Journal of Neurotrauma. 2007, 24: 239-250. 10.1089/neu.2006.0036.CrossRefPubMed Marmarou A, Lu J, Butcher I, McHugh GS, Mushkudiani NA, Murray GD, Steyerberg EW: IMPACT Database of Traumatic Brain Injury: Design and Description. Journal of Neurotrauma. 2007, 24: 239-250. 10.1089/neu.2006.0036.CrossRefPubMed
8.
go back to reference Maas AI, Marmarou A, Murray GD, Teasdale SG, Steyerberg EW: Prognosis and Clinical Trial Design in Traumatic Brain Injury: The IMPACT Study. Journal of Neurotrauma. 2007, 24: 232-238. 10.1089/neu.2006.0024.CrossRefPubMed Maas AI, Marmarou A, Murray GD, Teasdale SG, Steyerberg EW: Prognosis and Clinical Trial Design in Traumatic Brain Injury: The IMPACT Study. Journal of Neurotrauma. 2007, 24: 232-238. 10.1089/neu.2006.0024.CrossRefPubMed
9.
go back to reference Steyerberg EW, Mushkudiani N, Perel P, Butcher I, Lu J, McHugh GS: Predicting outcome after traumatic brain injury: development and international validation of prognostic scores based on admission characteristics. PLoS Medicine. 2008, 5: 1251-1261.CrossRef Steyerberg EW, Mushkudiani N, Perel P, Butcher I, Lu J, McHugh GS: Predicting outcome after traumatic brain injury: development and international validation of prognostic scores based on admission characteristics. PLoS Medicine. 2008, 5: 1251-1261.CrossRef
10.
go back to reference McHugh GS, Butcher I, Steyerberg EW, Lu J, Mushkudiani NA, Marmarou A, Maas AI, Murray GD: Statistical approaches to the univariate prognostic analysis of the IMPACT database on traumatic brain injury. Journal of Neurotrauma. 2007, 24: 251-258. 10.1089/neu.2006.0026.CrossRefPubMed McHugh GS, Butcher I, Steyerberg EW, Lu J, Mushkudiani NA, Marmarou A, Maas AI, Murray GD: Statistical approaches to the univariate prognostic analysis of the IMPACT database on traumatic brain injury. Journal of Neurotrauma. 2007, 24: 251-258. 10.1089/neu.2006.0026.CrossRefPubMed
11.
go back to reference Goldstein H: Multilevel Statistical Models. 1995, London, Edward Arnold, 2 Goldstein H: Multilevel Statistical Models. 1995, London, Edward Arnold, 2
12.
go back to reference Snijders T, Bosker R: Multilevel Analysis: An Introduction to Basic and Advanced Multilevel Modeling. 1999, London, Sage Publications Snijders T, Bosker R: Multilevel Analysis: An Introduction to Basic and Advanced Multilevel Modeling. 1999, London, Sage Publications
13.
go back to reference Rodriguez G, Elo I: Intra-class correlation in random-effects models for binary data. The Stata Journal. 2003, 3: 32-46. Rodriguez G, Elo I: Intra-class correlation in random-effects models for binary data. The Stata Journal. 2003, 3: 32-46.
14.
go back to reference Pendergast JF, Gange SJ, Newton MA, Lindstrom MJ, Palta M, Fisher MR: A survey of methods for analyzing clustered binary response data. International Statistical Review. 1991, 64: 89-118.CrossRef Pendergast JF, Gange SJ, Newton MA, Lindstrom MJ, Palta M, Fisher MR: A survey of methods for analyzing clustered binary response data. International Statistical Review. 1991, 64: 89-118.CrossRef
16.
go back to reference Rabe-Hesketh S, Skrondal A, Pickles A: GLLAMM Manual. 2004 Rabe-Hesketh S, Skrondal A, Pickles A: GLLAMM Manual. 2004
17.
go back to reference The GLIMMIX procedure. SAS/STAT User's Guide. 2009, Version 9.2 The GLIMMIX procedure. SAS/STAT User's Guide. 2009, Version 9.2
18.
go back to reference The NLMIXED procedure. SAS/STAT User's Guide. 2009, Version 9.2 The NLMIXED procedure. SAS/STAT User's Guide. 2009, Version 9.2
19.
go back to reference Rasbash J, Steele F, Browne WJ, Goldstein H: A User's Guide to MLwiN. 2004, version 2.10 Rasbash J, Steele F, Browne WJ, Goldstein H: A User's Guide to MLwiN. 2004, version 2.10
20.
go back to reference Donald H, Robert DG: MIXOR: a computer program for mixed-effects ordinal regression analysis. Computer Methods and Programs in Biomedicine. 1996, 49: 157-176. 10.1016/0169-2607(96)01720-8.CrossRef Donald H, Robert DG: MIXOR: a computer program for mixed-effects ordinal regression analysis. Computer Methods and Programs in Biomedicine. 1996, 49: 157-176. 10.1016/0169-2607(96)01720-8.CrossRef
21.
go back to reference Lesaffre E, Spiessens B: On the effect of the number of quadrature point in a logistic random-effects model: an example. Applied Statistics. 2001, 50: 325-335. Lesaffre E, Spiessens B: On the effect of the number of quadrature point in a logistic random-effects model: an example. Applied Statistics. 2001, 50: 325-335.
22.
go back to reference Goldstein H: Restricted (unbiased) Iterative Generalised Least Squares Estimation. Biometrika. 1989, 76: 622-623. 10.1093/biomet/76.3.622.CrossRef Goldstein H: Restricted (unbiased) Iterative Generalised Least Squares Estimation. Biometrika. 1989, 76: 622-623. 10.1093/biomet/76.3.622.CrossRef
23.
go back to reference Rodríguez G, Goldman N: An assessment of estimation procedures for multilevel models with binary responses. Journal of the Royal Statistical Society, Series A. 1995, 158: 73-89.CrossRef Rodríguez G, Goldman N: An assessment of estimation procedures for multilevel models with binary responses. Journal of the Royal Statistical Society, Series A. 1995, 158: 73-89.CrossRef
24.
go back to reference Lin X, Breslow NE: Bias correction in generalised linear mixed models with multiple components of dispersion. Journal of the American Statistical Association. 1996, 91: 1007-1016. 10.2307/2291720.CrossRef Lin X, Breslow NE: Bias correction in generalised linear mixed models with multiple components of dispersion. Journal of the American Statistical Association. 1996, 91: 1007-1016. 10.2307/2291720.CrossRef
25.
go back to reference Ng ESW, Carpenter JR, Goldstein H, Rasbash J: Estimation in generalised linear mixed models with binary outcomes by simulated maximum likelihood. Statistical Modelling. 2006, 6: 23-42. 10.1191/1471082X06st106oa.CrossRef Ng ESW, Carpenter JR, Goldstein H, Rasbash J: Estimation in generalised linear mixed models with binary outcomes by simulated maximum likelihood. Statistical Modelling. 2006, 6: 23-42. 10.1191/1471082X06st106oa.CrossRef
26.
go back to reference Spiegelhalter D, Thomas A, Best N, Lunn D: WinBUGS User Manual. 2007, version 1.4.3 Spiegelhalter D, Thomas A, Best N, Lunn D: WinBUGS User Manual. 2007, version 1.4.3
27.
go back to reference Browne W: MCMC estimation in MLwiN. 2009, version 2.13 Browne W: MCMC estimation in MLwiN. 2009, version 2.13
28.
go back to reference Hadfield J: MCMC methods for Multi-response Generalised Linear Mixed Models: The MCMCglmm R Package. Journal of Statistical Software. 2010, 33: 1-22.CrossRef Hadfield J: MCMC methods for Multi-response Generalised Linear Mixed Models: The MCMCglmm R Package. Journal of Statistical Software. 2010, 33: 1-22.CrossRef
29.
go back to reference The MCMC procedure. SAS/STAT User's Guide. 2009, Version 9.2 The MCMC procedure. SAS/STAT User's Guide. 2009, Version 9.2
30.
go back to reference Brooks SP, Gelman A: Alternative methods for monitoring convergence of iterative simulations. Journal of Computational and Graphical Statistics. 1998, 7: 434-455. 10.2307/1390675. Brooks SP, Gelman A: Alternative methods for monitoring convergence of iterative simulations. Journal of Computational and Graphical Statistics. 1998, 7: 434-455. 10.2307/1390675.
31.
go back to reference Gelman A, Carlin JB, Stern HS, Rubin DB: Bayesian Data Analysis. 2003, New York, Chapman & Hall/CRC, [Chapter 11.6 Inference and assessing convergence], 2 Gelman A, Carlin JB, Stern HS, Rubin DB: Bayesian Data Analysis. 2003, New York, Chapman & Hall/CRC, [Chapter 11.6 Inference and assessing convergence], 2
33.
go back to reference Browne W, Draper D: A comparison of Bayesian and likelihood-based methods for fitting multilevel models. Bayesian Analysis. 2006, 1: 473-514.CrossRef Browne W, Draper D: A comparison of Bayesian and likelihood-based methods for fitting multilevel models. Bayesian Analysis. 2006, 1: 473-514.CrossRef
34.
go back to reference Gelman A: Prior distributions for variance parameters in hierarchical models. Bayesian Analysis. 2006, 1: 515-533.CrossRef Gelman A: Prior distributions for variance parameters in hierarchical models. Bayesian Analysis. 2006, 1: 515-533.CrossRef
35.
go back to reference Spiegelhalter DJ, Abrams KR, Myles JP: Bayesian approaches to clinical trials and health care evaluation. 2004, New York, Wiley Spiegelhalter DJ, Abrams KR, Myles JP: Bayesian approaches to clinical trials and health care evaluation. 2004, New York, Wiley
Metadata
Title
Logistic random effects regression models: a comparison of statistical packages for binary and ordinal outcomes
Authors
Baoyue Li
Hester F Lingsma
Ewout W Steyerberg
Emmanuel Lesaffre
Publication date
01-12-2011
Publisher
BioMed Central
Published in
BMC Medical Research Methodology / Issue 1/2011
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/1471-2288-11-77

Other articles of this Issue 1/2011

BMC Medical Research Methodology 1/2011 Go to the issue