Skip to main content
Top
Published in: BMC Medical Research Methodology 1/2021

Open Access 01-12-2021 | Research article

Revisiting methods for modeling longitudinal and survival data: Framingham Heart Study

Authors: Julius S. Ngwa, Howard J. Cabral, Debbie M. Cheng, David R. Gagnon, Michael P. LaValley, L. Adrienne Cupples

Published in: BMC Medical Research Methodology | Issue 1/2021

Login to get access

Abstract

Background

Statistical methods for modeling longitudinal and time-to-event data has received much attention in medical research and is becoming increasingly useful. In clinical studies, such as cancer and AIDS, longitudinal biomarkers are used to monitor disease progression and to predict survival. These longitudinal measures are often missing at failure times and may be prone to measurement errors. More importantly, time-dependent survival models that include the raw longitudinal measurements may lead to biased results. In previous studies these two types of data are frequently analyzed separately where a mixed effects model is used for the longitudinal data and a survival model is applied to the event outcome.

Methods

In this paper we compare joint maximum likelihood methods, a two-step approach and a time dependent covariate method that link longitudinal data to survival data with emphasis on using longitudinal measures to predict survival. We apply a Bayesian semi-parametric joint method and maximum likelihood joint method that maximizes the joint likelihood of the time-to-event and longitudinal measures. We also implement the Two-Step approach, which estimates random effects separately, and a classic Time Dependent Covariate Model. We use simulation studies to assess bias, accuracy, and coverage probabilities for the estimates of the link parameter that connects the longitudinal measures to survival times.

Results

Simulation results demonstrate that the Two-Step approach performed best at estimating the link parameter when variability in the longitudinal measure is low but is somewhat biased downwards when the variability is high. Bayesian semi-parametric and maximum likelihood joint methods yield higher link parameter estimates with low and high variability in the longitudinal measure. The Time Dependent Covariate method resulted in consistent underestimation of the link parameter. We illustrate these methods using data from the Framingham Heart Study in which lipid measurements and Myocardial Infarction data were collected over a period of 26 years.

Conclusions

Traditional methods for modeling longitudinal and survival data, such as the time dependent covariate method, that use the observed longitudinal data, tend to provide downwardly biased estimates. The two-step approach and joint models provide better estimates, although a comparison of these methods may depend on the underlying residual variance.
Appendix
Available only for authorised users
Literature
1.
go back to reference Prentice R. Covariate measurement errors and parameter estimation in a failure time regression model. Biometrika. 1982;69:331–42.CrossRef Prentice R. Covariate measurement errors and parameter estimation in a failure time regression model. Biometrika. 1982;69:331–42.CrossRef
2.
go back to reference Tsiatis A, Davidian M. Joint modeling of longitudinal and time-to-event data: an overview. Stat Sin. 2004;14(3):809-34. Tsiatis A, Davidian M. Joint modeling of longitudinal and time-to-event data: an overview. Stat Sin. 2004;14(3):809-34.
3.
go back to reference Ngwa JS, Cabral HJ, Cheng DM, et al. A comparison of time dependent Cox regression, pooled logistic regression and cross sectional pooling with simulations and an application to the Framingham heart study. BMC Med Res Methodol. 2016;16:148.CrossRef Ngwa JS, Cabral HJ, Cheng DM, et al. A comparison of time dependent Cox regression, pooled logistic regression and cross sectional pooling with simulations and an application to the Framingham heart study. BMC Med Res Methodol. 2016;16:148.CrossRef
4.
go back to reference Brown ER, Ibrahim JG. A Bayesian semiparametric joint hierarchical model for longitudinal and survival data. Biometrics. 2003;59:221–8.CrossRef Brown ER, Ibrahim JG. A Bayesian semiparametric joint hierarchical model for longitudinal and survival data. Biometrics. 2003;59:221–8.CrossRef
5.
go back to reference Zeng D, Cai J. Simultaneous modelling of survival and longitudinal data with an application to repeated quality of life measures. Lifetime Data Anal. 2005;11:151–74.CrossRef Zeng D, Cai J. Simultaneous modelling of survival and longitudinal data with an application to repeated quality of life measures. Lifetime Data Anal. 2005;11:151–74.CrossRef
6.
go back to reference Tseng Y-K, Hsieh F, Wang J-L. Joint modelling of accelerated failure time and longitudinal data. Biometrika. 2005;92:587–603.CrossRef Tseng Y-K, Hsieh F, Wang J-L. Joint modelling of accelerated failure time and longitudinal data. Biometrika. 2005;92:587–603.CrossRef
7.
go back to reference Ye W, Lin X, Taylor JM. Semiparametric modeling of longitudinal measurements and time-to-event data–a two-stage regression calibration approach. Biometrics. 2008;64:1238–46.CrossRef Ye W, Lin X, Taylor JM. Semiparametric modeling of longitudinal measurements and time-to-event data–a two-stage regression calibration approach. Biometrics. 2008;64:1238–46.CrossRef
8.
go back to reference Ibrahim JG, Chu H, Chen LM. Basic concepts and methods for joint models of longitudinal and survival data. J Clin Oncol. 2010;28:2796–801.CrossRef Ibrahim JG, Chu H, Chen LM. Basic concepts and methods for joint models of longitudinal and survival data. J Clin Oncol. 2010;28:2796–801.CrossRef
9.
go back to reference Rizopoulos DJM. An R package for the joint modelling of longitudinal and time-to-event data. Journal of Statistical Software (Online). 2010;35:1–33. Rizopoulos DJM. An R package for the joint modelling of longitudinal and time-to-event data. Journal of Statistical Software (Online). 2010;35:1–33.
10.
go back to reference Robins J, Tsiatis AA. Semiparametric estimation of an accelerated failure time model with time-dependent covariates. Biometrika. 1992;79:311–9. Robins J, Tsiatis AA. Semiparametric estimation of an accelerated failure time model with time-dependent covariates. Biometrika. 1992;79:311–9.
11.
go back to reference De De Gruttola V, Tu XM. Modelling progression of CD4-lymphocyte count and its relationship to survival time. Biometrics. 1994;50(4):1003-14. De De Gruttola V, Tu XM. Modelling progression of CD4-lymphocyte count and its relationship to survival time. Biometrics. 1994;50(4):1003-14.
12.
go back to reference Tsiatis A, Degruttola V, Wulfsohn M. Modeling the relationship of survival to longitudinal data measured with error. Applications to survival and CD4 counts in patients with AIDS. J Am Stat Assoc. 1995;90:27–37.CrossRef Tsiatis A, Degruttola V, Wulfsohn M. Modeling the relationship of survival to longitudinal data measured with error. Applications to survival and CD4 counts in patients with AIDS. J Am Stat Assoc. 1995;90:27–37.CrossRef
13.
go back to reference Lavalley MP, Degruttola V. Models for empirical Bayes estimators of longitudinal CD4 counts. Stat Med. 1996;15:2289–305.CrossRef Lavalley MP, Degruttola V. Models for empirical Bayes estimators of longitudinal CD4 counts. Stat Med. 1996;15:2289–305.CrossRef
14.
go back to reference Faucett CL, Thomas DC. Simultaneously modelling censored survival data and repeatedly measured covariates: a Gibbs sampling approach. Stat Med. 1996;15:1663–85.CrossRef Faucett CL, Thomas DC. Simultaneously modelling censored survival data and repeatedly measured covariates: a Gibbs sampling approach. Stat Med. 1996;15:1663–85.CrossRef
15.
go back to reference Wulfsohn MS, Tsiatis AA. A joint model for survival and longitudinal data measured with error. Biometrics. 1997;53(1):330-9. Wulfsohn MS, Tsiatis AA. A joint model for survival and longitudinal data measured with error. Biometrics. 1997;53(1):330-9.
16.
go back to reference Wang Y, Taylor JMG. Jointly modeling longitudinal and event time data with application to acquired immunodeficiency syndrome. J Am Stat Assoc. 2001;96:895–905.CrossRef Wang Y, Taylor JMG. Jointly modeling longitudinal and event time data with application to acquired immunodeficiency syndrome. J Am Stat Assoc. 2001;96:895–905.CrossRef
17.
go back to reference Xu J, Zeger SL. Joint analysis of longitudinal data comprising repeated measures and times to events. J R Stat Soc Ser C (Applied Statistics). 2001;50:375–87.CrossRef Xu J, Zeger SL. Joint analysis of longitudinal data comprising repeated measures and times to events. J R Stat Soc Ser C (Applied Statistics). 2001;50:375–87.CrossRef
18.
go back to reference Sweeting MJ, Thompson SG. Joint modelling of longitudinal and time-to-event data with application to predicting abdominal aortic aneurysm growth and rupture. Biom J. 2011;53:750–63.CrossRef Sweeting MJ, Thompson SG. Joint modelling of longitudinal and time-to-event data with application to predicting abdominal aortic aneurysm growth and rupture. Biom J. 2011;53:750–63.CrossRef
19.
go back to reference Therneau TM and Grambsch PM. Modeling survival data: extending the Cox model. Springer Science & Business Media, 2013. Therneau TM and Grambsch PM. Modeling survival data: extending the Cox model. Springer Science & Business Media, 2013.
21.
go back to reference Cox DR. Models and life-tables regression. JR Stat Soc Ser B. 1972;34:187–220. Cox DR. Models and life-tables regression. JR Stat Soc Ser B. 1972;34:187–220.
22.
go back to reference Laird NM, Ware JH. Random-effects models for longitudinal data. Biometrics. 1982;38(4):963-74. Laird NM, Ware JH. Random-effects models for longitudinal data. Biometrics. 1982;38(4):963-74.
23.
go back to reference Little RJA, Rubin DB. Statistical analysis with missing data. Vol. 793. Wiley; 2019. Little RJA, Rubin DB. Statistical analysis with missing data. Vol. 793. Wiley; 2019.
24.
go back to reference Piessens R, de Doncker-Kapenga E, Überhuber CW and Kahaner DK. QUADPACK: a subroutine package for automatic integration. Springer Science & Business Media, 2012. Piessens R, de Doncker-Kapenga E, Überhuber CW and Kahaner DK. QUADPACK: a subroutine package for automatic integration. Springer Science & Business Media, 2012.
25.
go back to reference Herzet C, Wautelet X, Ramon V, Vandendorpe L. Iterative synchronization: EM algorithm versus Newton-Raphson method. In: Acoustics, Speech and Signal Processing, 2006 ICASSP 2006 Proceedings 2006 IEEE International Conference on: IEEE; 2006. p. IV. Herzet C, Wautelet X, Ramon V, Vandendorpe L. Iterative synchronization: EM algorithm versus Newton-Raphson method. In: Acoustics, Speech and Signal Processing, 2006 ICASSP 2006 Proceedings 2006 IEEE International Conference on: IEEE; 2006. p. IV.
26.
go back to reference Venables WN, Smith DM. The R development core team. Vienna, Austria: An Introduction to R R Foundation for Statistical Computing; 2006. Venables WN, Smith DM. The R development core team. Vienna, Austria: An Introduction to R R Foundation for Statistical Computing; 2006.
27.
go back to reference Lunn DJ, Thomas A, Best N, Spiegelhalter D. WinBUGS-a Bayesian modelling framework: concepts, structure, and extensibility. Stat Comput. 2000;10:325–37.CrossRef Lunn DJ, Thomas A, Best N, Spiegelhalter D. WinBUGS-a Bayesian modelling framework: concepts, structure, and extensibility. Stat Comput. 2000;10:325–37.CrossRef
28.
go back to reference Cox DR, Oakes D. Analysis of survival data. Vol. 21. Germany: Taylor & Francis; 1984. Cox DR, Oakes D. Analysis of survival data. Vol. 21. Germany: Taylor & Francis; 1984.
30.
go back to reference Burton A, Altman DG, Royston P, Holder RL. The design of simulation studies in medical statistics. Stat Med. 2006;25:4279–92.CrossRef Burton A, Altman DG, Royston P, Holder RL. The design of simulation studies in medical statistics. Stat Med. 2006;25:4279–92.CrossRef
31.
go back to reference Dafni UG, Tsiatis AA. Evaluating surrogate markers of clinical outcome when measured with error. Biometrics. 1998;54(4):1445-62. Dafni UG, Tsiatis AA. Evaluating surrogate markers of clinical outcome when measured with error. Biometrics. 1998;54(4):1445-62.
32.
go back to reference Bender R, Augustin T, Blettner M. Generating survival times to simulate Cox proportional hazards models. Stat Med. 2005;24:1713–23.CrossRef Bender R, Augustin T, Blettner M. Generating survival times to simulate Cox proportional hazards models. Stat Med. 2005;24:1713–23.CrossRef
Metadata
Title
Revisiting methods for modeling longitudinal and survival data: Framingham Heart Study
Authors
Julius S. Ngwa
Howard J. Cabral
Debbie M. Cheng
David R. Gagnon
Michael P. LaValley
L. Adrienne Cupples
Publication date
01-12-2021
Publisher
BioMed Central
Published in
BMC Medical Research Methodology / Issue 1/2021
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/s12874-021-01207-y

Other articles of this Issue 1/2021

BMC Medical Research Methodology 1/2021 Go to the issue