Skip to main content
Top
Published in: BMC Medical Research Methodology 1/2019

Open Access 01-12-2019 | Technical advance

Heckman-type selection models to obtain unbiased estimates with missing measures outcome: theoretical considerations and an application to missing birth weight data

Authors: Siaka Koné, Bassirou Bonfoh, Daouda Dao, Inza Koné, Günther Fink

Published in: BMC Medical Research Methodology | Issue 1/2019

Login to get access

Abstract

Background

In low-income settings, key outcomes such as biomarkers or clinical assessments are often missing for a substantial proportion of the study population. The aim of this study was to assess the extent to which Heckman-type selection models can create unbiased estimates in such settings.

Methods

We introduce the basic Heckman model in a first stage, and then use simulation models to compare the performance of the model to alternative approaches used in the literature for missing outcome data, including complete case analysis (CCA), multiple imputations by chained equations (MICE) and pattern imputation with delta adjustment (PIDA). Last, we use a large population-representative data set on antenatal supplementation (AS) and birth outcomes from Côte d’Ivoire to illustrate the empirical relevance of this method.

Results

All models performed well when data were missing at random. When missingness in the outcome data was related to unobserved determinants of the outcome, large and systematic biases were found for CCA and MICE, while Heckman-style selection models yielded unbiased estimates. Using Heckman-type selection models to correct for missingness in our empirical application, we found supplementation effect sizes that were very close to those reported in the most recent systematic review of clinical AS trials.

Conclusion

Missingness in health outcome can lead to substantial bias. Heckman-selection models can correct for this selection bias and yield unbiased estimates, even when the proportion of missing data is substantial.
Appendix
Available only for authorised users
Literature
1.
go back to reference Britton A, McKee M, Black N, McPherson K, Sanderson C. Choosing between randomised and non-randomised studies: a systematic review. Health Technol Assess. 1998;2(13), pp. i–iv, 1-124. Britton A, McKee M, Black N, McPherson K, Sanderson C. Choosing between randomised and non-randomised studies: a systematic review. Health Technol Assess. 1998;2(13), pp. i–iv, 1-124.
2.
go back to reference Black N. Why we need observational studies to evaluate the effectiveness of health care. BMJ. 1996;312:1215–8.CrossRef Black N. Why we need observational studies to evaluate the effectiveness of health care. BMJ. 1996;312:1215–8.CrossRef
3.
go back to reference Benson K, Hartz AJ. A comparison of observational and randomized controlled trials. N Engl J Med. 2000;342:1878–86.CrossRef Benson K, Hartz AJ. A comparison of observational and randomized controlled trials. N Engl J Med. 2000;342:1878–86.CrossRef
4.
go back to reference Crawford SL, Tennstedr SL, Mckinlay JB. A comparison of analysis methods for non-random missingness of outcome data. J Clin Epidemiol. 1995;48:209–19.CrossRef Crawford SL, Tennstedr SL, Mckinlay JB. A comparison of analysis methods for non-random missingness of outcome data. J Clin Epidemiol. 1995;48:209–19.CrossRef
5.
go back to reference Donders AR, van der Heijden GJ, Stijnen T, Moons KG. Review: a gentle introduction to imputation of missing values. J Clin Epidemiol. 2006;59:1087–91.CrossRef Donders AR, van der Heijden GJ, Stijnen T, Moons KG. Review: a gentle introduction to imputation of missing values. J Clin Epidemiol. 2006;59:1087–91.CrossRef
6.
go back to reference Ratitch B, O'Kelly M, Tosiello R. Missing data in clinical trials: from clinical assumptions to statistical analysis using pattern mixture models. Pharm Stat. 2013;12:337–47.CrossRef Ratitch B, O'Kelly M, Tosiello R. Missing data in clinical trials: from clinical assumptions to statistical analysis using pattern mixture models. Pharm Stat. 2013;12:337–47.CrossRef
7.
go back to reference Heckman J. Sample selection bias as a specification error. Econometrica. 1979;47:153–61.CrossRef Heckman J. Sample selection bias as a specification error. Econometrica. 1979;47:153–61.CrossRef
8.
go back to reference Brämer GR. International statistical classification of diseases and related health problems. Tenth revision. World Health Stat Q. 1988;41:32–6. Brämer GR. International statistical classification of diseases and related health problems. Tenth revision. World Health Stat Q. 1988;41:32–6.
9.
go back to reference Barker DJP. Fetal and infant origins of disease. London: BMJ Books; 1992. Barker DJP. Fetal and infant origins of disease. London: BMJ Books; 1992.
11.
go back to reference Imdad A, Bhutta ZA. Routine iron/folate supplementation during pregnancy: effect on maternal anaemia and birth outcomes. Paediatr Perinat Epidemiol. 2012;26:168–77.CrossRef Imdad A, Bhutta ZA. Routine iron/folate supplementation during pregnancy: effect on maternal anaemia and birth outcomes. Paediatr Perinat Epidemiol. 2012;26:168–77.CrossRef
13.
go back to reference Balarajan Y, Subramanian SV, Fawzi WW. Maternal iron and folic acid supplementation is associated with lower risk of low birth weight in India. The Journal of nutrition. 2013;143:1309-1315CrossRef Balarajan Y, Subramanian SV, Fawzi WW. Maternal iron and folic acid supplementation is associated with lower risk of low birth weight in India. The Journal of nutrition. 2013;143:1309-1315CrossRef
14.
go back to reference Palma S, Perez-Iglesias R, Prieto D, et al. Iron but not folic acid supplementation reduces the risk of low birthweight in pregnant women without anaemia: a case–control study. J Epidemiol Community Health. 2008;62:120–4.CrossRef Palma S, Perez-Iglesias R, Prieto D, et al. Iron but not folic acid supplementation reduces the risk of low birthweight in pregnant women without anaemia: a case–control study. J Epidemiol Community Health. 2008;62:120–4.CrossRef
15.
go back to reference Miranda A, Rabe-Hesketh S. Maximum likelihood estimation of endogenous switching and sample selection models for binary, ordinal, and count variables. Stata J. 2006;6:285–308.CrossRef Miranda A, Rabe-Hesketh S. Maximum likelihood estimation of endogenous switching and sample selection models for binary, ordinal, and count variables. Stata J. 2006;6:285–308.CrossRef
16.
go back to reference Davidson RG, Shea R, Kiersten J, Eldaw S, Adam W, Agbessi A. Socio-economic differences in health, nutrition, and population within developing countries. Washington DC: The World Bank, 20433; 2007. p. 1–4. Davidson RG, Shea R, Kiersten J, Eldaw S, Adam W, Agbessi A. Socio-economic differences in health, nutrition, and population within developing countries. Washington DC: The World Bank, 20433; 2007. p. 1–4.
18.
go back to reference Royston P. ICE: Stata module for multiple imputation of missing values; 2006. Statistical Software Components S446602, Boston College Department of Economics, revised 25 Oct 2014 Royston P. ICE: Stata module for multiple imputation of missing values; 2006. Statistical Software Components S446602, Boston College Department of Economics, revised 25 Oct 2014
19.
go back to reference Koné S, Baikoro N, N'Guessan Y, Jaeger FN, Silué KD, Fürst T, et al. Health & Demographic Surveillance System Profile: the Taabo health and demographic surveillance system, Côte d'Ivoire. Int J Epidemiol. 2015;44:87–97. Koné S, Baikoro N, N'Guessan Y, Jaeger FN, Silué KD, Fürst T, et al. Health & Demographic Surveillance System Profile: the Taabo health and demographic surveillance system, Côte d'Ivoire. Int J Epidemiol. 2015;44:87–97.
20.
go back to reference Koné S, Fürst T, Jaeger FN, Esso EL, Baikoro N, Kouadio KA, et al. Causes of death in the Taabo health and demographic surveillance system, Côte d'Ivoire, from 2009 to 2011. Glob Health Action. 2015;8:27271.CrossRef Koné S, Fürst T, Jaeger FN, Esso EL, Baikoro N, Kouadio KA, et al. Causes of death in the Taabo health and demographic surveillance system, Côte d'Ivoire, from 2009 to 2011. Glob Health Action. 2015;8:27271.CrossRef
22.
go back to reference Phillips JF, Macleod BB, Pence B. The household registration system: computer software for the rapid dissemination of demographic surveillance systems. Demogr Res. 2000;2:1–40. Phillips JF, Macleod BB, Pence B. The household registration system: computer software for the rapid dissemination of demographic surveillance systems. Demogr Res. 2000;2:1–40.
23.
go back to reference McGovern ME, Bärnighausen T, Marra G, Radice R. On the assumption of bivariate normality in selection models: a copula approach applied to estimating HIV prevalence. Epidimiology. 2015;26(2):229–37.CrossRef McGovern ME, Bärnighausen T, Marra G, Radice R. On the assumption of bivariate normality in selection models: a copula approach applied to estimating HIV prevalence. Epidimiology. 2015;26(2):229–37.CrossRef
24.
go back to reference Newey WK. Two-step series estimation of sample selection models. The Econometrics Journal. 2009;12(s1):S217–29.CrossRef Newey WK. Two-step series estimation of sample selection models. The Econometrics Journal. 2009;12(s1):S217–29.CrossRef
25.
go back to reference Mishra V, Thapa S, Retherford RD, Dai X. Effect of iron supplementation during pregnancy on birthweight: evidence from Zimbabwe. Food Nutr Bull. 2005;26:338–47.CrossRef Mishra V, Thapa S, Retherford RD, Dai X. Effect of iron supplementation during pregnancy on birthweight: evidence from Zimbabwe. Food Nutr Bull. 2005;26:338–47.CrossRef
26.
go back to reference Peña-Rosas JP, De-Regil LM, Garcia-Casal MN, Dowswell T. Daily oral iron supplementation during pregnancy. Cochrane Database Syst Rev. 2015;7:1–544. Peña-Rosas JP, De-Regil LM, Garcia-Casal MN, Dowswell T. Daily oral iron supplementation during pregnancy. Cochrane Database Syst Rev. 2015;7:1–544.
Metadata
Title
Heckman-type selection models to obtain unbiased estimates with missing measures outcome: theoretical considerations and an application to missing birth weight data
Authors
Siaka Koné
Bassirou Bonfoh
Daouda Dao
Inza Koné
Günther Fink
Publication date
01-12-2019
Publisher
BioMed Central
Published in
BMC Medical Research Methodology / Issue 1/2019
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/s12874-019-0840-7

Other articles of this Issue 1/2019

BMC Medical Research Methodology 1/2019 Go to the issue