Program Impact Estimation with Binary Outcome Variables: Monte Carlo Results for Alternative Estimators and Empirical Examples

Guilkey, David K.; Lance, Peter M.

doi:10.1007/978-1-4899-8008-3_2

David K. Guilkey³ &
Peter M. Lance⁴

22 Citations

Abstract

A frequent challenge in program impact estimation, and causal modeling more generally, is estimation of the effect of a binary endogenous variable on a binary outcome of interest. We report results from Monte Carlo experiments designed to assess the performance of estimators frequently applied in this circumstance. Many rely on an instrumental variables identification strategy and in those instances our central interest is the overidentified case. Even when identification is technically achieved by functional form, it is widely perceived that instruments generate more credible identification. Our focus is on widely used models available in the popular STATA statistical software package, but we also evaluate a semi-parametric instrumental variables random effects model not yet available in STATA. The parameters of interest in these experiments are program impact, test statistics assessing endogeneity and overidentification tests. We consider performance under alternative behavioral circumstances by varying distributional assumptions for unobservables, instrument strength levels, sample sizes, and impact magnitudes. Some models turn in a somewhat disappointing performance. Those that rely on joint normality for identification are not particularly robust to error misspecification, raising questions about whether they should be preferred to the semi-parametric estimator (regardless of comparative ease of estimation) or even to simple single equation models that ignore endogeneity. We provide examples of the methods using data from Bangladesh and Tanzania.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

eBook: USD 16.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Hardcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

1.
The authors are currently writing STATA commands to implement this estimator.
2.
We did consider predictor substitution schemes as well but, as expected, they performed poorly and we do not include them in the comparisons.
3.
That is, the program enrollment prevalence within the sample.
4.
We are grateful to Stas Kolenikov for generously sharing a STATA.ado file that he wrote implementing that Vale and Maurelli (1983) procedure.
5.
Experimentation suggests that variation in the values assigned to these coefficient terms had very little impact on the statistics of interest in this study.
6.
Step 1 was actually slightly more involved. It became apparent in early rounds of experiments that some behavioral parameters, particularly instrument strength, occasionally varied across replications to a degree with which the authors were not comfortable. In particular, the various replications from experiments involving first stage χ ² statistics with target values of 15 and 25 occasionally produced overlapping ranges for the χ ² statistic values actually generated across the replications for the two experiments. This muddied the waters somewhat for the purposes of making inferences about estimator performance differentials as instrument strength varied. To address this, we set tolerance bands for acceptable variation of such χ ² values around their target for a given experiment. If, on a particular replication, a draw {ε ₁, ε ₂} resulted in a χ ² value outside of the tolerance range for that experiment, that draw was discarded and a new draw {ε ₁, ε ₂} was made. This was done to insure that the replications within an experiment conformed to an acceptable degree to the parameters of that experiment.
7.
As explained in Sect. 2.3, the behavioral parameters are imposed by the design of the data generating process for each experiment and included the: program effect (\(Pr(Y _{2}\vert X,Y _{1} = 1) - Pr(Y _{2}\vert X,Y _{1} = 0)\)); correlation of the errors {ε ₁, ε ₂}; average of the program outcome (Y ₁) within the sample; average of the outcome of interest (Y ₂) within the sample; first stage strength of the instruments Z to explain Y ₁ (as reflected in the χ ² statistic emerging from a test of the joint significance of those instruments); and bivariate error type (i.e. normal or a non-normal errors).
8.
Recall that the overidentification test statistic for the bivariate probit model is simply the χ ² statistic for a test of the joint significance of the instruments in the marginal probit equation for Y ₂ under the “just identified” specification under which the instruments appear in both marginal probit equations and identification rests on nonlinearity from functional form (i.e. joint normality) alone. The null hypothesis of such a test is that the instruments are not jointly significant regressors in marginal probit equation for Y ₂ (i.e. that they are legitimately excluded from the marginal probit equation for Y ₂).
9.
We refer to the Wu-Hausman test (Wu 1974; Hausman 1978) simply as “Wu” in Tables 2.24 and 2.25.

References

Anderson T, Rubin H (1950) The asymptotic properties of estimates of the parameters of a single equation in a complete system of stochastic equations. Ann Math Stat 21:570–582
Article Google Scholar
Angrist J, Krueger A (2001) Instrumental variables and the search for identification: from supply and demand to natural experiments. J Econ Perspect 15:69–85
Article Google Scholar
Angrist J, Pischke J (2009) Mostly harmless econometrics: an empiricist’s companion. Princeton University Press, Princeton
Google Scholar
Babalola S (2005) Communication, ideation and contraceptive use in Burkina Faso: an application of the propensity score matching method. J Fam Plan Reprod Health Care 31:207–212
Article Google Scholar
Bassman R (1960) On finite sample distributions of generalized classical linear identifiability test statistics. J Am Stat Assoc 55:650–659
Article Google Scholar
Bauman K, Viadro C, Tsui A (1993) Family planning program effects in developing countries: conclusions and related considerations. The evaluation project working paper IM-03-03
Google Scholar
Bollen K, Guilkey D, Mroz T (1995) Binary outcomes and endogenous explanatory variables: tests and solutions with an application to the demand for contraceptive use in Tunisia. Demography 32:111–131
Article Google Scholar
Bound J, Jaeger D, Baker R (1995) Problems with instrumental variables estimation when the correlation between the instruments and the endogenous explanatory variable is weak. J Am Stat Assoc 90:443–450
Google Scholar
Cappellari L, Jenkins S (2003) Multivariate probit regression using simulated maximum likelihood. STATA J 3:278–294
Google Scholar
Chen S, Guilkey D (2003) Determinants of contraceptive method choice in rural Tanzania between 1991 and 1999. Stud Fam Plan 34:263–276
Article Google Scholar
Chiburis RC, Das J, Lokshin M (2011) A practical comparison of the bivariate probit and linear IV estimators. The World Bank Policy research working paper 5601
Google Scholar
Durbin J (1954) Errors in variables. Rev Int Stat Inst 22:23–32
Article Google Scholar
Fleishman A (1978) A method for simulating nonnormal distributions. Psychometrika 43:521–532
Article Google Scholar
Gourieroux C, Monfort A, Renault E, Trognon A (1987) Generalized residuals. J Econom 34:5–32
Article Google Scholar
Guilkey D, Hutchinson P (2011) Overcoming methodological challenges in evaluating health communication campaigns: evidence from rural Bangladesh. Stud Fam Plan 42:93–106
Article Google Scholar
Guilkey D, Mroz T, Taylor L (1992) Estimation and testing in simultaneous equations models with discrete outcomes using cross section data. UNC-CH Department of Economics working paper
Google Scholar
Guilkey D, Hutchinson P, Lance P (2006) Cost effectiveness analysis for health communications programs. J Health Commun 11:47–67
Article Google Scholar
Hansen L (1982) Large sample properties of generalized method of moments estimators. Econometrica 50:1029–1054
Article Google Scholar
Hausman J (1978) Specification tests in econometrics. Econometrica 46:1251–1271
Article Google Scholar
Hayashi F (2000) Econometrics. Princeton University Press, Princeton
Google Scholar
Heckman J, Singer B (1984) A method for minimizing the impact of distributional assumptions in econometric models for duration data. Econometrica 52:271–320
Article Google Scholar
Hutchinson P, Wheeler J (2006). The cost effectiveness of health communication programs: what do we know? J Health Commun 11:7–45
Article Google Scholar
Imbens G, Angrist J (1994) Indentification and estimation of local average treatment effects. Econometrica 62:467–475
Article Google Scholar
Kaiser H, Dickman K (1962) Sample and population score matrices and sample correlation matrices from an arbitrary population correlation matrix. Psychometrika 27:179–182
Article Google Scholar
LaLonde R (1986) Evaluating the econometric evaluations of training programs with experimental data. Am Econ Rev 76:604–620
Google Scholar
Manning W, Duan N, Rogers W (1987) Monte Carlo evidence on the choice between sample selection and two-part models. J Econom 35:59–82
Article Google Scholar
Mwaikambo L, Speizer I, Schurmann A, Morgan G, Fikree F (2011) What works in family planning interventions: a systematic review. Stud Fam Plan 42:67–82
Article Google Scholar
Mroz T (1999) Discrete factor approximations in simultaneous equations models: estimating the impact of a dummy endogenous variable on a continuous outcome. J Econom 92:233–274
Article Google Scholar
Ngallaba S, Kapiga S, Ruyoba I, Boerma J (1993) Tanzania demographic and health survey 1991/1992. Macro International Inc., Columbia
Google Scholar
Rivers D, Vuong Q (1988) Limited information estimators and exogeneity tests for simultaneous probit models. J Econom 39:347–366
Article Google Scholar
Sargon J (1958) The estimation of economic relationships using instrumental variables. Econometrica 26:393–415
Article Google Scholar
Stock J, Staiger D (1997) Instrumental variables regression with weak instruments. Econometrica 65:557–586
Article Google Scholar
Terza J, Basu A, Rathouz P (2008) Two-stage residual inclusion estimation: addressing endogeneity in health econometric modelling. J Health Econ 27:531–543
Article Google Scholar
Vale C, Maurelli V (1983) Simulating multivariate nonnormal distributions. Psychometrika 48:465–471
Article Google Scholar
Wu D (1974) Alternative tests of independence between stochastic regressors and disturbances: finite sample results. Econometrica 42:529–546
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Economics and the Carolina Population Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27514, USA
David K. Guilkey
Carolina Population Center, University of North Carolina at Chapel Hill, Chapel Hill, NC, 27516-3997, USA
Peter M. Lance

Authors

David K. Guilkey
View author publications
You can also search for this author in PubMed Google Scholar
Peter M. Lance
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to David K. Guilkey .

Editor information

Editors and Affiliations

Department of Economics, Rice University, Houston, Texas, USA
Robin C. Sickles
Department of Economics, Syracuse University, Syracuse, New York, USA
William C. Horrace

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Guilkey, D.K., Lance, P.M. (2014). Program Impact Estimation with Binary Outcome Variables: Monte Carlo Results for Alternative Estimators and Empirical Examples. In: Sickles, R., Horrace, W. (eds) Festschrift in Honor of Peter Schmidt. Springer, New York, NY. https://doi.org/10.1007/978-1-4899-8008-3_2

Download citation

DOI: https://doi.org/10.1007/978-1-4899-8008-3_2
Published: 05 February 2014
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4899-8007-6
Online ISBN: 978-1-4899-8008-3
eBook Packages: Business and EconomicsEconomics and Finance (R0)

Publish with us

Policies and ethics