Skip to main content
Top
Published in: Current Anesthesiology Reports 3/2016

01-09-2016 | Research Methods and Statistical Analyses (Y Le Manach, Section Editor)

Causal Inference in Anesthesia and Perioperative Observational Studies

Authors: Tri-Long Nguyen, Audrey Winter, Jessica Spence, Géraldine Leguelinel-Blache, Paul Landais, Yannick Le Manach

Published in: Current Anesthesiology Reports | Issue 3/2016

Login to get access

Abstract

Purpose of Review

Observational studies are of great importance to anesthesia and perioperative care research, as they reflect routine clinical practice. However, because observational data are nonexperimental, assigning causality to identified relationships has a significant risk of bias. After describing the pros and cons of observational studies, we provide an overview of the different methods used to make causal inferences. Of these, we focus on the propensity score analysis, which achieves an increasing popularity in anesthesia and perioperative literature.

Recent Findings

Several methods are proposed for estimating treatment effects in observational studies. Although multivariable regression has traditionally been used to infer causal effects by adjusting for confounding variables, the reported result mainly depends on the model specification that fits the researcher’s hypothesis. Preprocessing observational data can reduce this model dependence, by balancing confounders across the treatment groups like in experimental studies. In particular, the propensity score analysis approximates the randomized controlled trial.

Summary

Compared to randomized experiments, observational studies are low-cost sources of “real-life” data, but they are exposed to bias. Treatment effects can be estimated by using appropriate methods, such as the propensity score analysis, which limits confounding and model-dependence bias. We provide an illustrative example of propensity score analysis using a recently published study, which assessed the outcomes after hip fracture surgery compared with elective total hip replacement.
Literature
1.
go back to reference Guyatt GH, Oxman AD, Kunz R, Vist GE, Falck-Ytter Y, Schunemann HJ, Group GW. What is “quality of evidence” and why is it important to clinicians? BMJ. 2008;336(7651):995–8. Guyatt GH, Oxman AD, Kunz R, Vist GE, Falck-Ytter Y, Schunemann HJ, Group GW. What is “quality of evidence” and why is it important to clinicians? BMJ. 2008;336(7651):995–8.
2.
go back to reference Naylor CD, Guyatt GH. Users’ guides to the medical literature. X. How to use an article reporting variations in the outcomes of health services. The evidence-based medicine working group. JAMA. 1996;275(7):554–8.PubMed Naylor CD, Guyatt GH. Users’ guides to the medical literature. X. How to use an article reporting variations in the outcomes of health services. The evidence-based medicine working group. JAMA. 1996;275(7):554–8.PubMed
3.
go back to reference Feinstein AR. Epidemiologic analyses of causation: the unlearned scientific lessons of randomized trials. J Clin Epidemiol. 1989;42(6):481–489; discussion 499–502.PubMed Feinstein AR. Epidemiologic analyses of causation: the unlearned scientific lessons of randomized trials. J Clin Epidemiol. 1989;42(6):481–489; discussion 499–502.PubMed
4.
go back to reference Altman DG, Bland JM. Statistics notes. Treatment allocation in controlled trials: why randomise? BMJ. 1999;318(7192):1209.PubMedPubMedCentral Altman DG, Bland JM. Statistics notes. Treatment allocation in controlled trials: why randomise? BMJ. 1999;318(7192):1209.PubMedPubMedCentral
5.
go back to reference •• Le Manach Y, Collins G, Bhandari M, Bessissow A, Boddaert J, Khiami F, Chaudhry H, De Beer J, Riou B, Landais P et al. Outcomes after hip fracture surgery compared with elective total hip replacement. JAMA. 2015;314(11):1159–66. In a large cohort of French patients, hip fracture surgery compared with elective total hip replacement is associated with a higher risk of in-hospital mortality after matching on age, sex and measured comorbidities. PubMed •• Le Manach Y, Collins G, Bhandari M, Bessissow A, Boddaert J, Khiami F, Chaudhry H, De Beer J, Riou B, Landais P et al. Outcomes after hip fracture surgery compared with elective total hip replacement. JAMA. 2015;314(11):1159–66. In a large cohort of French patients, hip fracture surgery compared with elective total hip replacement is associated with a higher risk of in-hospital mortality after matching on age, sex and measured comorbidities. PubMed
6.
go back to reference Vascular Events In Noncardiac Surgery Patients Cohort Evaluation Study I, Devereaux PJ, Chan MT, Alonso-Coello P, Walsh M, Berwanger O, Villar JC, Wang CY, Garutti RI, Jacka MJ et al. Association between postoperative troponin levels and 30-day mortality among patients undergoing noncardiac surgery. JAMA. 2012;307(21):2295–304. Vascular Events In Noncardiac Surgery Patients Cohort Evaluation Study I, Devereaux PJ, Chan MT, Alonso-Coello P, Walsh M, Berwanger O, Villar JC, Wang CY, Garutti RI, Jacka MJ et al. Association between postoperative troponin levels and 30-day mortality among patients undergoing noncardiac surgery. JAMA. 2012;307(21):2295–304.
7.
go back to reference Heckman J. Instrumental variables: a study of implicit behavioral assumptions used in making program evaluations. J Hum Resour. 1997;32(3):441–62. Heckman J. Instrumental variables: a study of implicit behavioral assumptions used in making program evaluations. J Hum Resour. 1997;32(3):441–62.
8.
go back to reference Gayat E, Pirracchio R, Resche-Rigon M, Mebazaa A, Mary JY, Porcher R. Propensity scores in intensive care and anaesthesiology literature: a systematic review. Intensive Care Med. 2010;36(12):1993–2003.PubMed Gayat E, Pirracchio R, Resche-Rigon M, Mebazaa A, Mary JY, Porcher R. Propensity scores in intensive care and anaesthesiology literature: a systematic review. Intensive Care Med. 2010;36(12):1993–2003.PubMed
9.
go back to reference Peduzzi P, Concato J, Kemper E, Holford TR, Feinstein AR. A simulation study of the number of events per variable in logistic regression analysis. J Clin Epidemiol. 1996;49(12):1373–9.PubMed Peduzzi P, Concato J, Kemper E, Holford TR, Feinstein AR. A simulation study of the number of events per variable in logistic regression analysis. J Clin Epidemiol. 1996;49(12):1373–9.PubMed
10.
go back to reference Concato J, Peduzzi P, Holford TR, Feinstein AR. Importance of events per independent variable in proportional hazards analysis. I. Background, goals, and general strategy. J Clin Epidemiol. 1995;48(12):1495–501.PubMed Concato J, Peduzzi P, Holford TR, Feinstein AR. Importance of events per independent variable in proportional hazards analysis. I. Background, goals, and general strategy. J Clin Epidemiol. 1995;48(12):1495–501.PubMed
11.
go back to reference Peduzzi P, Concato J, Feinstein AR, Holford TR. Importance of events per independent variable in proportional hazards regression analysis. II. Accuracy and precision of regression estimates. J Clin Epidemiol. 1995;48(12):1503–10.PubMed Peduzzi P, Concato J, Feinstein AR, Holford TR. Importance of events per independent variable in proportional hazards regression analysis. II. Accuracy and precision of regression estimates. J Clin Epidemiol. 1995;48(12):1503–10.PubMed
12.
go back to reference Cepeda MS. Comparison of logistic regression versus propensity score when the number of events is low and there are multiple confounders. Am J Epidemiol. 2003;158(3):280–7.PubMed Cepeda MS. Comparison of logistic regression versus propensity score when the number of events is low and there are multiple confounders. Am J Epidemiol. 2003;158(3):280–7.PubMed
13.
go back to reference Vittinghoff E, McCulloch CE. Relaxing the rule of ten events per variable in logistic and Cox regression. Am J Epidemiol. 2007;165(6):710–8.PubMed Vittinghoff E, McCulloch CE. Relaxing the rule of ten events per variable in logistic and Cox regression. Am J Epidemiol. 2007;165(6):710–8.PubMed
14.
go back to reference Austin PC, Steyerberg EW. The number of subjects per variable required in linear regression analyses. J Clin Epidemiol. 2015;68(6):627–36.PubMed Austin PC, Steyerberg EW. The number of subjects per variable required in linear regression analyses. J Clin Epidemiol. 2015;68(6):627–36.PubMed
15.
go back to reference Austin PC, Tu JV. Automated variable selection methods for logistic regression produced unstable models for predicting acute myocardial infarction mortality. J Clin Epidemiol. 2004;57(11):1138–46.PubMed Austin PC, Tu JV. Automated variable selection methods for logistic regression produced unstable models for predicting acute myocardial infarction mortality. J Clin Epidemiol. 2004;57(11):1138–46.PubMed
16.
go back to reference Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41–55. Rosenbaum PR, Rubin DB. The central role of the propensity score in observational studies for causal effects. Biometrika. 1983;70(1):41–55.
17.
go back to reference Austin PC, Grootendorst P, Anderson GM. A comparison of the ability of different propensity score models to balance measured variables between treated and untreated subjects: a Monte Carlo study. Stat Med. 2007;26(4):734–53.PubMed Austin PC, Grootendorst P, Anderson GM. A comparison of the ability of different propensity score models to balance measured variables between treated and untreated subjects: a Monte Carlo study. Stat Med. 2007;26(4):734–53.PubMed
18.
go back to reference Pearl J. Invited commentary: understanding bias amplification. Am J Epidemiol. 2011;174(11):1223–7; discussion 1228–9.PubMedPubMedCentral Pearl J. Invited commentary: understanding bias amplification. Am J Epidemiol. 2011;174(11):1223–7; discussion 1228–9.PubMedPubMedCentral
19.
go back to reference Westreich D, Cole SR, Funk MJ, Brookhart MA, Sturmer T. The role of the c-statistic in variable selection for propensity score models. Pharmacoepidemiol Drug Saf. 2011;20(3):317–20.PubMed Westreich D, Cole SR, Funk MJ, Brookhart MA, Sturmer T. The role of the c-statistic in variable selection for propensity score models. Pharmacoepidemiol Drug Saf. 2011;20(3):317–20.PubMed
20.
go back to reference •• Ali MS, Groenwold RH, Belitser SV, Pestman WR, Hoes AW, Roes KC, Boer A, Klungel OH. Reporting of covariate selection and balance assessment in propensity score analysis is suboptimal: a systematic review. J Clin Epidemiol. 2015;68(2):112–21. The execution and reporting of propensity score analysis is far from optimal in medical literature. •• Ali MS, Groenwold RH, Belitser SV, Pestman WR, Hoes AW, Roes KC, Boer A, Klungel OH. Reporting of covariate selection and balance assessment in propensity score analysis is suboptimal: a systematic review. J Clin Epidemiol. 2015;68(2):112–21. The execution and reporting of propensity score analysis is far from optimal in medical literature.
21.
go back to reference Setoguchi S, Schneeweiss S, Brookhart MA, Glynn RJ, Cook EF. Evaluating uses of data mining techniques in propensity score estimation: a simulation study. Pharmacoepidemiol Drug Saf. 2008;17(6):546–55.PubMedPubMedCentral Setoguchi S, Schneeweiss S, Brookhart MA, Glynn RJ, Cook EF. Evaluating uses of data mining techniques in propensity score estimation: a simulation study. Pharmacoepidemiol Drug Saf. 2008;17(6):546–55.PubMedPubMedCentral
22.
go back to reference Lee BK, Lessler J, Stuart EA. Improving propensity score weighting using machine learning. Stat Med. 2010;29(3):337–46.PubMedPubMedCentral Lee BK, Lessler J, Stuart EA. Improving propensity score weighting using machine learning. Stat Med. 2010;29(3):337–46.PubMedPubMedCentral
23.
go back to reference Imai K, Ratkovic M. Covariate balancing propensity score. J R Stat Soc B. 2014;76(1):243–63. Imai K, Ratkovic M. Covariate balancing propensity score. J R Stat Soc B. 2014;76(1):243–63.
24.
go back to reference Wyss R, Ellis AR, Brookhart MA, Girman CJ, Jonsson Funk M, LoCasale R, Sturmer T. The role of prediction modeling in propensity score estimation: an evaluation of logistic regression, bCART, and the covariate-balancing propensity score. Am J Epidemiol. 2014;180(6):645–55.PubMedPubMedCentral Wyss R, Ellis AR, Brookhart MA, Girman CJ, Jonsson Funk M, LoCasale R, Sturmer T. The role of prediction modeling in propensity score estimation: an evaluation of logistic regression, bCART, and the covariate-balancing propensity score. Am J Epidemiol. 2014;180(6):645–55.PubMedPubMedCentral
25.
go back to reference Lunceford JK, Davidian M. Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study. Stat Med. 2004;23(19):2937–60.PubMed Lunceford JK, Davidian M. Stratification and weighting via the propensity score in estimation of causal treatment effects: a comparative study. Stat Med. 2004;23(19):2937–60.PubMed
26.
go back to reference • Austin PC. The performance of different propensity score methods for estimating marginal hazard ratios. Statistics in medicine 2013;32(16):2837–49. Matching and inverse probability of treatment weighting methods demonstrate better performances for estimating marginal hazard ratios. • Austin PC. The performance of different propensity score methods for estimating marginal hazard ratios. Statistics in medicine 2013;32(16):2837–49. Matching and inverse probability of treatment weighting methods demonstrate better performances for estimating marginal hazard ratios.
27.
go back to reference Austin PC. The performance of different propensity-score methods for estimating differences in proportions (risk differences or absolute risk reductions) in observational studies. Stat Med. 2010;29(20):2137–48.PubMedPubMedCentral Austin PC. The performance of different propensity-score methods for estimating differences in proportions (risk differences or absolute risk reductions) in observational studies. Stat Med. 2010;29(20):2137–48.PubMedPubMedCentral
28.
go back to reference Austin PC. Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies. Pharm Stat. 2011;10(2):150–61.PubMed Austin PC. Optimal caliper widths for propensity-score matching when estimating differences in means and differences in proportions in observational studies. Pharm Stat. 2011;10(2):150–61.PubMed
29.
go back to reference • Lunt M: Selecting an appropriate caliper can be essential for achieving good balance with propensity score matching. Am J Epidemiol. 2014;179(2):226–35. A tigher caliper width leads to reduced bias and closer matches in propensity score matching analysis. • Lunt M: Selecting an appropriate caliper can be essential for achieving good balance with propensity score matching. Am J Epidemiol. 2014;179(2):226–35. A tigher caliper width leads to reduced bias and closer matches in propensity score matching analysis.
30.
go back to reference Rosenbaum PR, Rubin DB. The bias due to incomplete matching. Biometrics. 1985;41(1):103–16.PubMed Rosenbaum PR, Rubin DB. The bias due to incomplete matching. Biometrics. 1985;41(1):103–16.PubMed
31.
go back to reference Austin PC. Statistical criteria for selecting the optimal number of untreated subjects matched to each treated subject when using many-to-one matching on the propensity score. Am J Epidemiol. 2010;172(9):1092–7.PubMedPubMedCentral Austin PC. Statistical criteria for selecting the optimal number of untreated subjects matched to each treated subject when using many-to-one matching on the propensity score. Am J Epidemiol. 2010;172(9):1092–7.PubMedPubMedCentral
32.
go back to reference • Austin PC. A comparison of 12 algorithms for matching on the propensity score. Stat Med. 2014;33(6):1057–69. Nearest neighbor matching induces the same balance in baseline covariates as does optimal matching. Caliper matching tends to result in estimates of treatment effect with less bias compared with optimal and nearest neighbor matching. PubMedPubMedCentral • Austin PC. A comparison of 12 algorithms for matching on the propensity score. Stat Med. 2014;33(6):1057–69. Nearest neighbor matching induces the same balance in baseline covariates as does optimal matching. Caliper matching tends to result in estimates of treatment effect with less bias compared with optimal and nearest neighbor matching. PubMedPubMedCentral
33.
go back to reference Austin PC. Propensity-score matching in the cardiovascular surgery literature from 2004 to 2006: a systematic review and suggestions for improvement. J Thorac Cardiovasc Surg. 2007;134(5):1128–35.PubMed Austin PC. Propensity-score matching in the cardiovascular surgery literature from 2004 to 2006: a systematic review and suggestions for improvement. J Thorac Cardiovasc Surg. 2007;134(5):1128–35.PubMed
34.
go back to reference Morgan SL, Todd JL. A diagnostic routine for the detection of consequential heterogeneity of causal effects. Sociol Methodol. 2008;38:231–81. Morgan SL, Todd JL. A diagnostic routine for the detection of consequential heterogeneity of causal effects. Sociol Methodol. 2008;38:231–81.
35.
go back to reference Imai K, King G, Stuart EA. Misunderstandings among experimentalists and observationalists about causal inference. J R Stat Soc Ser A (Statistics in Society). 2008;171(2):481–502. Imai K, King G, Stuart EA. Misunderstandings among experimentalists and observationalists about causal inference. J R Stat Soc Ser A (Statistics in Society). 2008;171(2):481–502.
36.
go back to reference Altman DG, Dore CJ. Randomisation and baseline comparisons in clinical trials. Lancet. 1990;335(8682):149–53.PubMed Altman DG, Dore CJ. Randomisation and baseline comparisons in clinical trials. Lancet. 1990;335(8682):149–53.PubMed
37.
go back to reference Assmann SF, Pocock SJ, Enos LE, Kasten LE. Subgroup analysis and other (mis)uses of baseline data in clinical trials. Lancet. 2000;355(9209):1064–9.PubMed Assmann SF, Pocock SJ, Enos LE, Kasten LE. Subgroup analysis and other (mis)uses of baseline data in clinical trials. Lancet. 2000;355(9209):1064–9.PubMed
38.
go back to reference Pocock SJ, Assmann SE, Enos LE, Kasten LE. Subgroup analysis, covariate adjustment and baseline comparisons in clinical trial reporting: current practice and problems. Stat Med. 2002;21(19):2917–30.PubMed Pocock SJ, Assmann SE, Enos LE, Kasten LE. Subgroup analysis, covariate adjustment and baseline comparisons in clinical trial reporting: current practice and problems. Stat Med. 2002;21(19):2917–30.PubMed
39.
go back to reference Senn S. Testing for baseline balance in clinical trials. Stat Med. 1994;13(17):1715–26.PubMed Senn S. Testing for baseline balance in clinical trials. Stat Med. 1994;13(17):1715–26.PubMed
40.
go back to reference Senn SJ. Covariate imbalance and random allocation in clinical trials. Stat Med. 1989;8(4):467–75.PubMed Senn SJ. Covariate imbalance and random allocation in clinical trials. Stat Med. 1989;8(4):467–75.PubMed
41.
go back to reference Normand ST, Landrum MB, Guadagnoli E, Ayanian JZ, Ryan TJ, Cleary PD, McNeil BJ. Validating recommendations for coronary angiography following acute myocardial infarction in the elderly: a matched analysis using propensity scores. J Clin Epidemiol. 2001;54(4):387–98.PubMed Normand ST, Landrum MB, Guadagnoli E, Ayanian JZ, Ryan TJ, Cleary PD, McNeil BJ. Validating recommendations for coronary angiography following acute myocardial infarction in the elderly: a matched analysis using propensity scores. J Clin Epidemiol. 2001;54(4):387–98.PubMed
42.
go back to reference Austin PC. Comparing paired vs non-paired statistical methods of analyses when making inferences about absolute risk reductions in propensity-score matched samples. Stat Med. 2011;30(11):1292–301.PubMedPubMedCentral Austin PC. Comparing paired vs non-paired statistical methods of analyses when making inferences about absolute risk reductions in propensity-score matched samples. Stat Med. 2011;30(11):1292–301.PubMedPubMedCentral
43.
go back to reference Greenland S. Interpretation and choice of effect measures in epidemiologic analyses. Am J Epidemiol. 1987;125(5):761–8.PubMed Greenland S. Interpretation and choice of effect measures in epidemiologic analyses. Am J Epidemiol. 1987;125(5):761–8.PubMed
44.
go back to reference Martens EP, Pestman WR, de Boer A, Belitser SV, Klungel OH. Systematic differences in treatment effect estimates between propensity score methods and logistic regression. Int J Epidemiol. 2008;37(5):1142–7.PubMed Martens EP, Pestman WR, de Boer A, Belitser SV, Klungel OH. Systematic differences in treatment effect estimates between propensity score methods and logistic regression. Int J Epidemiol. 2008;37(5):1142–7.PubMed
45.
go back to reference Austin PC. Absolute risk reductions, relative risks, relative risk reductions, and numbers needed to treat can be obtained from a logistic regression model. J Clin Epidemiol. 2010;63(1):2–6.PubMed Austin PC. Absolute risk reductions, relative risks, relative risk reductions, and numbers needed to treat can be obtained from a logistic regression model. J Clin Epidemiol. 2010;63(1):2–6.PubMed
46.
go back to reference Austin PC. Absolute risk reductions and numbers needed to treat can be obtained from adjusted survival models for time-to-event outcomes. J Clin Epidemiol. 2010;63(1):46–55.PubMed Austin PC. Absolute risk reductions and numbers needed to treat can be obtained from adjusted survival models for time-to-event outcomes. J Clin Epidemiol. 2010;63(1):46–55.PubMed
47.
go back to reference Rubin DB. Using propensity scores to help design observational studies: application to the tobacco litigation. Health Serv Outcomes Res Method. 2001;2:169–88. Rubin DB. Using propensity scores to help design observational studies: application to the tobacco litigation. Health Serv Outcomes Res Method. 2001;2:169–88.
48.
go back to reference Miettinen OS. Stratification by a multivariate confounder score. Am J Epidemiol. 1976;104(6):609–20.PubMed Miettinen OS. Stratification by a multivariate confounder score. Am J Epidemiol. 1976;104(6):609–20.PubMed
49.
go back to reference Sturmer T, Schneeweiss S, Brookhart MA, Rothman KJ, Avorn J, Glynn RJ. Analytic strategies to adjust confounding using exposure propensity scores and disease risk scores: nonsteroidal antiinflammatory drugs and short-term mortality in the elderly. Am J Epidemiol. 2005;161(9):891–8.PubMed Sturmer T, Schneeweiss S, Brookhart MA, Rothman KJ, Avorn J, Glynn RJ. Analytic strategies to adjust confounding using exposure propensity scores and disease risk scores: nonsteroidal antiinflammatory drugs and short-term mortality in the elderly. Am J Epidemiol. 2005;161(9):891–8.PubMed
50.
go back to reference Wyss R, Ellis AR, Brookhart MA, Jonsson Funk M, Girman CJ, Simpson RJ, Jr., Sturmer T. Matching on the disease risk score in comparative effectiveness research of new treatments. Pharmacoepidemiol Drug Saf. 2015;24(9):951–61.PubMedPubMedCentral Wyss R, Ellis AR, Brookhart MA, Jonsson Funk M, Girman CJ, Simpson RJ, Jr., Sturmer T. Matching on the disease risk score in comparative effectiveness research of new treatments. Pharmacoepidemiol Drug Saf. 2015;24(9):951–61.PubMedPubMedCentral
51.
go back to reference Tadrous M, Gagne JJ, Sturmer T, Cadarette SM. Disease risk score as a confounder summary method: systematic review and recommendations. Pharmacoepidemiol Drug Saf. 2013;22(2):122–9.PubMed Tadrous M, Gagne JJ, Sturmer T, Cadarette SM. Disease risk score as a confounder summary method: systematic review and recommendations. Pharmacoepidemiol Drug Saf. 2013;22(2):122–9.PubMed
Metadata
Title
Causal Inference in Anesthesia and Perioperative Observational Studies
Authors
Tri-Long Nguyen
Audrey Winter
Jessica Spence
Géraldine Leguelinel-Blache
Paul Landais
Yannick Le Manach
Publication date
01-09-2016
Publisher
Springer US
Published in
Current Anesthesiology Reports / Issue 3/2016
Electronic ISSN: 2167-6275
DOI
https://doi.org/10.1007/s40140-016-0174-5

Other articles of this Issue 3/2016

Current Anesthesiology Reports 3/2016 Go to the issue

Neuroanesthesia (M Smith, Section Editor)

Postoperative Care of Neurosurgical Patients

Research Methods and Statistical Analyses (Y Le Manach, Section Editor)

Emerging Methodology of Intraoperative Hemodynamic Monitoring Research

Research Methods and Statistical Analyses (Y Le Manach, Section Editor)

Health-Economic Researches in Perioperative Medicine