Published in: European Journal of Epidemiology 7/2015

Open Access 01-07-2015 | METHODS

Using published data in Mendelian randomization: a blueprint for efficient identification of causal risk factors

Authors: Stephen Burgess, Robert A. Scott, Nicholas J. Timpson, George Davey Smith, Simon G. Thompson, EPIC-InterAct Consortium


Abstract

Finding individual-level data for adequately powered Mendelian randomization analyses may be problematic. As publicly available summarized data on genetic associations with disease outcomes from large consortia are becoming more abundant, use of published data is an attractive analysis strategy for obtaining precise estimates of the causal effects of risk factors on outcomes. We detail the necessary steps for conducting Mendelian randomization investigations using published data, and present novel statistical methods for combining data on the associations of multiple (correlated or uncorrelated) genetic variants with the risk factor and outcome into a single causal effect estimate. A two-sample analysis strategy may be employed, in which evidence on the gene-risk factor and gene-outcome associations is taken from different data sources. These approaches allow the efficient identification of risk factors that are suitable targets for clinical intervention from published data, although the ability to assess the assumptions necessary for causal inference is diminished. Methods and guidance are illustrated using the example of the causal effect of serum calcium levels on fasting glucose concentrations. The estimated causal effect of a 1 standard deviation (0.13 mmol/L) increase in calcium levels on fasting glucose (mM) using a single lead variant from the CASR gene region is 0.044 (95% credible interval −0.002, 0.100). In contrast, using our method to account for the correlation between variants, the corresponding estimate using 17 genetic variants is 0.022 (95% credible interval 0.009, 0.035), a more clearly positive causal effect.
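As a rough illustration of how summarized associations might be combined into one causal estimate, the sketch below computes a single-variant ratio estimate and an inverse-variance weighted estimate that allows for correlation between variants through generalized weighted least squares. It is an assumed, simplified stand-in and not necessarily the method developed in the article: it ignores uncertainty in the variant-risk factor associations and gives confidence intervals, not credible intervals. All function names and all numbers are hypothetical.

```python
import math


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]  # augmented matrix
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        tail = sum(m[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (m[i][n] - tail) / m[i][i]
    return x


def ratio_estimate(bx, by, se_by):
    """Single-variant estimate: outcome association divided by risk factor association."""
    return by / bx, se_by / abs(bx)


def ivw_estimate(bx, by, se_by, rho=None):
    """Weighted estimate from several variants.

    bx, by : per-variant associations with the risk factor and the outcome
    se_by  : standard errors of the outcome associations
    rho    : variant correlation matrix (identity if variants are uncorrelated)
    """
    n = len(bx)
    if rho is None:
        rho = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    # Assumed covariance of the outcome associations: se_i * se_j * rho_ij
    omega = [[se_by[i] * se_by[j] * rho[i][j] for j in range(n)] for i in range(n)]
    w = solve(omega, bx)  # omega^{-1} bx
    denom = sum(w[i] * bx[i] for i in range(n))
    estimate = sum(w[i] * by[i] for i in range(n)) / denom
    return estimate, math.sqrt(1.0 / denom)


if __name__ == "__main__":
    # Hypothetical summarized data for three variants in one gene region
    bx = [0.10, 0.08, 0.12]
    by = [0.004, 0.001, 0.003]
    se_by = [0.002, 0.002, 0.003]
    rho = [[1.0, 0.4, 0.2], [0.4, 1.0, 0.3], [0.2, 0.3, 1.0]]

    est, se = ivw_estimate(bx, by, se_by, rho)
    print("estimate:", est)
    print("95% confidence interval:", est - 1.96 * se, est + 1.96 * se)
```

Passing `rho=None` treats the variants as uncorrelated. Supplying the correlation matrix stops correlated variants from being counted as independent evidence, which would otherwise make the estimate look more precise than it is.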
Metadata
Title
Using published data in Mendelian randomization: a blueprint for efficient identification of causal risk factors
Authors
Stephen Burgess
Robert A. Scott
Nicholas J. Timpson
George Davey Smith
Simon G. Thompson
EPIC-InterAct Consortium
Publication date
01-07-2015
Publisher
Springer Netherlands
Published in
European Journal of Epidemiology / Issue 7/2015
Print ISSN: 0393-2990
Electronic ISSN: 1573-7284
DOI
https://doi.org/10.1007/s10654-015-0011-z
