Skip to main content
Top
Published in: European Journal of Epidemiology 1/2017

01-01-2017 | REVIEW

Statistical inference in abstracts of major medical and epidemiology journals 1975–2014: a systematic review

Authors: Andreas Stang, Markus Deckert, Charles Poole, Kenneth J. Rothman

Published in: European Journal of Epidemiology | Issue 1/2017

Login to get access

Abstract

Since its introduction in the twentieth century, null hypothesis significance testing (NHST), a hybrid of significance testing (ST) advocated by Fisher and null hypothesis testing (NHT) developed by Neyman and Pearson, has become widely adopted but has also been a source of debate. The principal alternative to such testing is estimation with point estimates and confidence intervals (CI). Our aim was to estimate time trends in NHST, ST, NHT and CI reporting in abstracts of major medical and epidemiological journals. We reviewed 89,533 abstracts in five major medical journals and seven major epidemiological journals, 1975–2014, and estimated time trends in the proportions of abstracts containing statistical inference. In those abstracts, we estimated time trends in the proportions relying on NHST and its major variants, ST and NHT, and in the proportions reporting CIs without explicit use of NHST (CI-only approach). The CI-only approach rose monotonically during the study period in the abstracts of all journals. In Epidemiology abstracts, as a result of the journal’s editorial policy, the CI-only approach has always been the most common approach. In the other 11 journals, the NHST approach started out more common, but by 2014, this disparity had narrowed, disappeared or reversed in 9 of them. The exceptions were JAMA, New England Journal of Medicine, and Lancet abstracts, where the predominance of the NHST approach prevailed over time. In 2014, the CI-only approach is as popular as the NHST approach in the abstracts of 4 of the epidemiology journals: the American Journal of Epidemiology (48%), the Annals of Epidemiology (55%), Epidemiology (79%) and the International Journal of Epidemiology (52%). The reporting of CIs without explicitly interpreting them as statistical tests is becoming more common in abstracts, particularly in epidemiology journals. Although NHST is becoming less popular in abstracts of most epidemiology journals studied and some widely read medical journals, it is still very common in the abstracts of other widely read medical journals, especially in the hybrid form of ST and NHT in which p values are reported numerically along with declarations of the presence or absence of statistical significance.
Appendix
Available only for authorised users
Literature
1.
go back to reference Gigerenzer G, Swijtink Z, Porter T, Daston L, Beatty J, Krüger L. The empire of chance. How probability changed science and everyday life. Cambridge: Cambridge University Press; 1989.CrossRef Gigerenzer G, Swijtink Z, Porter T, Daston L, Beatty J, Krüger L. The empire of chance. How probability changed science and everyday life. Cambridge: Cambridge University Press; 1989.CrossRef
2.
go back to reference Anderson DR, Burnham KP, Thompson WL. Null hypothesis testing: problems, prevalence, and an alternative. J Wildl Manag. 2000;64(4):912–23.CrossRef Anderson DR, Burnham KP, Thompson WL. Null hypothesis testing: problems, prevalence, and an alternative. J Wildl Manag. 2000;64(4):912–23.CrossRef
4.
go back to reference International Committee of Medical Journal Editors. Uniform requirements for manuscripts submitted to biomedical journals. Br Med J (Clin Res Ed). 1988;296(6619):401–5.CrossRef International Committee of Medical Journal Editors. Uniform requirements for manuscripts submitted to biomedical journals. Br Med J (Clin Res Ed). 1988;296(6619):401–5.CrossRef
5.
go back to reference Gardner MJ, Altman DG. Confidence intervals rather than P values: estimation rather than hypothesis testing. Br Med J (Clin Res Ed). 1986;292(6522):746–50.CrossRef Gardner MJ, Altman DG. Confidence intervals rather than P values: estimation rather than hypothesis testing. Br Med J (Clin Res Ed). 1986;292(6522):746–50.CrossRef
9.
go back to reference Greenland S, Senn SJ, Rothman KJ, et al. Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations. Eur J Epidemiol. 2016;31(4):337–50.CrossRefPubMedPubMedCentral Greenland S, Senn SJ, Rothman KJ, et al. Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations. Eur J Epidemiol. 2016;31(4):337–50.CrossRefPubMedPubMedCentral
10.
go back to reference Wasserstein RL, Lazar NA. The ASA’s statement on p-values: context, process, and purpose. Am Stat. 2016;70(2):129–33.CrossRef Wasserstein RL, Lazar NA. The ASA’s statement on p-values: context, process, and purpose. Am Stat. 2016;70(2):129–33.CrossRef
11.
go back to reference Walter SD. Methods of reporting statistical results from medical research studies. Am J Epidemiol. 1995;141(10):896–906.CrossRefPubMed Walter SD. Methods of reporting statistical results from medical research studies. Am J Epidemiol. 1995;141(10):896–906.CrossRefPubMed
12.
go back to reference Gastwirth JL. Statistical considerations support the supreme court’s decision in Matrixx Initiatives v. Siracusano. Jurimetrics. 2012;52:155–75. Gastwirth JL. Statistical considerations support the supreme court’s decision in Matrixx Initiatives v. Siracusano. Jurimetrics. 2012;52:155–75.
13.
14.
go back to reference Anonymous. Psychology journal bans P values. Nature 2015; 519:9. Anonymous. Psychology journal bans P values. Nature 2015; 519:9.
15.
go back to reference Savitz DA, Tolo KA, Poole C. Statistical significance testing in the American Journal of Epidemiology, 1970–1990. Am J Epidemiol. 1994;139(10):1047–52.CrossRefPubMed Savitz DA, Tolo KA, Poole C. Statistical significance testing in the American Journal of Epidemiology, 1970–1990. Am J Epidemiol. 1994;139(10):1047–52.CrossRefPubMed
16.
go back to reference Fidler F, Thomason N, Cumming G, Finch S, Leeman J. Editors can lead researchers to confidence intervals, but can’t make them think: statistical reform lessons from medicine. Psychol Sci. 2004;15(2):119–26.CrossRefPubMed Fidler F, Thomason N, Cumming G, Finch S, Leeman J. Editors can lead researchers to confidence intervals, but can’t make them think: statistical reform lessons from medicine. Psychol Sci. 2004;15(2):119–26.CrossRefPubMed
17.
go back to reference MacArthur RD, Jackson GG. An evaluation of the use of statistical methodology in the. J Infect Dis. 1984;149(3):349–54.CrossRefPubMed MacArthur RD, Jackson GG. An evaluation of the use of statistical methodology in the. J Infect Dis. 1984;149(3):349–54.CrossRefPubMed
18.
go back to reference Vacha-Haase T, Nilsson JE, Reetz DR, Lance TS, Thompson B. Reporting practices and APA editorial policies regarding statistical significance and effect size. Theory Psychol. 2000;10(3):413–25.CrossRef Vacha-Haase T, Nilsson JE, Reetz DR, Lance TS, Thompson B. Reporting practices and APA editorial policies regarding statistical significance and effect size. Theory Psychol. 2000;10(3):413–25.CrossRef
19.
go back to reference Chavalarias D, Wallach JD, Li AH, Ioannidis JP. Evolution of reporting P values in the biomedical literature, 1990–2015. JAMA. 2016;315(11):1141–8.CrossRefPubMed Chavalarias D, Wallach JD, Li AH, Ioannidis JP. Evolution of reporting P values in the biomedical literature, 1990–2015. JAMA. 2016;315(11):1141–8.CrossRefPubMed
20.
go back to reference Fritz A, Scherndl T, Kühlberger A. A comprehensive review of reporting practices in psychological journals: are effect sizes really enough? Theory Psychol. 2012;23(1):98–112.CrossRef Fritz A, Scherndl T, Kühlberger A. A comprehensive review of reporting practices in psychological journals: are effect sizes really enough? Theory Psychol. 2012;23(1):98–112.CrossRef
21.
go back to reference Thompson B. Journal editorial policies regarding statistical significance tests: heat is to fire as p is to importance. Educ Psychol Rev. 1999;11(2):157–69.CrossRef Thompson B. Journal editorial policies regarding statistical significance tests: heat is to fire as p is to importance. Educ Psychol Rev. 1999;11(2):157–69.CrossRef
22.
go back to reference Cleveland WS, Devlin S, Grosse E. Regression by local fitting. J Econom. 1988;37:87–114.CrossRef Cleveland WS, Devlin S, Grosse E. Regression by local fitting. J Econom. 1988;37:87–114.CrossRef
23.
go back to reference Cleveland WS, Grosse E. Computational methods for local regression. Stat Comput. 1991;1:47–62.CrossRef Cleveland WS, Grosse E. Computational methods for local regression. Stat Comput. 1991;1:47–62.CrossRef
24.
go back to reference Newcombe RG. Two-sided confidence intervals for the single proportion: comparison of seven methods. Stat Med. 1998;17(8):857–72.CrossRefPubMed Newcombe RG. Two-sided confidence intervals for the single proportion: comparison of seven methods. Stat Med. 1998;17(8):857–72.CrossRefPubMed
25.
go back to reference Milne PH. Presentation graphics for engineering, science, and business. London: E & FN Spon; 2005. Milne PH. Presentation graphics for engineering, science, and business. London: E & FN Spon; 2005.
26.
27.
go back to reference Felson DT, Cupples LA, Meenan RF. Misuse of statistical methods in Arthritis and Rheumatism. 1982 versus 1967-68. Arthritis Rheum. 1984;27(9):1018–22.CrossRefPubMed Felson DT, Cupples LA, Meenan RF. Misuse of statistical methods in Arthritis and Rheumatism. 1982 versus 1967-68. Arthritis Rheum. 1984;27(9):1018–22.CrossRefPubMed
28.
go back to reference Arnold LD, Braganza M, Salih R, Colditz GA. Statistical trends in the Journal of the American Medical Association and implications for training across the continuum of medical education. PLoS ONE. 2013;8(10):e77301.CrossRefPubMedPubMedCentral Arnold LD, Braganza M, Salih R, Colditz GA. Statistical trends in the Journal of the American Medical Association and implications for training across the continuum of medical education. PLoS ONE. 2013;8(10):e77301.CrossRefPubMedPubMedCentral
29.
go back to reference Jin Z, Yu D, Zhang L, et al. A retrospective survey of research design and statistical analyses in selected Chinese medical journals in 1998 and 2008. PLoS ONE. 2010;5(5):e10822.CrossRefPubMedPubMedCentral Jin Z, Yu D, Zhang L, et al. A retrospective survey of research design and statistical analyses in selected Chinese medical journals in 1998 and 2008. PLoS ONE. 2010;5(5):e10822.CrossRefPubMedPubMedCentral
31.
go back to reference Deeks JJ, Higgins JPT, Altman DG. Analysing data and undertaking meta-analyses. In: Higgins JPT, Green S, editors. Cochrane handbook for systematic reviews of interventions version 510 (updated March 2011): Cochrane Collaboration (www.handbook.cochrane.com); 2011. Deeks JJ, Higgins JPT, Altman DG. Analysing data and undertaking meta-analyses. In: Higgins JPT, Green S, editors. Cochrane handbook for systematic reviews of interventions version 510 (updated March 2011): Cochrane Collaboration (www.​handbook.​cochrane.​com); 2011.
32.
go back to reference Koricheva J, Gurevitch J. Place of meta-analysis among other methods of research synthesis. In: Koricheva J, Gurevitch J, Mengerson K, editors. Handbook of meta-analysis in ecology and evolution. Princeton: Princeton University Press; 2013. p. 1–13. Koricheva J, Gurevitch J. Place of meta-analysis among other methods of research synthesis. In: Koricheva J, Gurevitch J, Mengerson K, editors. Handbook of meta-analysis in ecology and evolution. Princeton: Princeton University Press; 2013. p. 1–13.
33.
go back to reference Freemantle N, Geddes J. Understanding and interpreting systematic reviews and meta-analyses. Part 2: meta-analyses. Evid Based. Mental Health. 1998;1:102–4. Freemantle N, Geddes J. Understanding and interpreting systematic reviews and meta-analyses. Part 2: meta-analyses. Evid Based. Mental Health. 1998;1:102–4.
34.
go back to reference Borenstein M, Hedges LV, Higgins JPT, Rothstein HR. Introduction to meta-analysis. Chichester: Wiley; 2009. P. 251–5, 297–302, 325–31. Borenstein M, Hedges LV, Higgins JPT, Rothstein HR. Introduction to meta-analysis. Chichester: Wiley; 2009. P. 251–5, 297–302, 325–31.
Metadata
Title
Statistical inference in abstracts of major medical and epidemiology journals 1975–2014: a systematic review
Authors
Andreas Stang
Markus Deckert
Charles Poole
Kenneth J. Rothman
Publication date
01-01-2017
Publisher
Springer Netherlands
Published in
European Journal of Epidemiology / Issue 1/2017
Print ISSN: 0393-2990
Electronic ISSN: 1573-7284
DOI
https://doi.org/10.1007/s10654-016-0211-1

Other articles of this Issue 1/2017

European Journal of Epidemiology 1/2017 Go to the issue