Skip to main content
Top
Published in: Sports Medicine 3/2014

01-03-2014 | Current Opinion

Distinguishing Between Statistical Significance and Practical/Clinical Meaningfulness Using Statistical Inference

Author: Michael Wilkinson

Published in: Sports Medicine | Issue 3/2014

Login to get access

Abstract

Decisions about support for predictions of theories in light of data are made using statistical inference. The dominant approach in sport and exercise science is the Neyman–Pearson (N–P) significance-testing approach. When applied correctly it provides a reliable procedure for making dichotomous decisions for accepting or rejecting zero-effect null hypotheses with known and controlled long-run error rates. Type I and type II error rates must be specified in advance and the latter controlled by conducting an a priori sample size calculation. The N–P approach does not provide the probability of hypotheses or indicate the strength of support for hypotheses in light of data, yet many scientists believe it does. Outcomes of analyses allow conclusions only about the existence of non-zero effects, and provide no information about the likely size of true effects or their practical/clinical value. Bayesian inference can show how much support data provide for different hypotheses, and how personal convictions should be altered in light of data, but the approach is complicated by formulating probability distributions about prior subjective estimates of population effects. A pragmatic solution is magnitude-based inference, which allows scientists to estimate the true magnitude of population effects and how likely they are to exceed an effect magnitude of practical/clinical importance, thereby integrating elements of subjective Bayesian-style thinking. While this approach is gaining acceptance, progress might be hastened if scientists appreciate the shortcomings of traditional N–P null hypothesis significance testing.
Literature
1.
go back to reference Popper KR. The logic of scientific discovery. 6th ed. London: Hutchinson & Co Ltd; 1972. Popper KR. The logic of scientific discovery. 6th ed. London: Hutchinson & Co Ltd; 1972.
2.
go back to reference Popper KR. Conjectures and refutations: the growth of scientific knowledge. 4th ed. London: Routledge and Kegan Paul Ltd; 1972. Popper KR. Conjectures and refutations: the growth of scientific knowledge. 4th ed. London: Routledge and Kegan Paul Ltd; 1972.
3.
go back to reference Ziliak ST, McClaskey DN. The cult of statistical significance: how the standard error costs us jobs, justice, and lives. Michigan: University of Michigan Press; 2008. Ziliak ST, McClaskey DN. The cult of statistical significance: how the standard error costs us jobs, justice, and lives. Michigan: University of Michigan Press; 2008.
4.
go back to reference Batterham AM, Hopkins WG. Making meaningful inferences about magnitudes. Int J Sport Phys Perf. 2006;1:50–7. Batterham AM, Hopkins WG. Making meaningful inferences about magnitudes. Int J Sport Phys Perf. 2006;1:50–7.
5.
go back to reference Krantz DH. The null hypothesis testing controversy in psychology. J Am Stat Assoc. 1999;94(448):1372–81.CrossRef Krantz DH. The null hypothesis testing controversy in psychology. J Am Stat Assoc. 1999;94(448):1372–81.CrossRef
6.
go back to reference Sterne JAC, Smith GD. Sifting the evidence: what’s wrong with significance tests? BMJ. 2001;322:226–31.PubMedCrossRef Sterne JAC, Smith GD. Sifting the evidence: what’s wrong with significance tests? BMJ. 2001;322:226–31.PubMedCrossRef
7.
go back to reference Fisher R. Statistical methods for research workers. London: Oliver and Boyd; 1950. Fisher R. Statistical methods for research workers. London: Oliver and Boyd; 1950.
8.
go back to reference Fisher R. Statistical methods and scientific inference. London: Collins Macmillan; 1973. Fisher R. Statistical methods and scientific inference. London: Collins Macmillan; 1973.
9.
go back to reference Neyman J, Pearson ES. On the problem of the most efficient tests of statistical hypotheses. Philos Trans R Soc Lond Ser A. 1933;231:289–337.CrossRef Neyman J, Pearson ES. On the problem of the most efficient tests of statistical hypotheses. Philos Trans R Soc Lond Ser A. 1933;231:289–337.CrossRef
10.
go back to reference von Mises R. Probability, statistics and truth. 2nd ed. London: Allen and Unwin; 1928. von Mises R. Probability, statistics and truth. 2nd ed. London: Allen and Unwin; 1928.
11.
go back to reference Oakes M. Statistical inference: a commentary for the social and behavioural sciences. New Jersey: Wiley; 1986. Oakes M. Statistical inference: a commentary for the social and behavioural sciences. New Jersey: Wiley; 1986.
12.
go back to reference Dienes Z. Bayesian versus orthodox statistics: which side are you on? Perspect Psychol Sci. 2011;6(3):274–90.CrossRef Dienes Z. Bayesian versus orthodox statistics: which side are you on? Perspect Psychol Sci. 2011;6(3):274–90.CrossRef
13.
go back to reference Pollard P, Richardson JTE. On the probability of making type I errors. Psychol Bull. 1987;102(1):159–63.CrossRef Pollard P, Richardson JTE. On the probability of making type I errors. Psychol Bull. 1987;102(1):159–63.CrossRef
14.
go back to reference Cumming G. Understanding the new statistics: effect sizes, confidence intervals and meta analysis. New York: Taylor and Francis Group; 2012. Cumming G. Understanding the new statistics: effect sizes, confidence intervals and meta analysis. New York: Taylor and Francis Group; 2012.
15.
go back to reference Hopkins WG, Marshall SW, Batterham AM, Hanin J. Progressive statistics for studies in sports medicine and exercise science. Med Sci Sports Exerc. 2009;41(1):3–12.PubMedCrossRef Hopkins WG, Marshall SW, Batterham AM, Hanin J. Progressive statistics for studies in sports medicine and exercise science. Med Sci Sports Exerc. 2009;41(1):3–12.PubMedCrossRef
16.
go back to reference Edwards AWF. Likelihood. Cambridge: Cambridge University Press; 1972. Edwards AWF. Likelihood. Cambridge: Cambridge University Press; 1972.
18.
go back to reference Little RJ. Calibrated Bayes, for statistics in general, and missing data in particular. Stat Sci. 2011;26(2):162–74.CrossRef Little RJ. Calibrated Bayes, for statistics in general, and missing data in particular. Stat Sci. 2011;26(2):162–74.CrossRef
Metadata
Title
Distinguishing Between Statistical Significance and Practical/Clinical Meaningfulness Using Statistical Inference
Author
Michael Wilkinson
Publication date
01-03-2014
Publisher
Springer International Publishing
Published in
Sports Medicine / Issue 3/2014
Print ISSN: 0112-1642
Electronic ISSN: 1179-2035
DOI
https://doi.org/10.1007/s40279-013-0125-y

Other articles of this Issue 3/2014

Sports Medicine 3/2014 Go to the issue