Published in: BMC Medical Research Methodology 1/2012

Open Access 01-12-2012 | Research article

t-tests, non-parametric tests, and large studies—a paradox of statistical practice?

Author: Morten W Fagerland

Abstract

Background

During the last 30 years, the median sample size of research studies published in high-impact medical journals has increased manyfold, while the use of non-parametric tests has increased at the expense of t-tests. This paper explores this paradoxical practice and illustrates its consequences.

Methods

A simulation study is used to compare the rejection rates of the Wilcoxon-Mann-Whitney (WMW) test and the two-sample t-test for increasing sample size. Samples are drawn from skewed distributions with equal means and medians but with a small difference in spread. A hypothetical case study is used for illustration and motivation.
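
A simulation of this kind can be sketched as follows. This is a minimal illustration under stated assumptions, not the author's code: the abstract does not name the distributions, so the sketch uses shifted lognormals (the function `shifted_lognormal_params` and its parameters `s1` and `s2` are hypothetical), constructed so that both groups share the same mean and the same median while their spread differs. Both tests use large-sample normal approximations to keep the example dependency-free.

```python
import math
import random
from statistics import NormalDist, fmean, variance

def shifted_lognormal_params(s1, s2):
    """Return (shift, mu, sigma) for two groups sharing mean and median.

    Group 1 is a plain lognormal with mu = 0: median 1, mean exp(s1**2 / 2).
    Group 2 (hypothetical construction) is rescaled and shifted to match both.
    """
    mu2 = math.log(math.expm1(s1**2 / 2) / math.expm1(s2**2 / 2))
    shift2 = 1.0 - math.exp(mu2)
    return (0.0, 0.0, s1), (shift2, mu2, s2)

def draw(rng, n, shift, mu, sigma):
    """Draw n observations from a shifted lognormal distribution."""
    return [shift + rng.lognormvariate(mu, sigma) for _ in range(n)]

def two_sided_p(z):
    """Two-sided p-value for a standard normal test statistic."""
    return 2.0 * (1.0 - NormalDist().cdf(abs(z)))

def t_test_p(x, y):
    """Unequal-variance t statistic, referred to the normal distribution.

    Assumption: samples are large enough that t and normal quantiles agree.
    """
    se = math.sqrt(variance(x) / len(x) + variance(y) / len(y))
    return two_sided_p((fmean(x) - fmean(y)) / se)

def wmw_p(x, y):
    """WMW test via the normal approximation; assumes no tied values."""
    n, m = len(x), len(y)
    labelled = sorted([(v, 0) for v in x] + [(v, 1) for v in y])
    rank_sum_x = sum(i + 1 for i, (_, g) in enumerate(labelled) if g == 0)
    u = rank_sum_x - n * (n + 1) / 2
    z = (u - n * m / 2) / math.sqrt(n * m * (n + m + 1) / 12)
    return two_sided_p(z)

def rejection_rates(n, s1, s2, reps=500, alpha=0.05, seed=1):
    """Proportion of replications with p < alpha for (t-test, WMW test)."""
    rng = random.Random(seed)
    g1, g2 = shifted_lognormal_params(s1, s2)
    t_rej = w_rej = 0
    for _ in range(reps):
        x, y = draw(rng, n, *g1), draw(rng, n, *g2)
        t_rej += t_test_p(x, y) < alpha
        w_rej += wmw_p(x, y) < alpha
    return t_rej / reps, w_rej / reps
```

Calling `rejection_rates(n, s1, s2)` for increasing `n` gives the two rejection rates side by side. Because the population means are equal by construction, the t-test column estimates its type I error rate for the null hypothesis of equal means.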

Results

The WMW test produces, on average, smaller p-values than the t-test. This discrepancy increases with increasing sample size, skewness, and difference in spread. For heavily skewed data, the proportion of p < 0.05 with the WMW test can be greater than 90% if the standard deviations differ by 10% and the number of observations is 1000 in each group. The high rejection rates of the WMW test should be interpreted as the power to detect that a randomly drawn observation from one of the distributions is smaller than a randomly drawn observation from the other distribution with probability greater than 50%.
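
The probability in this interpretation can be estimated directly from two samples as the proportion of all cross-group pairs in which the first observation is the smaller one. This proportion equals a Mann-Whitney U statistic divided by the product of the sample sizes. The sketch below is illustrative only; the function name `prob_less` is hypothetical, and ties are counted as one half by assumption.

```python
from bisect import bisect_left, bisect_right

def prob_less(x, y):
    """Estimate P(X < Y) + 0.5 * P(X = Y) from samples x and y."""
    ys = sorted(y)
    total = 0.0
    for v in x:
        lo = bisect_left(ys, v)   # number of y values strictly below v
        hi = bisect_right(ys, v)  # number of y values at or below v
        # y values above v count fully, y values tied with v count half
        total += (len(ys) - hi) + 0.5 * (hi - lo)
    return total / (len(x) * len(y))
```

A value near 0.5 means neither group tends to produce the smaller observation. With 1000 observations per group, even a small departure from 0.5 can yield p < 0.05, although the means and medians are equal.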

Conclusions

Non-parametric tests are most useful for small studies. Using non-parametric tests in large studies may provide answers to the wrong question, thus confusing readers. For studies with a large sample size, t-tests and their corresponding confidence intervals can and should be used even for heavily skewed data.
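
As an illustration of the confidence interval recommended here, the following sketch computes a large-sample interval for the difference in means. It is a simplified approximation, not the paper's procedure: it uses a normal quantile in place of the t quantile, which is an assumption that is reasonable only for large samples, and the function name `mean_diff_ci` is hypothetical.

```python
import math
from statistics import NormalDist, fmean, variance

def mean_diff_ci(x, y, level=0.95):
    """Return (difference, lower, upper) for mean(x) - mean(y).

    Uses the unequal-variance standard error and a normal quantile,
    so it is intended for large samples only.
    """
    diff = fmean(x) - fmean(y)
    se = math.sqrt(variance(x) / len(x) + variance(y) / len(y))
    z = NormalDist().inv_cdf(0.5 + level / 2)
    return diff, diff - z * se, diff + z * se
```

Unlike a p-value from a rank test, the interval is expressed in the units of the outcome and refers directly to the difference in means.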
Metadata
Title
t-tests, non-parametric tests, and large studies—a paradox of statistical practice?
Author
Morten W Fagerland
Publication date
01-12-2012
Publisher
BioMed Central
Published in
BMC Medical Research Methodology / Issue 1/2012
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/1471-2288-12-78
