01-10-2010 | Letter to the Editor

Do split your epidemiological data

Authors: Fredrik A. Dahl, Jūratė Šaltytė Benth

Published in: European Journal of Epidemiology | Issue 10/2010

Excerpt

This letter is a response to the commentary of Kallberg et al. [1], in which they argue against data splitting as a way of protecting against false positive discoveries in scientific studies. When reading it, we were rather surprised that it failed to refer to the article [2], which was published relatively recently in this journal and discusses the same issue. Kallberg et al. analyze a two-stage testing procedure that defines a finding as valid if it is statistically significant at the 5% level in each part. They correctly argue that this trivially gives a significance level of (0.05)² = 0.0025, and that there exist more powerful tests with the same significance level. On closer reading, Kallberg et al. appear to discuss the analysis of genomic data, a field in which this two-stage hypothesis testing procedure is indeed sometimes used. But why, then, is this published in a journal on epidemiology? And how could they avoid any reference to [2], which does discuss splitting of epidemiological data? …
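
The arithmetic behind the two-stage procedure can be illustrated with a minimal sketch. It is not taken from the letter or from Kallberg et al.; it assumes a one-sided z-test on normally distributed observations with unit variance, two independent halves of equal size, and hypothetical function names and example values (effect size 0.2, n = 200) chosen only for illustration. Under those assumptions it compares the power of the split procedure with that of a single pooled test run at the same overall level of 0.0025.

```python
from statistics import NormalDist

STD_NORMAL = NormalDist()  # standard normal distribution (mean 0, sd 1)


def split_test_level(alpha_per_half: float = 0.05) -> float:
    """Overall level when both independent halves must be significant."""
    return alpha_per_half ** 2


def split_test_power(effect: float, n: int, alpha_per_half: float = 0.05) -> float:
    """Power of the two-stage procedure (assumed one-sided z-test per half).

    Each half holds n/2 observations, so its z-statistic is shifted by
    effect * sqrt(n/2). Both halves must reject, hence the square.
    """
    z_crit = STD_NORMAL.inv_cdf(1 - alpha_per_half)
    shift = effect * (n / 2) ** 0.5
    power_one_half = 1 - STD_NORMAL.cdf(z_crit - shift)
    return power_one_half ** 2


def pooled_test_power(effect: float, n: int, alpha: float) -> float:
    """Power of one one-sided z-test on all n observations at level alpha."""
    z_crit = STD_NORMAL.inv_cdf(1 - alpha)
    shift = effect * n ** 0.5
    return 1 - STD_NORMAL.cdf(z_crit - shift)


if __name__ == "__main__":
    level = split_test_level(0.05)  # (0.05)^2
    effect, n = 0.2, 200            # hypothetical values for illustration
    print("overall level of split procedure:", level)
    print("power, split procedure:", split_test_power(effect, n))
    print("power, pooled test at same level:", pooled_test_power(effect, n, level))
```

With zero effect both procedures reject with probability 0.0025, and in this assumed setting the pooled test has the higher power for a positive effect, which is the point the letter concedes to Kallberg et al.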
