Skip to main content
Top
Published in: Trials 1/2015

Open Access 01-12-2015 | Research

Sample size calculation for a stepped wedge trial

Authors: Gianluca Baio, Andrew Copas, Gareth Ambler, James Hargreaves, Emma Beard, Rumana Z Omar

Published in: Trials | Issue 1/2015

Login to get access

Abstract

Background

Stepped wedge trials (SWTs) can be considered as a variant of a clustered randomised trial, although in many ways they embed additional complications from the point of view of statistical design and analysis. While the literature is rich for standard parallel or clustered randomised clinical trials (CRTs), it is much less so for SWTs. The specific features of SWTs need to be addressed properly in the sample size calculations to ensure valid estimates of the intervention effect.

Methods

We critically review the available literature on analytical methods to perform sample size and power calculations in a SWT. In particular, we highlight the specific assumptions underlying currently used methods and comment on their validity and potential for extensions. Finally, we propose the use of simulation-based methods to overcome some of the limitations of analytical formulae. We performed a simulation exercise in which we compared simulation-based sample size computations with analytical methods and assessed the impact of varying the basic parameters to the resulting sample size/power, in the case of continuous and binary outcomes and assuming both cross-sectional data and the closed cohort design.

Results

We compared the sample size requirements for a SWT in comparison to CRTs based on comparable number of measurements in each cluster. In line with the existing literature, we found that when the level of correlation within the clusters is relatively high (for example, greater than 0.1), the SWT requires a smaller number of clusters. For low values of the intracluster correlation, the two designs produce more similar requirements in terms of total number of clusters. We validated our simulation-based approach and compared the results of sample size calculations to analytical methods; the simulation-based procedures perform well, producing results that are extremely similar to the analytical methods. We found that usually the SWT is relatively insensitive to variations in the intracluster correlation, and that failure to account for a potential time effect will artificially and grossly overestimate the power of a study.

Conclusions

We provide a framework for handling the sample size and power calculations of a SWT and suggest that simulation-based procedures may be more effective, especially in dealing with the specific features of the study at hand. In selected situations and depending on the level of intracluster correlation and the cluster size, SWTs may be more efficient than comparable CRTs. However, the decision about the design to be implemented will be based on a wide range of considerations, including the cost associated with the number of clusters, number of measurements and the trial duration.
Literature
1.
go back to reference Murray D. The design and analysis of group randomised trials. Oxford, UK: Oxford University Press; 1998. Murray D. The design and analysis of group randomised trials. Oxford, UK: Oxford University Press; 1998.
2.
go back to reference Gail M, Byar D, Pechacek T, Corle D. Aspects of statistical design for the Community Intervention Trial for Smoking Cessation (COMMIT). Control Clin Trials. 1992; 13:6–21.CrossRefPubMed Gail M, Byar D, Pechacek T, Corle D. Aspects of statistical design for the Community Intervention Trial for Smoking Cessation (COMMIT). Control Clin Trials. 1992; 13:6–21.CrossRefPubMed
3.
go back to reference Donner A, Birkett N, Buck C. Randomization by cluster: sample size requirements and analysis. Am J Epidemiol. 1981; 114:906–14.PubMed Donner A, Birkett N, Buck C. Randomization by cluster: sample size requirements and analysis. Am J Epidemiol. 1981; 114:906–14.PubMed
4.
go back to reference Donner A. Sample size requirements for stratified cluster randomization designs. Stat Med. 1992; 11:743–50.CrossRefPubMed Donner A. Sample size requirements for stratified cluster randomization designs. Stat Med. 1992; 11:743–50.CrossRefPubMed
5.
go back to reference Shoukri M, Martin S. Estimating the number of clusters for the analysis of correlated binary response variables from unbalanced data. Stat Med. 1992; 11:751–60.CrossRefPubMed Shoukri M, Martin S. Estimating the number of clusters for the analysis of correlated binary response variables from unbalanced data. Stat Med. 1992; 11:751–60.CrossRefPubMed
6.
go back to reference Shipley M, Smith P, Dramaix M. Calculation of power for matched pair studies when randomization is by group. Int J Epidemiol. 1989; 18:457–61.CrossRefPubMed Shipley M, Smith P, Dramaix M. Calculation of power for matched pair studies when randomization is by group. Int J Epidemiol. 1989; 18:457–61.CrossRefPubMed
7.
go back to reference Hsieh F. Sample size formulae for intervention studies with the cluster as unit of randomization. Stat Med. 1988; 8:1195–201.CrossRef Hsieh F. Sample size formulae for intervention studies with the cluster as unit of randomization. Stat Med. 1988; 8:1195–201.CrossRef
8.
go back to reference Donner A, Klar N. Design and analysis of cluster randomisation trials in health research. London, UK: Arnold; 2000. Donner A, Klar N. Design and analysis of cluster randomisation trials in health research. London, UK: Arnold; 2000.
9.
go back to reference Liu A, Shih W, Gehan E. Sample size and power determination for clustered repeated measurements. Stat Med. 2002; 21:1787–801.CrossRefPubMed Liu A, Shih W, Gehan E. Sample size and power determination for clustered repeated measurements. Stat Med. 2002; 21:1787–801.CrossRefPubMed
10.
go back to reference Hargreaves J, Copas A, Beard E, Osrin D, Lewis J, Davey C, et al.Five questions to consider before conducting a stepped wedge trial. Trials. 2015. Hargreaves J, Copas A, Beard E, Osrin D, Lewis J, Davey C, et al.Five questions to consider before conducting a stepped wedge trial. Trials. 2015.
11.
go back to reference Beard E, Lewis J, Prost A, Copas A, Davey C, Osrin D, et al.Stepped wedge randomised controlled trials: systematic review. Trials. 2015. Beard E, Lewis J, Prost A, Copas A, Davey C, Osrin D, et al.Stepped wedge randomised controlled trials: systematic review. Trials. 2015.
13.
go back to reference Mdege N, Man M, Brown C, Torgersen D. Systematic review of stepped wedge cluster randomised trials shows that design is particularly used to evaluate interventions during routine implementation. J Clin Epidemiol. 2011; 64:936–48.CrossRefPubMed Mdege N, Man M, Brown C, Torgersen D. Systematic review of stepped wedge cluster randomised trials shows that design is particularly used to evaluate interventions during routine implementation. J Clin Epidemiol. 2011; 64:936–48.CrossRefPubMed
14.
go back to reference Hussey M, Hughes J. Design and analysis of stepped wedge cluster randomised trials. Contemporary Clin Trials. 2007; 28:182–91.CrossRef Hussey M, Hughes J. Design and analysis of stepped wedge cluster randomised trials. Contemporary Clin Trials. 2007; 28:182–91.CrossRef
15.
go back to reference Woertman W, de Hoop E, Moerbeek M, Zuidema S, Gerritsen D, Teerenstra S. Stepped wedge designs could reduce the required sample size in cluster randomized trials. J Clin Epidemiol. 2013; 66(7):52–8.CrossRef Woertman W, de Hoop E, Moerbeek M, Zuidema S, Gerritsen D, Teerenstra S. Stepped wedge designs could reduce the required sample size in cluster randomized trials. J Clin Epidemiol. 2013; 66(7):52–8.CrossRef
16.
go back to reference Moulton L, Golub J, Burovni B, Cavalcante S, Pacheco A, Saraceni V, et al.Statistical design of THRio: a phased implementation clinic-randomized study of a tuberculosis preventive therapy intervention. Clin Trials. 2007; 4:190–9.CrossRefPubMed Moulton L, Golub J, Burovni B, Cavalcante S, Pacheco A, Saraceni V, et al.Statistical design of THRio: a phased implementation clinic-randomized study of a tuberculosis preventive therapy intervention. Clin Trials. 2007; 4:190–9.CrossRefPubMed
17.
go back to reference Hemming K, Lilford R, Girling A. Stepped-wedge cluster randomised controlled trials: a generic framework including parallel and multiple-level design. Stat Med. 2015 Jan 30; 34(2):181–196.CrossRefPubMed Hemming K, Lilford R, Girling A. Stepped-wedge cluster randomised controlled trials: a generic framework including parallel and multiple-level design. Stat Med. 2015 Jan 30; 34(2):181–196.CrossRefPubMed
18.
go back to reference Hemming K, Haines T, Chilton A, Girling A, Lilford R. The stepped wedge cluster randomised trial: rationale, design, analysis and reporting. Br Med J. 2015 Feb 6; 350:h391. doi:10.1136/bmj.h391.CrossRef Hemming K, Haines T, Chilton A, Girling A, Lilford R. The stepped wedge cluster randomised trial: rationale, design, analysis and reporting. Br Med J. 2015 Feb 6; 350:h391. doi:10.1136/bmj.h391.CrossRef
19.
go back to reference Handley M, Schillinger D, Shiboski S. Quasi-experimental designs in practice-based research settings: design and implementation considerations. J Am Board Fam Med. 2011; 24(5):589–96.CrossRefPubMed Handley M, Schillinger D, Shiboski S. Quasi-experimental designs in practice-based research settings: design and implementation considerations. J Am Board Fam Med. 2011; 24(5):589–96.CrossRefPubMed
20.
go back to reference Hemming K, Girling A. A menu-driven facility for power and detectable-difference calculations in stepped-wedge cluster-randomized trials. Stat J. 2014;14(2):363–380. Hemming K, Girling A. A menu-driven facility for power and detectable-difference calculations in stepped-wedge cluster-randomized trials. Stat J. 2014;14(2):363–380.
21.
go back to reference StataCorp. Stata 13 base reference Manual. College Station, TX: Stata Press; 2013. http://www.stata.com/. StataCorp. Stata 13 base reference Manual. College Station, TX: Stata Press; 2013. http://​www.​stata.​com/​.
22.
go back to reference Copas A, Lewis J, Thompson J, Davey C, Fielding K, Baio G, et al.Designing a stepped wedge trial: three main designs, carry-over effects and randomisation approaches. Trials. 2015. Copas A, Lewis J, Thompson J, Davey C, Fielding K, Baio G, et al.Designing a stepped wedge trial: three main designs, carry-over effects and randomisation approaches. Trials. 2015.
23.
go back to reference Hayes R, Bennett S. Simple sample size calculations for cluster randomised trials. Int J Epidemiol. 1999; 28:319–26.CrossRefPubMed Hayes R, Bennett S. Simple sample size calculations for cluster randomised trials. Int J Epidemiol. 1999; 28:319–26.CrossRefPubMed
24.
go back to reference Dimairo M, Bradburn M, Walters S. Sample size determination through power simulation; practical lessons from a stepped wedge cluster randomised trial (SW CRT). Trials. 2011; 12(1):26.CrossRef Dimairo M, Bradburn M, Walters S. Sample size determination through power simulation; practical lessons from a stepped wedge cluster randomised trial (SW CRT). Trials. 2011; 12(1):26.CrossRef
25.
go back to reference Gelman A, Hill J. Data analysis using regression and multilevel/hierarchical models. Cambridge, UK: Cambridge University Press; 2006.CrossRef Gelman A, Hill J. Data analysis using regression and multilevel/hierarchical models. Cambridge, UK: Cambridge University Press; 2006.CrossRef
26.
go back to reference Burton A, Altman D, Royston P, Holder R. The design of simulation studies in medical statistics. Stat Med. 2006; 25:4279–292.CrossRefPubMed Burton A, Altman D, Royston P, Holder R. The design of simulation studies in medical statistics. Stat Med. 2006; 25:4279–292.CrossRefPubMed
27.
go back to reference Landau S, Stahl S. Sample size and power calculations for medical studies by simulation when closed form expressions are not available. Stat Methods Med Res. 2013; 22(3):324–45.CrossRefPubMed Landau S, Stahl S. Sample size and power calculations for medical studies by simulation when closed form expressions are not available. Stat Methods Med Res. 2013; 22(3):324–45.CrossRefPubMed
28.
go back to reference Kitson A, Schultz T, Long L, Shanks A, Wiechula R, Chapman I, et al.The prevention and reduction of weight loss in an acute tertiary care setting: protocol for a pragmatic stepped wedge randomised cluster trial (the PRoWL project). BMC Health Serv Res. 2013;13(299). http://www.biomedcentral.com/1472-6963/13/2. Kitson A, Schultz T, Long L, Shanks A, Wiechula R, Chapman I, et al.The prevention and reduction of weight loss in an acute tertiary care setting: protocol for a pragmatic stepped wedge randomised cluster trial (the PRoWL project). BMC Health Serv Res. 2013;13(299). http://​www.​biomedcentral.​com/​1472-6963/​13/​2.
29.
go back to reference Schultz T, Kitson A, Soenen S, Long L, Shanks A, Wiechula R, Chapman I, Lange K. Does a multidisciplinary nutritional intervention prevent nutritional decline in hospital patients? A stepped wedge randomised cluster trial. e-SPEN J. 2014; 9(2):84–90.CrossRef Schultz T, Kitson A, Soenen S, Long L, Shanks A, Wiechula R, Chapman I, Lange K. Does a multidisciplinary nutritional intervention prevent nutritional decline in hospital patients? A stepped wedge randomised cluster trial. e-SPEN J. 2014; 9(2):84–90.CrossRef
30.
go back to reference Bacchieri G, Barros A, Santos J, Goncalves H, Gigante D. A community intervention to prevent traffic accidents among bicycle commuters. Revista de Saude Publica. 2010; 44(5):867–75.CrossRefPubMed Bacchieri G, Barros A, Santos J, Goncalves H, Gigante D. A community intervention to prevent traffic accidents among bicycle commuters. Revista de Saude Publica. 2010; 44(5):867–75.CrossRefPubMed
31.
go back to reference Spiegelhalter D, Abrams K, Myles J. Bayesian approaches to clinical trials and health-care evaluation. London, UK: Wiley and Sons; 2004. Spiegelhalter D, Abrams K, Myles J. Bayesian approaches to clinical trials and health-care evaluation. London, UK: Wiley and Sons; 2004.
32.
go back to reference Hemming K, Girling A, Martin J, Bond S. Stepped wedge cluster randomized trials are efficient and provide a method of evaluation without which some interventions would not be evaluated. J Clin Epidemiol. 2013; 66(9):1058–9.CrossRefPubMed Hemming K, Girling A, Martin J, Bond S. Stepped wedge cluster randomized trials are efficient and provide a method of evaluation without which some interventions would not be evaluated. J Clin Epidemiol. 2013; 66(9):1058–9.CrossRefPubMed
33.
go back to reference Duncan G, Kalton G. Issues of design and analysis of surveys across time. Int Stat Rev. 1987; 55:97–117.CrossRef Duncan G, Kalton G. Issues of design and analysis of surveys across time. Int Stat Rev. 1987; 55:97–117.CrossRef
34.
go back to reference R Core Team. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2014. http://www.R-project.org. R Core Team. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing; 2014. http://​www.​R-project.​org.
36.
go back to reference Murrey D, Blitstein J. Methods to reduce the impact of intraclass correlation in group-randomised trials. Eval Rev. 2013; 27(1):79–103.CrossRef Murrey D, Blitstein J. Methods to reduce the impact of intraclass correlation in group-randomised trials. Eval Rev. 2013; 27(1):79–103.CrossRef
37.
go back to reference Keriel-Gascou M, Buchet-Poyau K, Rabilloud M, Duclos A, Colin C. A stepped wedge cluster randomized trial is preferable for assessing complex health interventions. J Clin Epidemiol. 2014; 67(7):831–3.CrossRefPubMed Keriel-Gascou M, Buchet-Poyau K, Rabilloud M, Duclos A, Colin C. A stepped wedge cluster randomized trial is preferable for assessing complex health interventions. J Clin Epidemiol. 2014; 67(7):831–3.CrossRefPubMed
38.
go back to reference de Hoop E, Woertman W, Teerenstra S. The stepped wedge cluster randomised trial always requires fewer clusters but not always fewer measurements, that is. participants than a parallel cluster randomised trial in a cross-sectional design. J Cli Epidemiol. 2013; 66:1428.CrossRef de Hoop E, Woertman W, Teerenstra S. The stepped wedge cluster randomised trial always requires fewer clusters but not always fewer measurements, that is. participants than a parallel cluster randomised trial in a cross-sectional design. J Cli Epidemiol. 2013; 66:1428.CrossRef
39.
go back to reference Kotz D, Spigt M, Arts I, Crutzen R, Viechtbauer W. The stepped wedge design does not inherently have more power than a cluster randomized controlled trial. J Clin Epidemiol. 2013; 66(9):1059–60.CrossRefPubMed Kotz D, Spigt M, Arts I, Crutzen R, Viechtbauer W. The stepped wedge design does not inherently have more power than a cluster randomized controlled trial. J Clin Epidemiol. 2013; 66(9):1059–60.CrossRefPubMed
40.
go back to reference Pearson D, Torgerson D, McDougall C, Bowles R. Parable of two agencies, one of which randomizes. Ann Am Acad Polit Soci Sci. 2010; 628:11–29.CrossRef Pearson D, Torgerson D, McDougall C, Bowles R. Parable of two agencies, one of which randomizes. Ann Am Acad Polit Soci Sci. 2010; 628:11–29.CrossRef
41.
go back to reference Feng Z, Diehr P, Peterson A, McLerran D. Selected statistical issues in group randomized trials. Annu Rev Public Health. 2001; 22:167.CrossRefPubMed Feng Z, Diehr P, Peterson A, McLerran D. Selected statistical issues in group randomized trials. Annu Rev Public Health. 2001; 22:167.CrossRefPubMed
42.
go back to reference Babyak M. What you see may not be what you get: a brief nontechnical introduction to overfitting in regression-type models. Psychosom Med. 2014; 66:411–21. Babyak M. What you see may not be what you get: a brief nontechnical introduction to overfitting in regression-type models. Psychosom Med. 2014; 66:411–21.
43.
go back to reference Eldridge S, Ashby D, Kerry S. Sample size for cluster randomized trials: effect of coefficient of variation of cluster size and analysis method. Int J Epidemiol. 2006; 35:1292–300.CrossRefPubMed Eldridge S, Ashby D, Kerry S. Sample size for cluster randomized trials: effect of coefficient of variation of cluster size and analysis method. Int J Epidemiol. 2006; 35:1292–300.CrossRefPubMed
Metadata
Title
Sample size calculation for a stepped wedge trial
Authors
Gianluca Baio
Andrew Copas
Gareth Ambler
James Hargreaves
Emma Beard
Rumana Z Omar
Publication date
01-12-2015
Publisher
BioMed Central
Published in
Trials / Issue 1/2015
Electronic ISSN: 1745-6215
DOI
https://doi.org/10.1186/s13063-015-0840-9

Other articles of this Issue 1/2015

Trials 1/2015 Go to the issue