Skip to main content
Top
Published in: Sports Medicine 1/2000

01-07-2000 | Current Opinion

Measures of Reliability in Sports Medicine and Science

Author: Dr Will G. Hopkins

Published in: Sports Medicine | Issue 1/2000

Login to get access

Abstract

Reliability refers to the reproducibility of values of a test, assay or other measurement in repeated trials on the same individuals. Better reliability implies better precision of single measurements and better tracking of changes in measurements in research or practical settings. The main measures of reliability are within-subject random variation, systematic change in the mean, and retest correlation. A simple, adaptable form of within-subject variation is the typical (standard) error of measurement: the standard deviation of an individual’s repeated measurements. For many measurements in sports medicine and science, the typical error is best expressed as a coefficient of variation (percentage of the mean). A biased, more limited form of within-subject variation is the limits of agreement: the 95% likely range of change of an individual’s measurements between 2 trials. Systematic changes in the mean of a measure between consecutive trials represent such effects as learning, motivation or fatigue; these changes need to be eliminated from estimates of within-subject variation. Retest correlation is difficult to interpret, mainly because its value is sensitive to the heterogeneity of the sample of participants. Uses of reliability include decision-making when monitoring individuals, comparison of tests or equipment, estimation of sample size in experiments and estimation of the magnitude of individual differences in the response to a treatment. Reasonable precision for estimates of reliability requires approximately 50 study participants and at least 3 trials. Studies aimed at assessing variation in reliability between tests or equipment require complex designs and analyses that researchers seldom perform correctly. A wider understanding of reliability and adoption of the typical error as the standard measure of reliability would improve the assessment of tests and equipment in our disciplines.
Literature
1.
go back to reference Atkinson G, Nevill AM. Statistical methods for addressing measurement error (reliability) in variables relevant to sports medicine. Sports Med 1998; 26: 217–38PubMedCrossRef Atkinson G, Nevill AM. Statistical methods for addressing measurement error (reliability) in variables relevant to sports medicine. Sports Med 1998; 26: 217–38PubMedCrossRef
2.
go back to reference Hopkins WG, Hawley JA, Burke LM. Design and analysis of research on sport performance enhancement. Med Sci Sports Exerc 1999; 31: 472–85PubMedCrossRef Hopkins WG, Hawley JA, Burke LM. Design and analysis of research on sport performance enhancement. Med Sci Sports Exerc 1999; 31: 472–85PubMedCrossRef
3.
go back to reference Nevill AM, Atkinson G. Assessing agreement between measurements recorded on a ratio scale in sports medicine and sports science. Br J Sports Med 1997; 31: 314–8PubMedCrossRef Nevill AM, Atkinson G. Assessing agreement between measurements recorded on a ratio scale in sports medicine and sports science. Br J Sports Med 1997; 31: 314–8PubMedCrossRef
4.
go back to reference Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986 Feb; 8: 307–10CrossRef Bland JM, Altman DG. Statistical methods for assessing agreement between two methods of clinical measurement. Lancet 1986 Feb; 8: 307–10CrossRef
5.
go back to reference Roebroeck ME, Harlaar J, Lankhorst GJ. The application of generalizability theory to reliability assessment: an illustration using isometric force measurements. Phys Ther 1993; 73: 386–401PubMed Roebroeck ME, Harlaar J, Lankhorst GJ. The application of generalizability theory to reliability assessment: an illustration using isometric force measurements. Phys Ther 1993; 73: 386–401PubMed
6.
go back to reference VanLeeuwen DM, Barnes MD, Pase M. Generalizability theory: a unified approach to assessing the dependability (reliability) of measurements in the health sciences. J Outcome Measures 1998; 2: 302–25 VanLeeuwen DM, Barnes MD, Pase M. Generalizability theory: a unified approach to assessing the dependability (reliability) of measurements in the health sciences. J Outcome Measures 1998; 2: 302–25
7.
go back to reference Bartko JJ. The intraclass correlation coefficient as a measure of reliability. Psych Reports 1966; 19: 3–11CrossRef Bartko JJ. The intraclass correlation coefficient as a measure of reliability. Psych Reports 1966; 19: 3–11CrossRef
8.
go back to reference Kovaleski JE, Heitman RJ, Gurchiek LR, et al. Reliability and effects of leg dominance on lower extremity isokinetic force and work using the Closed Chain Rider System. J Sport Rehabil 1997; 6: 319–26 Kovaleski JE, Heitman RJ, Gurchiek LR, et al. Reliability and effects of leg dominance on lower extremity isokinetic force and work using the Closed Chain Rider System. J Sport Rehabil 1997; 6: 319–26
9.
go back to reference Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psych Bull 1979; 86: 420–8CrossRef Shrout PE, Fleiss JL. Intraclass correlations: uses in assessing rater reliability. Psych Bull 1979; 86: 420–8CrossRef
10.
go back to reference Kovaleski JE, Ingersoll CD, Knight KL, et al. Reliability of the BTE Dynatrac isotonic dynamometer. Isokinet Exerc Sci 1996; 6: 41–3 Kovaleski JE, Ingersoll CD, Knight KL, et al. Reliability of the BTE Dynatrac isotonic dynamometer. Isokinet Exerc Sci 1996; 6: 41–3
11.
go back to reference Hopkins WG. A new view of statistics. Available from: http://sportsci.org/resource/stats [Accessed 2000 Apr 18] Hopkins WG. A new view of statistics. Available from: http://​sportsci.​org/​resource/​stats [Accessed 2000 Apr 18]
12.
go back to reference Hopkins WG, Manly BFJ. Errors in assigning grades based on tests of finite validity. Res Q Exerc Sport 1989; 60: 180–2PubMed Hopkins WG, Manly BFJ. Errors in assigning grades based on tests of finite validity. Res Q Exerc Sport 1989; 60: 180–2PubMed
13.
go back to reference Cohen J. Statistical power analysis for the behavioral sciences. 2nd ed. Mahwah (NJ): Lawrence Erlbaum, 1988 Cohen J. Statistical power analysis for the behavioral sciences. 2nd ed. Mahwah (NJ): Lawrence Erlbaum, 1988
14.
go back to reference Eliasziw M, Young SL, Woodbury MG, et al. Statistical methodology for the concurrent assessment of interrater and intrarater reliability: using goniometric measurements as an example. Phys Ther 1994; 74: 777–88PubMed Eliasziw M, Young SL, Woodbury MG, et al. Statistical methodology for the concurrent assessment of interrater and intrarater reliability: using goniometric measurements as an example. Phys Ther 1994; 74: 777–88PubMed
15.
go back to reference Clark VR, Hopkins WG, Hawley JA, et al. Placebo effect of carbohydrate feedings during a 40-km cycling time trial. Med Sci Sports Exerc. In press Clark VR, Hopkins WG, Hawley JA, et al. Placebo effect of carbohydrate feedings during a 40-km cycling time trial. Med Sci Sports Exerc. In press
16.
go back to reference Hopkins WG, Wolfinger RD. Estimating ‘individual differences’ in the response to an experimental treatment [abstract]. Med Sci Sports Exerc 1998; 30 (5): S135 Hopkins WG, Wolfinger RD. Estimating ‘individual differences’ in the response to an experimental treatment [abstract]. Med Sci Sports Exerc 1998; 30 (5): S135
17.
go back to reference Tate RF, Klett GW. Optimal confidence intervals for the variance of a normal distribution. J Am Statist Assoc 1959; 54: 674–82CrossRef Tate RF, Klett GW. Optimal confidence intervals for the variance of a normal distribution. J Am Statist Assoc 1959; 54: 674–82CrossRef
18.
go back to reference Hopkins WG. Generalizing to a population. Available from: http://sportsci.org/resource/stats/generalize.html [Accessed 2000 Apr 18] Hopkins WG. Generalizing to a population. Available from: http://​sportsci.​org/​resource/​stats/​generalize.​html [Accessed 2000 Apr 18]
19.
go back to reference Hopkins WG. Reliability: calculations and more. Available from: http://sportsci.org/resource/stats/relycalc.html [Accessed 2000 Apr 18] Hopkins WG. Reliability: calculations and more. Available from: http://​sportsci.​org/​resource/​stats/​relycalc.​html [Accessed 2000 Apr 18]
20.
go back to reference Schabort EJ, Hopkins WG, Hawley JA, et al. High reliability of performance of well-trained rowers on a rowing ergometer. J Sports Sci 1999; 17: 627–32PubMedCrossRef Schabort EJ, Hopkins WG, Hawley JA, et al. High reliability of performance of well-trained rowers on a rowing ergometer. J Sports Sci 1999; 17: 627–32PubMedCrossRef
Metadata
Title
Measures of Reliability in Sports Medicine and Science
Author
Dr Will G. Hopkins
Publication date
01-07-2000
Publisher
Springer International Publishing
Published in
Sports Medicine / Issue 1/2000
Print ISSN: 0112-1642
Electronic ISSN: 1179-2035
DOI
https://doi.org/10.2165/00007256-200030010-00001

Other articles of this Issue 1/2000

Sports Medicine 1/2000 Go to the issue