Published in: BMC Medical Research Methodology 1/2013

Open Access 01-12-2013 | Research article

Screening for data clustering in multicenter studies: the residual intraclass correlation

Authors: Laure Wynants, Dirk Timmerman, Tom Bourne, Sabine Van Huffel, Ben Van Calster


Abstract

Background

In multicenter studies, center-specific variations in measurements may arise for various reasons, such as low interrater reliability, differences in equipment, deviations from the protocol, sociocultural characteristics, and differences in patient populations due to, for example, local referral patterns. The aim of this research is to derive measures for the degree of clustering. We present a method to detect heavily clustered variables and to identify physicians with outlying measurements.

Methods

We use regression models with fixed effects to account for patient case-mix and a random cluster intercept to study clustering by physicians. We propose to use the residual intraclass correlation (RICC), the proportion of residual variance that is situated at the cluster level, to detect variables that are influenced by clustering. An RICC of 0 indicates that none of the residual variance in the measurements is due to variation between clusters. We further suggest evaluating the RICC, where appropriate, in combination with R², the proportion of variance that is explained by the fixed effects. Variables with a high R² may have benefits that outweigh the disadvantages of clustering in terms of statistical analysis. We apply the proposed methods to a dataset collected for the development of models for ovarian tumor diagnosis. We study the variability in 18 tumor characteristics collected through ultrasound examination, 4 patient characteristics, and the serum marker CA-125 measured by 40 physicians on 2407 patients.
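The two quantities defined above can be illustrated with a minimal Python sketch. It assumes the variance components have already been estimated by a random-intercept model fitted elsewhere; the function names, the three-way variance decomposition used for R², and the logistic residual variance constant for binary outcomes are illustrative assumptions, not details taken from the abstract.

```python
import math

# Assumption: for binary outcomes modelled with a logistic link, the
# latent-variable formulation fixes the patient-level residual variance
# at pi^2 / 3. The abstract does not state which convention is used.
LOGISTIC_RESIDUAL_VARIANCE = math.pi ** 2 / 3


def ricc(cluster_variance, residual_variance):
    """Proportion of residual variance situated at the cluster level."""
    if cluster_variance < 0 or residual_variance <= 0:
        raise ValueError("cluster variance must be >= 0 and residual variance > 0")
    return cluster_variance / (cluster_variance + residual_variance)


def r_squared(fixed_variance, cluster_variance, residual_variance):
    """Proportion of total variance explained by the fixed effects.

    Assumes total variance = fixed-effect variance + cluster variance
    + patient-level residual variance (a hypothetical decomposition).
    """
    total = fixed_variance + cluster_variance + residual_variance
    if total <= 0:
        raise ValueError("total variance must be positive")
    return fixed_variance / total


# Made-up variance components, purely for illustration.
print(ricc(1.0, 3.0))              # cluster level holds a quarter of the residual variance
print(r_squared(2.0, 1.0, 1.0))    # fixed effects explain half of the total variance
```

With a cluster variance of 0, `ricc` returns 0, matching the statement that none of the residual variance is then due to variation between clusters.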

Results

The RICC showed large variation between variables: from 2.2% for age to 25.1% for the amount of fluid in the pouch of Douglas. Seven variables had an RICC above 15%, indicating that a considerable part of the variance is due to systematic differences at the physician level, rather than random differences at the patient level. Accounting for differences in ultrasound machine quality reduced the RICC for a number of blood flow measurements.
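As a rough sketch of how such a screening step might look, the Python snippet below flags variables whose RICC exceeds a cut-off. The 15% threshold mirrors the figure quoted above but is used here only as an illustrative default; the function name is hypothetical, and the dictionary contains only the two RICC values reported in this abstract.

```python
def flag_clustered(ricc_by_variable, threshold=0.15):
    """Return the names of variables whose RICC exceeds the threshold, sorted."""
    return sorted(
        name for name, value in ricc_by_variable.items() if value > threshold
    )


# Only the two RICC values reported in the abstract, as proportions.
reported = {
    "age": 0.022,
    "fluid in pouch of Douglas": 0.251,
}

print(flag_clustered(reported))
```

In this example only the amount of fluid in the pouch of Douglas is flagged; age falls well below the cut-off.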

Conclusions

We recommend that the degree of data clustering be addressed during the monitoring and analysis of multicenter studies. The RICC is a useful tool that expresses the degree of clustering as a percentage. Specific applications are data quality monitoring and variable screening prior to the development of a prediction model.
Metadata
Title
Screening for data clustering in multicenter studies: the residual intraclass correlation
Authors
Laure Wynants
Dirk Timmerman
Tom Bourne
Sabine Van Huffel
Ben Van Calster
Publication date
01-12-2013
Publisher
BioMed Central
Published in
BMC Medical Research Methodology / Issue 1/2013
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/1471-2288-13-128
