Skip to main content
Top
Published in: BMC Medical Research Methodology 1/2024

Open Access 01-12-2024 | Cystectomy | Research

Development of the multivariate administrative data cystectomy model and its impact on misclassification bias

Authors: James Ross, Luke T. Lavallee, Duane Hickling, Carl van Walraven

Published in: BMC Medical Research Methodology | Issue 1/2024

Login to get access

Abstract

Background

Misclassification bias (MB) is the deviation of measured from true values due to incorrect case assignment. This study compared MB when cystectomy status was determined using administrative database codes vs. predicted cystectomy probability.

Methods

We identified every primary cystectomy-diversion type at a single hospital 2009–2019. We linked to claims data to measure true association of cystectomy with 30 patient and hospitalization factors. Associations were also measured when cystectomy status was assigned using billing codes and by cystectomy probability from multivariate logistic regression model with covariates from administrative data. MB was the difference between measured and true associations.

Results

500 people underwent cystectomy (0.12% of 428 677 hospitalizations). Sensitivity and positive predictive values for cystectomy codes were 97.1% and 58.6% for incontinent diversions and 100.0% and 48.4% for continent diversions, respectively. The model accurately predicted cystectomy-incontinent diversion (c-statistic [C] 0.999, Integrated Calibration Index [ICI] 0.000) and cystectomy-continent diversion (C:1.000, ICI 0.000) probabilities. MB was significantly lower when model-based predictions was used to impute cystectomy-diversion type status using for both incontinent cystectomy (F = 12.75; p < .0001) and continent cystectomy (F = 11.25; p < .0001).

Conclusions

A model using administrative data accurately returned the probability that cystectomy by diversion type occurred during a hospitalization. Using this model to impute cystectomy status minimized MB. Accuracy of administrative database research can be increased by using probabilistic imputation to determine case status instead of individual codes.
Appendix
Available only for authorised users
Literature
2.
go back to reference van Walraven C, Bennett C, Forster AJ. Administrative database research infrequently uses validated diagnostic or procedural codes. J Clin Epidemiol. 2011;64(10):1054–9.CrossRefPubMed van Walraven C, Bennett C, Forster AJ. Administrative database research infrequently uses validated diagnostic or procedural codes. J Clin Epidemiol. 2011;64(10):1054–9.CrossRefPubMed
3.
go back to reference Tan H, Clarke R, Chamie K, et al. Development and Validation of an Automated Method to Identify Patients Undergoing Radical Cystectomy for Bladder Cancer Using Natural Language Processing. Urology Practice. 2017;4(5):365–72.CrossRefPubMed Tan H, Clarke R, Chamie K, et al. Development and Validation of an Automated Method to Identify Patients Undergoing Radical Cystectomy for Bladder Cancer Using Natural Language Processing. Urology Practice. 2017;4(5):365–72.CrossRefPubMed
4.
go back to reference Lyon TD, Faraj KS, Brennan E, Tyson MD, Boorjian SA, Spaulding AC. Defining radical cystectomy using the ICD-10 procedure coding system. Urologic Oncology: Seminars and Original Investigations. 2022;40(4):165.CrossRef Lyon TD, Faraj KS, Brennan E, Tyson MD, Boorjian SA, Spaulding AC. Defining radical cystectomy using the ICD-10 procedure coding system. Urologic Oncology: Seminars and Original Investigations. 2022;40(4):165.CrossRef
5.
go back to reference Adamczyk A, Grammatopoulos G, van Walraven C. Minimizing misclassification bias with a model to identify acetabular fractures using health administrative data: A cohort study. Medicine (Baltimore). 2021;100(52):e28223.CrossRefPubMed Adamczyk A, Grammatopoulos G, van Walraven C. Minimizing misclassification bias with a model to identify acetabular fractures using health administrative data: A cohort study. Medicine (Baltimore). 2021;100(52):e28223.CrossRefPubMed
6.
go back to reference van Walraven C. Bootstrap imputation with a disease probability model minimizes bias from misclassification due to administrative database codes. J Clin Epidemiol. 2017;84:114–20.CrossRefPubMed van Walraven C. Bootstrap imputation with a disease probability model minimizes bias from misclassification due to administrative database codes. J Clin Epidemiol. 2017;84:114–20.CrossRefPubMed
7.
go back to reference Austin PC, Laupacis A. A tutorial on methods to estimating clinically and policy-meaningful measures of treatment effects in prospective observational studies: a review. Int J Biostat. 2011;7(1):6.CrossRefPubMedPubMedCentral Austin PC, Laupacis A. A tutorial on methods to estimating clinically and policy-meaningful measures of treatment effects in prospective observational studies: a review. Int J Biostat. 2011;7(1):6.CrossRefPubMedPubMedCentral
8.
go back to reference van Walraven C. Improved correction of misclassification bias with bootstrap imputation. Med Care. 2018;56(7):e39–45.CrossRefPubMed van Walraven C. Improved correction of misclassification bias with bootstrap imputation. Med Care. 2018;56(7):e39–45.CrossRefPubMed
9.
go back to reference Riley RD, Ensor J, Snell KIE, et al. Calculating the sample size required for developing a clinical prediction model. Br Med J. 2020;368: m441.CrossRef Riley RD, Ensor J, Snell KIE, et al. Calculating the sample size required for developing a clinical prediction model. Br Med J. 2020;368: m441.CrossRef
10.
go back to reference Sauerbrei W, Meier-Hirmer C, Benner A, Royston P. Multivariable regression model building by using fractional polynomials: Description of SAS, STATA and R programs. Comput Stat Data Anal. 2006;50(12):3464–85.CrossRef Sauerbrei W, Meier-Hirmer C, Benner A, Royston P. Multivariable regression model building by using fractional polynomials: Description of SAS, STATA and R programs. Comput Stat Data Anal. 2006;50(12):3464–85.CrossRef
11.
go back to reference Austin PC, Steyerberg EW. The Integrated Calibration Index (ICI) and related metrics for quantifying the calibration of logistic regression models. Stat Med. 2019;38(21):4051–65.CrossRefPubMedPubMedCentral Austin PC, Steyerberg EW. The Integrated Calibration Index (ICI) and related metrics for quantifying the calibration of logistic regression models. Stat Med. 2019;38(21):4051–65.CrossRefPubMedPubMedCentral
12.
go back to reference Steyerberg EW. Overfitting and optimism in prediction models. Clinical Prediction Models: A practical appoach to development, validation, and updating. 2nd ed. New York: Springer; 2019. p. 95–112. Steyerberg EW. Overfitting and optimism in prediction models. Clinical Prediction Models: A practical appoach to development, validation, and updating. 2nd ed. New York: Springer; 2019. p. 95–112.
14.
go back to reference Single GM, Predictor C. Analyzing receiver operating characteristic curves with SAS. Cary, N.C.: SAS Institute Inc.; 2007. p. 15–36. Single GM, Predictor C. Analyzing receiver operating characteristic curves with SAS. Cary, N.C.: SAS Institute Inc.; 2007. p. 15–36.
15.
go back to reference van Walraven C. A comparison of methods to correct for misclassification bias from administrative database diagnostic codes. Int J Epidemiol. 2018;47(2):605–16.CrossRefPubMed van Walraven C. A comparison of methods to correct for misclassification bias from administrative database diagnostic codes. Int J Epidemiol. 2018;47(2):605–16.CrossRefPubMed
Metadata
Title
Development of the multivariate administrative data cystectomy model and its impact on misclassification bias
Authors
James Ross
Luke T. Lavallee
Duane Hickling
Carl van Walraven
Publication date
01-12-2024
Publisher
BioMed Central
Keyword
Cystectomy
Published in
BMC Medical Research Methodology / Issue 1/2024
Electronic ISSN: 1471-2288
DOI
https://doi.org/10.1186/s12874-024-02199-1

Other articles of this Issue 1/2024

BMC Medical Research Methodology 1/2024 Go to the issue