Skip to main content
Top

Open Access 25-04-2024 | Colorectal Cancer | Original Article

Development and Validation of a Colorectal Cancer Prediction Model: A Nationwide Cohort-Based Study

Authors: Ofer Isakov, Dan Riesel, Michael Leshchinsky, Galit Shaham, Ben Y. Reis, Dan Keret, Zohar Levi, Baruch Brener, Ran Balicer, Noa Dagan, Samah Hayek

Published in: Digestive Diseases and Sciences

Login to get access

Abstract

Background

Early diagnosis of colorectal cancer (CRC) is critical to increasing survival rates. Computerized risk prediction models hold great promise for identifying individuals at high risk for CRC. In order to utilize such models effectively in a population-wide screening setting, development and validation should be based on cohorts that are similar to the target population.

Aim

Establish a risk prediction model for CRC diagnosis based on electronic health records (EHR) from subjects eligible for CRC screening.

Methods

A retrospective cohort study utilizing the EHR data of Clalit Health Services (CHS). The study includes CHS members aged 50–74 who were eligible for CRC screening from January 2013 to January 2019. The model was trained to predict receiving a CRC diagnosis within 2 years of the index date. Approximately 20,000 EHR demographic and clinical features were considered.

Results

The study includes 2935 subjects with CRC diagnosis, and 1,133,457 subjects without CRC diagnosis. Incidence values of CRC among subjects in the top 1% risk scores were higher than baseline (2.3% vs 0.3%; lift 8.38; P value < 0.001). Cumulative event probabilities increased with higher model scores. Model-based risk stratification among subjects with a positive FOBT, identified subjects with more than twice the risk for CRC compared to FOBT alone.

Conclusions

We developed an individualized risk prediction model for CRC that can be utilized as a complementary decision support tool for healthcare providers to precisely identify subjects at high risk for CRC and refer them for confirmatory testing.
Appendix
Available only for authorised users
Literature
3.
go back to reference US Preventive Services Task Force, Davidson KW, Barry MJ, Mangione CM, Cabana M, Caughey AB et al. Screening for colorectal cancer: US Preventive Services Task Force recommendation statement. JAMA 2021;325:1965–77.CrossRef US Preventive Services Task Force, Davidson KW, Barry MJ, Mangione CM, Cabana M, Caughey AB et al. Screening for colorectal cancer: US Preventive Services Task Force recommendation statement. JAMA 2021;325:1965–77.CrossRef
4.
go back to reference Atkin WS, Edwards R, Kralj-Hans I, Wooldrage K, Hart AR, Northover JMA et al. Once-only flexible sigmoidoscopy screening in prevention of colorectal cancer: a multicentre randomised controlled trial. Lancet. 2010;375:1624–1633.CrossRefPubMed Atkin WS, Edwards R, Kralj-Hans I, Wooldrage K, Hart AR, Northover JMA et al. Once-only flexible sigmoidoscopy screening in prevention of colorectal cancer: a multicentre randomised controlled trial. Lancet. 2010;375:1624–1633.CrossRefPubMed
5.
go back to reference Bretthauer M, Løberg M, Wieszczy P, Kalager M, Emilsson L, Garborg K et al. Effect of colonoscopy screening on risks of colorectal cancer and related death. N Engl J Med 2022;387:1547–56.CrossRefPubMed Bretthauer M, Løberg M, Wieszczy P, Kalager M, Emilsson L, Garborg K et al. Effect of colonoscopy screening on risks of colorectal cancer and related death. N Engl J Med 2022;387:1547–56.CrossRefPubMed
6.
go back to reference Levin B, Lieberman DA, McFarland B, Smith RA, Brooks D, Andrews KS et al. Screening and Surveillance for the Early Detection of Colorectal Cancer and Adenomatous Polyps, 2008: A Joint Guideline from the American Cancer Society, the US Multi-Society Task Force on Colorectal Cancer, and the American College of Radiology. CA Cancer J Clin. 2008;58:130–60.CrossRefPubMed Levin B, Lieberman DA, McFarland B, Smith RA, Brooks D, Andrews KS et al. Screening and Surveillance for the Early Detection of Colorectal Cancer and Adenomatous Polyps, 2008: A Joint Guideline from the American Cancer Society, the US Multi-Society Task Force on Colorectal Cancer, and the American College of Radiology. CA Cancer J Clin. 2008;58:130–60.CrossRefPubMed
7.
go back to reference Fisher DA, Princic N, Miller-Wilson L-A, Wilson K, Fendrick AM, Limburg P. Utilization of a colorectal cancer screening test among individuals with average risk. JAMA Network Open. 2021;4:e2122269.CrossRefPubMedPubMedCentral Fisher DA, Princic N, Miller-Wilson L-A, Wilson K, Fendrick AM, Limburg P. Utilization of a colorectal cancer screening test among individuals with average risk. JAMA Network Open. 2021;4:e2122269.CrossRefPubMedPubMedCentral
8.
go back to reference Aleksandrova K, Reichmann R, Kaaks R, Jenab M, Bueno-de-Mesquita HB, Dahm CC et al. Development and validation of a lifestyle-based model for colorectal cancer risk prediction: the LiFeCRC score. BMC Med. 2021;19:1.CrossRefPubMedPubMedCentral Aleksandrova K, Reichmann R, Kaaks R, Jenab M, Bueno-de-Mesquita HB, Dahm CC et al. Development and validation of a lifestyle-based model for colorectal cancer risk prediction: the LiFeCRC score. BMC Med. 2021;19:1.CrossRefPubMedPubMedCentral
9.
go back to reference Kinar Y, Kalkstein N, Akiva P, Levin B, Half EE, Goldshtein I et al. Development and validation of a predictive model for detection of colorectal cancer in primary care by analysis of complete blood counts: a binational retrospective study. J Am Med Inform Assoc. 2016;23:879–890.CrossRefPubMedPubMedCentral Kinar Y, Kalkstein N, Akiva P, Levin B, Half EE, Goldshtein I et al. Development and validation of a predictive model for detection of colorectal cancer in primary care by analysis of complete blood counts: a binational retrospective study. J Am Med Inform Assoc. 2016;23:879–890.CrossRefPubMedPubMedCentral
10.
go back to reference Lee E, Jung SY, Hwang HJ, Jung J. Patient-level cancer prediction models from a nationwide patient cohort: model development and validation. JMIR Med Inform 2021;9:e29807-08.CrossRefPubMedPubMedCentral Lee E, Jung SY, Hwang HJ, Jung J. Patient-level cancer prediction models from a nationwide patient cohort: model development and validation. JMIR Med Inform 2021;9:e29807-08.CrossRefPubMedPubMedCentral
11.
go back to reference Xu W, Mesa-Eguiagaray I, Kirkpatrick T, Devlin J, Brogan S, Turner P et al. Development and validation of risk prediction models for colorectal cancer in patients with symptoms. J Pers Med 2023;13:1065.CrossRefPubMedPubMedCentral Xu W, Mesa-Eguiagaray I, Kirkpatrick T, Devlin J, Brogan S, Turner P et al. Development and validation of risk prediction models for colorectal cancer in patients with symptoms. J Pers Med 2023;13:1065.CrossRefPubMedPubMedCentral
12.
go back to reference Yang J, McDowell A, Kim EK, Seo H, Lee WH, Moon C-M et al. Development of a colorectal cancer diagnostic model and dietary risk assessment through gut microbiome analysis. Exp Mol Med. 2019;51:1–15.PubMedPubMedCentral Yang J, McDowell A, Kim EK, Seo H, Lee WH, Moon C-M et al. Development of a colorectal cancer diagnostic model and dietary risk assessment through gut microbiome analysis. Exp Mol Med. 2019;51:1–15.PubMedPubMedCentral
13.
go back to reference Liang H, Yang L, Tao L, Shi L, Yang W, Bai J et al. Data mining-based model and risk prediction of colorectal cancer by using secondary health data: a systematic review. Chin J Cancer Res. 2020;32:242–251.CrossRefPubMedPubMedCentral Liang H, Yang L, Tao L, Shi L, Yang W, Bai J et al. Data mining-based model and risk prediction of colorectal cancer by using secondary health data: a systematic review. Chin J Cancer Res. 2020;32:242–251.CrossRefPubMedPubMedCentral
14.
go back to reference Burnett B, Zhou S-M, Brophy S, Davies P, Ellis P, Kennedy J et al. Machine learning in colorectal cancer risk prediction from routinely collected data: a review. Diagnostics (Basel). 2023;13:301.CrossRefPubMedPubMedCentral Burnett B, Zhou S-M, Brophy S, Davies P, Ellis P, Kennedy J et al. Machine learning in colorectal cancer risk prediction from routinely collected data: a review. Diagnostics (Basel). 2023;13:301.CrossRefPubMedPubMedCentral
15.
go back to reference Dagan N, Barda N, Kepten E, Miron O, Perchik S, Katz MA et al. BNT162b2 mRNA covid-19 vaccine in a nationwide mass vaccination setting. New England Journal of Medicine 2021;384:1412–23.CrossRefPubMed Dagan N, Barda N, Kepten E, Miron O, Perchik S, Katz MA et al. BNT162b2 mRNA covid-19 vaccine in a nationwide mass vaccination setting. New England Journal of Medicine 2021;384:1412–23.CrossRefPubMed
16.
go back to reference Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. 2020;2:56–67.CrossRefPubMedPubMedCentral Lundberg SM, Erion G, Chen H, DeGrave A, Prutkin JM, Nair B et al. From local explanations to global understanding with explainable AI for trees. Nat Mach Intell. 2020;2:56–67.CrossRefPubMedPubMedCentral
17.
go back to reference Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O et al. Scikit-learn: machine learning in Python. Journal of Machine Learning Research. 2011;12:2825–2830. Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O et al. Scikit-learn: machine learning in Python. Journal of Machine Learning Research. 2011;12:2825–2830.
18.
go back to reference DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44:837–845.CrossRefPubMed DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44:837–845.CrossRefPubMed
19.
go back to reference Jones RM, Devers KJ, Kuzel AJ, Woolf SH. Patient-reported barriers to colorectal cancer screening: a mixed-methods analysis. Am J Prev Med. 2010;38:508–516.CrossRefPubMedPubMedCentral Jones RM, Devers KJ, Kuzel AJ, Woolf SH. Patient-reported barriers to colorectal cancer screening: a mixed-methods analysis. Am J Prev Med. 2010;38:508–516.CrossRefPubMedPubMedCentral
20.
go back to reference Sawicki T, Ruszkowska M, Danielewicz A, Niedźwiedzka E, Arłukowicz T, Przybyłowicz KE. A review of colorectal cancer in terms of epidemiology, risk factors, development, symptoms and diagnosis. Cancers (Basel). 2021;13:2025.CrossRefPubMedPubMedCentral Sawicki T, Ruszkowska M, Danielewicz A, Niedźwiedzka E, Arłukowicz T, Przybyłowicz KE. A review of colorectal cancer in terms of epidemiology, risk factors, development, symptoms and diagnosis. Cancers (Basel). 2021;13:2025.CrossRefPubMedPubMedCentral
21.
go back to reference He M, Fang Z, Hang D, Wang F, Polychronidis G, Wang L et al. Circulating liver function markers and colorectal cancer risk: A prospective cohort study in the UK Biobank. International Journal of Cancer 2021;148:1867.CrossRefPubMed He M, Fang Z, Hang D, Wang F, Polychronidis G, Wang L et al. Circulating liver function markers and colorectal cancer risk: A prospective cohort study in the UK Biobank. International Journal of Cancer 2021;148:1867.CrossRefPubMed
22.
go back to reference Vulcan A, Manjer J, Ohlsson B. High blood glucose levels are associated with higher risk of colon cancer in men: a cohort study. BMC Cancer. 2017;17:842.CrossRefPubMedPubMedCentral Vulcan A, Manjer J, Ohlsson B. High blood glucose levels are associated with higher risk of colon cancer in men: a cohort study. BMC Cancer. 2017;17:842.CrossRefPubMedPubMedCentral
23.
go back to reference Yang Z, Tang H, Lu S, Sun X, Rao B. Relationship between serum lipid level and colorectal cancer: a systemic review and meta-analysis. BMJ Open 2022;12:e052373.CrossRefPubMedPubMedCentral Yang Z, Tang H, Lu S, Sun X, Rao B. Relationship between serum lipid level and colorectal cancer: a systemic review and meta-analysis. BMJ Open 2022;12:e052373.CrossRefPubMedPubMedCentral
24.
go back to reference Ameen S, Wong M-C, Yee K-C, Turner P. AI and clinical decision making: the limitations and risks of computational reductionism in bowel cancer screening. Appl Sci. 2022;12:3341–45.CrossRef Ameen S, Wong M-C, Yee K-C, Turner P. AI and clinical decision making: the limitations and risks of computational reductionism in bowel cancer screening. Appl Sci. 2022;12:3341–45.CrossRef
Metadata
Title
Development and Validation of a Colorectal Cancer Prediction Model: A Nationwide Cohort-Based Study
Authors
Ofer Isakov
Dan Riesel
Michael Leshchinsky
Galit Shaham
Ben Y. Reis
Dan Keret
Zohar Levi
Baruch Brener
Ran Balicer
Noa Dagan
Samah Hayek
Publication date
25-04-2024
Publisher
Springer US
Published in
Digestive Diseases and Sciences
Print ISSN: 0163-2116
Electronic ISSN: 1573-2568
DOI
https://doi.org/10.1007/s10620-024-08427-4
Obesity Clinical Trial Summary

At a glance: The STEP trials

A round-up of the STEP phase 3 clinical trials evaluating semaglutide for weight loss in people with overweight or obesity.

Developed by: Springer Medicine

Highlights from the ACC 2024 Congress

Year in Review: Pediatric cardiology

Watch Dr. Anne Marie Valente present the last year's highlights in pediatric and congenital heart disease in the official ACC.24 Year in Review session.

Year in Review: Pulmonary vascular disease

The last year's highlights in pulmonary vascular disease are presented by Dr. Jane Leopold in this official video from ACC.24.

Year in Review: Valvular heart disease

Watch Prof. William Zoghbi present the last year's highlights in valvular heart disease from the official ACC.24 Year in Review session.

Year in Review: Heart failure and cardiomyopathies

Watch this official video from ACC.24. Dr. Biykem Bozkurt discuss last year's major advances in heart failure and cardiomyopathies.