Skip to main content
Top
Published in: BMC Cancer 1/2024

Open Access 01-12-2024 | Thyroid Cancer | Research

Development and validation of prediction models for papillary thyroid cancer structural recurrence using machine learning approaches

Authors: Hongxi Wang, Chao Zhang, Qianrui Li, Tian Tian, Rui Huang, Jiajun Qiu, Rong Tian

Published in: BMC Cancer | Issue 1/2024

Login to get access

Abstract

Background

Although papillary thyroid cancer (PTC) patients are known to have an excellent prognosis, up to 30% of patients experience disease recurrence after initial treatment. Accurately predicting disease prognosis remains a challenge given that the predictive value of several predictors remains controversial. Thus, we investigated whether machine learning (ML) approaches based on comprehensive predictors can predict the risk of structural recurrence for PTC patients.

Methods

A total of 2244 patients treated with thyroid surgery and radioiodine were included. Twenty-nine perioperative variables consisting of four dimensions (demographic characteristics and comorbidities, tumor-related variables, lymph node (LN)-related variables, and metabolic and inflammatory markers) were analyzed. We applied five ML algorithms—logistic regression (LR), support vector machine (SVM), extreme gradient boosting (XGBoost), random forest (RF), and neural network (NN)—to develop the models. The area under the receiver operating characteristic (AUC-ROC) curve, calibration curve, and variable importance were used to evaluate the models’ performance.

Results

During a median follow-up of 45.5 months, 179 patients (8.0%) experienced structural recurrence. The non-stimulated thyroglobulin, LN dissection, number of LNs dissected, lymph node metastasis ratio, N stage, comorbidity of hypertension, comorbidity of diabetes, body mass index, and low-density lipoprotein were used to develop the models. All models showed a greater AUC (AUC = 0.738 to 0.767) than did the ATA risk stratification (AUC = 0.620, DeLong test: P < 0.01). The SVM, XGBoost, and RF model showed greater sensitivity (0.568, 0.595, 0.676), specificity (0.903, 0.857, 0.784), accuracy (0.875, 0.835, 0.775), positive predictive value (PPV) (0.344, 0.272, 0.219), negative predictive value (NPV) (0.959, 0.959, 0.964), and F1 score (0.429, 0.373, 0.331) than did the ATA risk stratification (sensitivity = 0.432, specificity = 0.770, accuracy = 0.742, PPV = 0.144, NPV = 0.938, F1 score = 0.216). The RF model had generally consistent calibration compared with the other models. The Tg and the LNR were the top 2 important variables in all the models, the N stage was the top 5 important variables in all the models.

Conclusions

The RF model achieved the expected prediction performance with generally good discrimination, calibration and interpretability in this study. This study sheds light on the potential of ML approaches for improving the accuracy of risk stratification for PTC patients.

Trial registration

Retrospectively registered at www.​chictr.​org.​cn (trial registration number: ChiCTR2300075574, date of registration: 2023-09-08).
Appendix
Available only for authorised users
Literature
1.
go back to reference Chen DW, et al. Disparities in risk perception of thyroid cancer recurrence and death. Cancer. 2020;126(7):1512–21.CrossRefPubMed Chen DW, et al. Disparities in risk perception of thyroid cancer recurrence and death. Cancer. 2020;126(7):1512–21.CrossRefPubMed
2.
go back to reference Cooper DS, et al. Revised American Thyroid Association management guidelines for patients with thyroid nodules and differentiated thyroid cancer. Thyroid. 2009;19(11):1167–214.CrossRefPubMed Cooper DS, et al. Revised American Thyroid Association management guidelines for patients with thyroid nodules and differentiated thyroid cancer. Thyroid. 2009;19(11):1167–214.CrossRefPubMed
3.
go back to reference Haugen BR et al. 2015 American Thyroid Association Management Guidelines for Adult Patients with Thyroid Nodules and Differentiated Thyroid Cancer: The American Thyroid Association Guidelines Task Force on Thyroid Nodules and Differentiated Thyroid Cancer. Thyroid, 2016. 26(1): p. 1-133. Haugen BR et al. 2015 American Thyroid Association Management Guidelines for Adult Patients with Thyroid Nodules and Differentiated Thyroid Cancer: The American Thyroid Association Guidelines Task Force on Thyroid Nodules and Differentiated Thyroid Cancer. Thyroid, 2016. 26(1): p. 1-133.
4.
go back to reference Parvathareddy SK et al. Lymph node ratio is superior to AJCC N stage for predicting recurrence in papillary thyroid carcinoma. Endocr Connect, 2022. 11(2). Parvathareddy SK et al. Lymph node ratio is superior to AJCC N stage for predicting recurrence in papillary thyroid carcinoma. Endocr Connect, 2022. 11(2).
5.
go back to reference Ito Y, et al. Prognostic factors for recurrence of papillary thyroid carcinoma in the lymph nodes, lung, and bone: analysis of 5,768 patients with average 10-year follow-up. World J Surg. 2012;36(6):1274–8.CrossRefPubMed Ito Y, et al. Prognostic factors for recurrence of papillary thyroid carcinoma in the lymph nodes, lung, and bone: analysis of 5,768 patients with average 10-year follow-up. World J Surg. 2012;36(6):1274–8.CrossRefPubMed
6.
go back to reference Lee J, et al. Clinical Value of Lymph Node Ratio Integration with the 8(th) Edition of the UICC TNM classification and 2015 ATA risk Stratification systems for recurrence prediction in papillary thyroid Cancer. Sci Rep. 2019;9(1):13361.CrossRefPubMedPubMedCentral Lee J, et al. Clinical Value of Lymph Node Ratio Integration with the 8(th) Edition of the UICC TNM classification and 2015 ATA risk Stratification systems for recurrence prediction in papillary thyroid Cancer. Sci Rep. 2019;9(1):13361.CrossRefPubMedPubMedCentral
7.
go back to reference Zhou C, Duan D, Liu S. Predictive value of a Prognostic Model based on lymphocyte-to-monocyte ratio before Radioiodine Therapy for recurrence of papillary thyroid carcinoma. Technol Cancer Res Treat. 2021;20:15330338211027910.CrossRefPubMedPubMedCentral Zhou C, Duan D, Liu S. Predictive value of a Prognostic Model based on lymphocyte-to-monocyte ratio before Radioiodine Therapy for recurrence of papillary thyroid carcinoma. Technol Cancer Res Treat. 2021;20:15330338211027910.CrossRefPubMedPubMedCentral
8.
go back to reference Offi C, et al. Clinical significance of neutrophil-to-lymphocyte ratio, lymphocyte-to-monocyte ratio, platelet-to-lymphocyte ratio and prognostic nutritional index in low-risk differentiated thyroid carcinoma. Acta Otorhinolaryngol Ital. 2021;41(1):31–8.CrossRefPubMedPubMedCentral Offi C, et al. Clinical significance of neutrophil-to-lymphocyte ratio, lymphocyte-to-monocyte ratio, platelet-to-lymphocyte ratio and prognostic nutritional index in low-risk differentiated thyroid carcinoma. Acta Otorhinolaryngol Ital. 2021;41(1):31–8.CrossRefPubMedPubMedCentral
9.
go back to reference Chen W, et al. Association of the preoperative inflammation-based scores with TNM Stage and recurrence in patients with papillary thyroid carcinoma: a Retrospective, Multicenter Analysis. Cancer Manag Res. 2020;12:1809–18.CrossRefPubMedPubMedCentral Chen W, et al. Association of the preoperative inflammation-based scores with TNM Stage and recurrence in patients with papillary thyroid carcinoma: a Retrospective, Multicenter Analysis. Cancer Manag Res. 2020;12:1809–18.CrossRefPubMedPubMedCentral
10.
go back to reference Rahman ST, et al. Obesity is Associated with BRAF(V600E)-Mutated thyroid Cancer. Thyroid. 2020;30(10):1518–27.CrossRefPubMed Rahman ST, et al. Obesity is Associated with BRAF(V600E)-Mutated thyroid Cancer. Thyroid. 2020;30(10):1518–27.CrossRefPubMed
11.
go back to reference Li CL, et al. Influence of body mass index on the clinicopathological features of 13,995 papillary thyroid tumors. J Endocrinol Invest. 2020;43(9):1283–99.CrossRefPubMed Li CL, et al. Influence of body mass index on the clinicopathological features of 13,995 papillary thyroid tumors. J Endocrinol Invest. 2020;43(9):1283–99.CrossRefPubMed
12.
go back to reference Alba AC, et al. Discrimination and calibration of clinical prediction models: users’ guides to the Medical Literature. JAMA. 2017;318(14):1377–84.CrossRefPubMed Alba AC, et al. Discrimination and calibration of clinical prediction models: users’ guides to the Medical Literature. JAMA. 2017;318(14):1377–84.CrossRefPubMed
13.
go back to reference Kong G, Lin K, Hu Y. Using machine learning methods to predict in-hospital mortality of sepsis patients in the ICU. BMC Med Inf Decis Mak. 2020;20(1):251.CrossRef Kong G, Lin K, Hu Y. Using machine learning methods to predict in-hospital mortality of sepsis patients in the ICU. BMC Med Inf Decis Mak. 2020;20(1):251.CrossRef
14.
15.
go back to reference Tong J, et al. Machine learning can predict total death after Radiofrequency ablation in Liver Cancer patients. Clin Med Insights Oncol. 2021;15:11795549211000017.CrossRefPubMedPubMedCentral Tong J, et al. Machine learning can predict total death after Radiofrequency ablation in Liver Cancer patients. Clin Med Insights Oncol. 2021;15:11795549211000017.CrossRefPubMedPubMedCentral
16.
go back to reference Liu WC, et al. Machine learning for the prediction of bone metastasis in patients with newly diagnosed thyroid cancer. Cancer Med. 2021;10(8):2802–11.CrossRefPubMedPubMedCentral Liu WC, et al. Machine learning for the prediction of bone metastasis in patients with newly diagnosed thyroid cancer. Cancer Med. 2021;10(8):2802–11.CrossRefPubMedPubMedCentral
17.
go back to reference Montazeri M, et al. Machine learning models in breast cancer survival prediction. Technol Health Care. 2016;24(1):31–42.CrossRefPubMed Montazeri M, et al. Machine learning models in breast cancer survival prediction. Technol Health Care. 2016;24(1):31–42.CrossRefPubMed
18.
go back to reference Jajroudi M, et al. Prediction of survival in thyroid cancer using data mining technique. Technol Cancer Res Treat. 2014;13(4):353–9.CrossRefPubMed Jajroudi M, et al. Prediction of survival in thyroid cancer using data mining technique. Technol Cancer Res Treat. 2014;13(4):353–9.CrossRefPubMed
19.
go back to reference Mourad M, et al. Thyroid Cancer Prognosis Sci Rep. 2020;10(1):5176. Machine Learning and Feature Selection Applied to SEER Data to Reliably Assess. Mourad M, et al. Thyroid Cancer Prognosis Sci Rep. 2020;10(1):5176. Machine Learning and Feature Selection Applied to SEER Data to Reliably Assess.
20.
go back to reference Banerjee M, Reyes-Gastelum D, Haymart MR. Treatment-free survival in patients with differentiated thyroid Cancer. J Clin Endocrinol Metab. 2018;103(7):2720–7.CrossRefPubMedPubMedCentral Banerjee M, Reyes-Gastelum D, Haymart MR. Treatment-free survival in patients with differentiated thyroid Cancer. J Clin Endocrinol Metab. 2018;103(7):2720–7.CrossRefPubMedPubMedCentral
21.
go back to reference Park YM, Lee B. Machine learning-based prediction model using clinico-pathologic factors for papillary thyroid carcinoma recurrence. Sci Rep, 2021. 11(1). Park YM, Lee B. Machine learning-based prediction model using clinico-pathologic factors for papillary thyroid carcinoma recurrence. Sci Rep, 2021. 11(1).
23.
go back to reference Borzooei S, et al. Machine learning for risk stratification of thyroid cancer patients: a 15-year cohort study. European Archives of Oto-Rhino-Laryngology; 2023. Borzooei S, et al. Machine learning for risk stratification of thyroid cancer patients: a 15-year cohort study. European Archives of Oto-Rhino-Laryngology; 2023.
24.
go back to reference Tuttle RM, Haugen B, Perrier ND. Updated American Joint Committee on Cancer/Tumor-Node-Metastasis Staging System for differentiated and anaplastic thyroid Cancer (Eighth Edition): what changed and why? Thyroid. 2017;27(6):751–6.CrossRefPubMedPubMedCentral Tuttle RM, Haugen B, Perrier ND. Updated American Joint Committee on Cancer/Tumor-Node-Metastasis Staging System for differentiated and anaplastic thyroid Cancer (Eighth Edition): what changed and why? Thyroid. 2017;27(6):751–6.CrossRefPubMedPubMedCentral
25.
go back to reference Tian T, et al. Prognostic implications of Preablation stimulated tg: a retrospective analysis of 2500 thyroid Cancer patients. J Clin Endocrinol Metabolism. 2021;106(11):e4688–97.CrossRef Tian T, et al. Prognostic implications of Preablation stimulated tg: a retrospective analysis of 2500 thyroid Cancer patients. J Clin Endocrinol Metabolism. 2021;106(11):e4688–97.CrossRef
26.
go back to reference Gravesteijn BY, et al. Missing Data in Prediction Research: a five-step Approach for multiple imputation, Illustrated in the CENTER-TBI Study. J Neurotrauma. 2021;38(13):1842–57.CrossRefPubMed Gravesteijn BY, et al. Missing Data in Prediction Research: a five-step Approach for multiple imputation, Illustrated in the CENTER-TBI Study. J Neurotrauma. 2021;38(13):1842–57.CrossRefPubMed
27.
go back to reference Lee JH, Huber JJ. Evaluation of multiple imputation with large proportions of Missing Data: how much is too much? Iran J Public Health. 2021;50(7):1372–80.PubMedPubMedCentral Lee JH, Huber JJ. Evaluation of multiple imputation with large proportions of Missing Data: how much is too much? Iran J Public Health. 2021;50(7):1372–80.PubMedPubMedCentral
29.
go back to reference Bennin KE, et al. An empirical study on the effectiveness of data resampling approaches for cross-project software defect prediction. IET Software. 2022;16(2):185–99.CrossRef Bennin KE, et al. An empirical study on the effectiveness of data resampling approaches for cross-project software defect prediction. IET Software. 2022;16(2):185–99.CrossRef
30.
go back to reference LASSO RR. A feature selection technique in predictive modeling for machine learning. in 2016 IEEE International Conference on Advances in Computer Applications (ICACA). 2016. LASSO RR. A feature selection technique in predictive modeling for machine learning. in 2016 IEEE International Conference on Advances in Computer Applications (ICACA). 2016.
31.
go back to reference Shmilovici A, Maimon O, Rokach L, Maimon O, Rokach^Editors L. 2005, Springer US: Boston, MA. 257–76. Shmilovici A, Maimon O, Rokach L, Maimon O, Rokach^Editors L. 2005, Springer US: Boston, MA. 257–76.
32.
go back to reference Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016. Chen T, Guestrin C. XGBoost: A Scalable Tree Boosting System. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 2016.
33.
go back to reference Liu Y, Wang Y, Zhang J. New Machine Learning Algorithm: Random Forest. Berlin, Heidelberg: Springer Berlin Heidelberg; 2012. Liu Y, Wang Y, Zhang J. New Machine Learning Algorithm: Random Forest. Berlin, Heidelberg: Springer Berlin Heidelberg; 2012.
34.
go back to reference Svozil D, Kvasnicka V, Pospichal J. Introduction to multi-layer feed-forward neural networks. Chemometr Intell Lab Syst. 1997;39(1):43–62.CrossRef Svozil D, Kvasnicka V, Pospichal J. Introduction to multi-layer feed-forward neural networks. Chemometr Intell Lab Syst. 1997;39(1):43–62.CrossRef
35.
go back to reference C., H., et al. Automatic Parameter Tuning using Bayesian Optimization Method. in 2019 IEEE Congress on Evolutionary Computation (CEC). 2019. C., H., et al. Automatic Parameter Tuning using Bayesian Optimization Method. in 2019 IEEE Congress on Evolutionary Computation (CEC). 2019.
36.
go back to reference DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44(3):837–45.CrossRefPubMed DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44(3):837–45.CrossRefPubMed
37.
go back to reference Barrett BJ, Fardy JM, Parfrey PS, Barrett BJ, Parfrey PS, Barrett^Editors BJ. 2021, Springer US: New York, NY. 319–33. Barrett BJ, Fardy JM, Parfrey PS, Barrett BJ, Parfrey PS, Barrett^Editors BJ. 2021, Springer US: New York, NY. 319–33.
38.
go back to reference Vaisman F, et al. Spontaneous remission in thyroid cancer patients after biochemical incomplete response to initial therapy. Clin Endocrinol (Oxf). 2012;77(1):132–8.CrossRefPubMed Vaisman F, et al. Spontaneous remission in thyroid cancer patients after biochemical incomplete response to initial therapy. Clin Endocrinol (Oxf). 2012;77(1):132–8.CrossRefPubMed
39.
go back to reference Jang S, et al. Recurrence risk evaluation in patients with papillary thyroid carcinoma: Multicenter Machine Learning evaluation of lymph node variables. Cancers. 2023;15(2):550.CrossRefPubMedPubMedCentral Jang S, et al. Recurrence risk evaluation in patients with papillary thyroid carcinoma: Multicenter Machine Learning evaluation of lymph node variables. Cancers. 2023;15(2):550.CrossRefPubMedPubMedCentral
40.
go back to reference Li LR, et al. Hypertension was Associated with higher tumor stages in papillary thyroid Cancer: a large sample single-center study. Metab Syndr Relat Disord. 2022;20(8):466–72.CrossRefPubMed Li LR, et al. Hypertension was Associated with higher tumor stages in papillary thyroid Cancer: a large sample single-center study. Metab Syndr Relat Disord. 2022;20(8):466–72.CrossRefPubMed
41.
go back to reference Shi P, et al. Clinicopathological features and prognosis of papillary thyroid cancer patients with type 2 diabetes mellitus. Gland Surg. 2022;11(2):358–68.CrossRefPubMedPubMedCentral Shi P, et al. Clinicopathological features and prognosis of papillary thyroid cancer patients with type 2 diabetes mellitus. Gland Surg. 2022;11(2):358–68.CrossRefPubMedPubMedCentral
42.
43.
go back to reference Tresallet C, et al. The incidence of papillary thyroid carcinoma and outcomes in operative patients according to their body mass indices. Surgery. 2014;156(5):1145–52.CrossRefPubMed Tresallet C, et al. The incidence of papillary thyroid carcinoma and outcomes in operative patients according to their body mass indices. Surgery. 2014;156(5):1145–52.CrossRefPubMed
44.
go back to reference Wessler BS, et al. Clinical prediction models for Cardiovascular Disease: Tufts Predictive Analytics and comparative effectiveness clinical prediction Model Database. Circ Cardiovasc Qual Outcomes. 2015;8(4):368–75.CrossRefPubMedPubMedCentral Wessler BS, et al. Clinical prediction models for Cardiovascular Disease: Tufts Predictive Analytics and comparative effectiveness clinical prediction Model Database. Circ Cardiovasc Qual Outcomes. 2015;8(4):368–75.CrossRefPubMedPubMedCentral
45.
46.
go back to reference Ritter A et al. Detecting recurrence following lobectomy for thyroid Cancer: role of Thyroglobulin and Thyroglobulin antibodies. J Clin Endocrinol Metab, 2020. 105(6). Ritter A et al. Detecting recurrence following lobectomy for thyroid Cancer: role of Thyroglobulin and Thyroglobulin antibodies. J Clin Endocrinol Metab, 2020. 105(6).
47.
go back to reference Morris C, Yang JJ. Effectiveness of resampling methods in coping with imbalanced crash data: Crash type analysis and predictive modeling. Accid Anal Prev. 2021;159:106240.CrossRefPubMed Morris C, Yang JJ. Effectiveness of resampling methods in coping with imbalanced crash data: Crash type analysis and predictive modeling. Accid Anal Prev. 2021;159:106240.CrossRefPubMed
Metadata
Title
Development and validation of prediction models for papillary thyroid cancer structural recurrence using machine learning approaches
Authors
Hongxi Wang
Chao Zhang
Qianrui Li
Tian Tian
Rui Huang
Jiajun Qiu
Rong Tian
Publication date
01-12-2024
Publisher
BioMed Central
Published in
BMC Cancer / Issue 1/2024
Electronic ISSN: 1471-2407
DOI
https://doi.org/10.1186/s12885-024-12146-4

Other articles of this Issue 1/2024

BMC Cancer 1/2024 Go to the issue
Webinar | 19-02-2024 | 17:30 (CET)

Keynote webinar | Spotlight on antibody–drug conjugates in cancer

Antibody–drug conjugates (ADCs) are novel agents that have shown promise across multiple tumor types. Explore the current landscape of ADCs in breast and lung cancer with our experts, and gain insights into the mechanism of action, key clinical trials data, existing challenges, and future directions.

Dr. Véronique Diéras
Prof. Fabrice Barlesi
Developed by: Springer Medicine