Skip to main content
Top
Published in: BMC Medical Informatics and Decision Making 1/2023

Open Access 01-12-2023 | Metastasis | Research

The prediction of distant metastasis risk for male breast cancer patients based on an interpretable machine learning model

Authors: Xuhai Zhao, Cong Jiang

Published in: BMC Medical Informatics and Decision Making | Issue 1/2023

Login to get access

Abstract

Objectives

This research was designed to compare the ability of different machine learning (ML) models and nomogram to predict distant metastasis in male breast cancer (MBC) patients and to interpret the optimal ML model by SHapley Additive exPlanations (SHAP) framework.

Methods

Four powerful ML models were developed using data from male breast cancer (MBC) patients in the SEER database between 2010 and 2015 and MBC patients from our hospital between 2010 and 2020. The area under curve (AUC) and Brier score were used to assess the capacity of different models. The Delong test was applied to compare the performance of the models. Univariable and multivariable analysis were conducted using logistic regression.

Results

Of 2351 patients were analyzed; 168 (7.1%) had distant metastasis (M1); 117 (5.0%) had bone metastasis, and 71 (3.0%) had lung metastasis. The median age at diagnosis is 68.0 years old. Most patients did not receive radiotherapy (1723, 73.3%) or chemotherapy (1447, 61.5%). The XGB model was the best ML model for predicting M1 in MBC patients. It showed the largest AUC value in the tenfold cross validation (AUC:0.884; SD:0.02), training (AUC:0.907; 95% CI: 0.899—0.917), testing (AUC:0.827; 95% CI: 0.802—0.857) and external validation (AUC:0.754; 95% CI: 0.739—0.771) sets. It also showed powerful ability in the prediction of bone metastasis (AUC: 0.880, 95% CI: 0.856—0.903 in the training set; AUC: 0.823, 95% CI:0.790—0.848 in the test set; AUC: 0.747, 95% CI: 0.727—0.764 in the external validation set) and lung metastasis (AUC: 0.906, 95% CI: 0.877—0.928 in training set; AUC: 0.859, 95% CI: 0.816—0.891 in the test set; AUC: 0.756, 95% CI: 0.732—0.777 in the external validation set). The AUC value of the XGB model was larger than that of nomogram in the training (0.907 vs 0.802) and external validation (0.754 vs 0.706) sets.

Conclusions

The XGB model is a better predictor of distant metastasis among MBC patients than other ML models and nomogram; furthermore, the XGB model is a powerful model for predicting bone and lung metastasis. Combining with SHAP values, it could help doctors intuitively understand the impact of each variable on outcome.
Appendix
Available only for authorised users
Literature
1.
go back to reference Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer statistics, 2022. CA Cancer J Clin. 2022;72:7–33.CrossRefPubMed Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer statistics, 2022. CA Cancer J Clin. 2022;72:7–33.CrossRefPubMed
2.
go back to reference Abdelwahab Yousef AJ. Male breast cancer: epidemiology and risk factors. Semin Oncol. 2017;44:267–72.CrossRefPubMed Abdelwahab Yousef AJ. Male breast cancer: epidemiology and risk factors. Semin Oncol. 2017;44:267–72.CrossRefPubMed
3.
go back to reference DeSantis CE, Ma J, Gaudet MM, Newman LA, Miller KD, Goding Sauer A, Jemal A, Siegel RL. Breast cancer statistics, 2019. CA Cancer J Clin. 2019;69:438–51.CrossRefPubMed DeSantis CE, Ma J, Gaudet MM, Newman LA, Miller KD, Goding Sauer A, Jemal A, Siegel RL. Breast cancer statistics, 2019. CA Cancer J Clin. 2019;69:438–51.CrossRefPubMed
4.
go back to reference Liu N, Johnson KJ, Ma CX. Male breast cancer: an updated surveillance, epidemiology, and end results data analysis. Clin Breast Cancer. 2018;18:e997–1002.CrossRefPubMed Liu N, Johnson KJ, Ma CX. Male breast cancer: an updated surveillance, epidemiology, and end results data analysis. Clin Breast Cancer. 2018;18:e997–1002.CrossRefPubMed
5.
go back to reference Anderson WF, Jatoi I, Tse J, Rosenberg PS. Male breast cancer: a population-based comparison with female breast cancer. J Clin Oncol. 2010;28:232–9.CrossRefPubMed Anderson WF, Jatoi I, Tse J, Rosenberg PS. Male breast cancer: a population-based comparison with female breast cancer. J Clin Oncol. 2010;28:232–9.CrossRefPubMed
6.
go back to reference Kwong A, Chau WW, Mang OW, Wong CH, Suen DT, Leung R, Wong K, Lee A, Shea C, Morse E, et al. Male breast cancer: a population-based comparison with female breast cancer in Hong Kong, Southern China: 1997–2006. Ann Surg Oncol. 2014;21:1246–53.CrossRefPubMed Kwong A, Chau WW, Mang OW, Wong CH, Suen DT, Leung R, Wong K, Lee A, Shea C, Morse E, et al. Male breast cancer: a population-based comparison with female breast cancer in Hong Kong, Southern China: 1997–2006. Ann Surg Oncol. 2014;21:1246–53.CrossRefPubMed
7.
go back to reference Miao H, Verkooijen HM, Chia KS, Bouchardy C, Pukkala E, Laronningen S, Mellemkjaer L, Czene K, Hartman M. Incidence and outcome of male breast cancer: an international population-based study. J Clin Oncol. 2011;29:4381–6.CrossRefPubMed Miao H, Verkooijen HM, Chia KS, Bouchardy C, Pukkala E, Laronningen S, Mellemkjaer L, Czene K, Hartman M. Incidence and outcome of male breast cancer: an international population-based study. J Clin Oncol. 2011;29:4381–6.CrossRefPubMed
8.
go back to reference Xie J, Ying YY, Xu B, Li Y, Zhang X, Li C. Metastasis pattern and prognosis of male breast cancer patients in US: a population-based study from SEER database. Ther Adv Med Oncol. 2019;11:1758835919889003.CrossRefPubMedPubMedCentral Xie J, Ying YY, Xu B, Li Y, Zhang X, Li C. Metastasis pattern and prognosis of male breast cancer patients in US: a population-based study from SEER database. Ther Adv Med Oncol. 2019;11:1758835919889003.CrossRefPubMedPubMedCentral
9.
go back to reference Scomersi S, Giudici F, Cacciatore G, Losurdo P, Fracon S, Cortinovis S, Ceccherini R, Zanconati F, Tonutti M, Bortul M. Comparison between male and female breast cancer survival using propensity score matching analysis. Sci Rep. 2021;11:11639.CrossRefPubMedPubMedCentral Scomersi S, Giudici F, Cacciatore G, Losurdo P, Fracon S, Cortinovis S, Ceccherini R, Zanconati F, Tonutti M, Bortul M. Comparison between male and female breast cancer survival using propensity score matching analysis. Sci Rep. 2021;11:11639.CrossRefPubMedPubMedCentral
10.
go back to reference Foerster R, Foerster FG, Wulff V, Schubotz B, Baaske D, Wolfgarten M, Kuhn WC, Rudlowski C. Matched-pair analysis of patients with female and male breast cancer: a comparative analysis. BMC Cancer. 2011;11:335.CrossRefPubMedPubMedCentral Foerster R, Foerster FG, Wulff V, Schubotz B, Baaske D, Wolfgarten M, Kuhn WC, Rudlowski C. Matched-pair analysis of patients with female and male breast cancer: a comparative analysis. BMC Cancer. 2011;11:335.CrossRefPubMedPubMedCentral
11.
go back to reference Sekeroglu B, Tuncal K. Prediction of cancer incidence rates for the European continent using machine learning models. Health Informatics J. 2021;27:1460458220983878.CrossRefPubMed Sekeroglu B, Tuncal K. Prediction of cancer incidence rates for the European continent using machine learning models. Health Informatics J. 2021;27:1460458220983878.CrossRefPubMed
12.
go back to reference Saba T. Recent advancement in cancer detection using machine learning: systematic survey of decades, comparisons and challenges. J Infect Public Health. 2020;13:1274–89.CrossRefPubMed Saba T. Recent advancement in cancer detection using machine learning: systematic survey of decades, comparisons and challenges. J Infect Public Health. 2020;13:1274–89.CrossRefPubMed
13.
go back to reference Montazeri M, Montazeri M, Montazeri M, Beigzadeh A. Machine learning models in breast cancer survival prediction. Technol Health Care. 2016;24:31–42.CrossRefPubMed Montazeri M, Montazeri M, Montazeri M, Beigzadeh A. Machine learning models in breast cancer survival prediction. Technol Health Care. 2016;24:31–42.CrossRefPubMed
14.
go back to reference Ji L, Zhang W, Huang J, Tian J, Zhong X, Luo J, Zhu S, He Z, Tong Y, Meng X, et al. Bone metastasis risk and prognosis assessment models for kidney cancer based on machine learning. Front Public Health. 2022;10:1015952.CrossRefPubMedPubMedCentral Ji L, Zhang W, Huang J, Tian J, Zhong X, Luo J, Zhu S, He Z, Tong Y, Meng X, et al. Bone metastasis risk and prognosis assessment models for kidney cancer based on machine learning. Front Public Health. 2022;10:1015952.CrossRefPubMedPubMedCentral
15.
go back to reference Lundberg S, Lee SI: A Unified Approach to Interpreting Model Predictions. In: Nips: 2017. p. 2017. Lundberg S, Lee SI: A Unified Approach to Interpreting Model Predictions. In: Nips: 2017. p. 2017.
16.
go back to reference AlJame M, Ahmad I, Imtiaz A, Mohammed A. Ensemble learning model for diagnosing COVID-19 from routine blood tests. Inform Med Unlocked. 2020;21:100449.CrossRefPubMedPubMedCentral AlJame M, Ahmad I, Imtiaz A, Mohammed A. Ensemble learning model for diagnosing COVID-19 from routine blood tests. Inform Med Unlocked. 2020;21:100449.CrossRefPubMedPubMedCentral
18.
19.
go back to reference Kamarudin AN, Cox T, Kolamunnage-Dona R. Time-dependent ROC curve analysis in medical research: current methods and applications. BMC Med Res Methodol. 2017;17:53.CrossRefPubMedPubMedCentral Kamarudin AN, Cox T, Kolamunnage-Dona R. Time-dependent ROC curve analysis in medical research: current methods and applications. BMC Med Res Methodol. 2017;17:53.CrossRefPubMedPubMedCentral
21.
go back to reference Zou Y, Shi Y, Sun F, Liu J, Guo Y, Zhang H, Lu X, Gong Y, Xia S. Extreme gradient boosting model to assess risk of central cervical lymph node metastasis in patients with papillary thyroid carcinoma: Individual prediction using SHapley Additive exPlanations. Comput Methods Programs Biomed. 2022;225:107038.CrossRefPubMed Zou Y, Shi Y, Sun F, Liu J, Guo Y, Zhang H, Lu X, Gong Y, Xia S. Extreme gradient boosting model to assess risk of central cervical lymph node metastasis in patients with papillary thyroid carcinoma: Individual prediction using SHapley Additive exPlanations. Comput Methods Programs Biomed. 2022;225:107038.CrossRefPubMed
22.
go back to reference Zhang G, Shi Y, Yin P, Liu F, Fang Y, Li X, Zhang Q, Zhang Z. A machine learning model based on ultrasound image features to assess the risk of sentinel lymph node metastasis in breast cancer patients: applications of scikit-learn and SHAP. Front Oncol. 2022;12:944569.CrossRefPubMedPubMedCentral Zhang G, Shi Y, Yin P, Liu F, Fang Y, Li X, Zhang Q, Zhang Z. A machine learning model based on ultrasound image features to assess the risk of sentinel lymph node metastasis in breast cancer patients: applications of scikit-learn and SHAP. Front Oncol. 2022;12:944569.CrossRefPubMedPubMedCentral
23.
go back to reference Jiang C, Xiu Y, Qiao K, Yu X, Zhang S, Huang Y. Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework. Front Oncol. 2022;12:981059.CrossRefPubMedPubMedCentral Jiang C, Xiu Y, Qiao K, Yu X, Zhang S, Huang Y. Prediction of lymph node metastasis in patients with breast invasive micropapillary carcinoma based on machine learning and SHapley Additive exPlanations framework. Front Oncol. 2022;12:981059.CrossRefPubMedPubMedCentral
25.
go back to reference Cardoso F, Bartlett JMS, Slaets L, van Deurzen CHM, van Leeuwen-Stok E, Porter P, Linderholm B, Hedenfalk I, Schroder C, Martens J, et al. Characterization of male breast cancer: results of the EORTC 10085/TBCRC/BIG/NABCG International Male Breast Cancer Program. Ann Oncol. 2018;29:405–17.CrossRefPubMed Cardoso F, Bartlett JMS, Slaets L, van Deurzen CHM, van Leeuwen-Stok E, Porter P, Linderholm B, Hedenfalk I, Schroder C, Martens J, et al. Characterization of male breast cancer: results of the EORTC 10085/TBCRC/BIG/NABCG International Male Breast Cancer Program. Ann Oncol. 2018;29:405–17.CrossRefPubMed
26.
go back to reference Wang F, Shu X, Meszoely I, Pal T, Mayer IA, Yu Z, Zheng W, Bailey CE, Shu XO. Overall mortality after diagnosis of breast cancer in men vs women. JAMA Oncol. 2019;5:1589–96.CrossRefPubMedPubMedCentral Wang F, Shu X, Meszoely I, Pal T, Mayer IA, Yu Z, Zheng W, Bailey CE, Shu XO. Overall mortality after diagnosis of breast cancer in men vs women. JAMA Oncol. 2019;5:1589–96.CrossRefPubMedPubMedCentral
27.
go back to reference Giordano SH. A review of the diagnosis and management of male breast cancer. Oncologist. 2005;10:471–9.CrossRefPubMed Giordano SH. A review of the diagnosis and management of male breast cancer. Oncologist. 2005;10:471–9.CrossRefPubMed
28.
go back to reference Vermeulen MA, Slaets L, Cardoso F, Giordano SH, Tryfonidis K, van Diest PJ, Dijkstra NH, Schroder CP, van Asperen CJ, Linderholm B, et al. Pathological characterisation of male breast cancer: Results of the EORTC 10085/TBCRC/BIG/NABCG International Male Breast Cancer Program. Eur J Cancer. 2017;82:219–27.CrossRefPubMed Vermeulen MA, Slaets L, Cardoso F, Giordano SH, Tryfonidis K, van Diest PJ, Dijkstra NH, Schroder CP, van Asperen CJ, Linderholm B, et al. Pathological characterisation of male breast cancer: Results of the EORTC 10085/TBCRC/BIG/NABCG International Male Breast Cancer Program. Eur J Cancer. 2017;82:219–27.CrossRefPubMed
29.
go back to reference Piscuoglio S, Ng CK, Murray MP, Guerini-Rocco E, Martelotto LG, Geyer FC, Bidard FC, Berman S, Fusco N, Sakr RA, et al. The genomic landscape of male breast cancers. Clin Cancer Res. 2016;22:4045–56.CrossRefPubMedPubMedCentral Piscuoglio S, Ng CK, Murray MP, Guerini-Rocco E, Martelotto LG, Geyer FC, Bidard FC, Berman S, Fusco N, Sakr RA, et al. The genomic landscape of male breast cancers. Clin Cancer Res. 2016;22:4045–56.CrossRefPubMedPubMedCentral
30.
go back to reference Humphries MP, Sundara Rajan S, Honarpisheh H, Cserni G, Dent J, Fulford L, Jordan LB, Jones JL, Kanthan R, Litwiniuk M, et al. Characterisation of male breast cancer: a descriptive biomarker study from a large patient series. Sci Rep. 2017;7:45293.CrossRefPubMedPubMedCentral Humphries MP, Sundara Rajan S, Honarpisheh H, Cserni G, Dent J, Fulford L, Jordan LB, Jones JL, Kanthan R, Litwiniuk M, et al. Characterisation of male breast cancer: a descriptive biomarker study from a large patient series. Sci Rep. 2017;7:45293.CrossRefPubMedPubMedCentral
31.
go back to reference Hassett MJ, Somerfield MR, Baker ER, Cardoso F, Kansal KJ, Kwait DC, Plichta JK, Ricker C, Roshal A, Ruddy KJ, et al. Management of male breast cancer: ASCO Guideline. J Clin Oncol. 2020;38:1849–63.CrossRefPubMed Hassett MJ, Somerfield MR, Baker ER, Cardoso F, Kansal KJ, Kwait DC, Plichta JK, Ricker C, Roshal A, Ruddy KJ, et al. Management of male breast cancer: ASCO Guideline. J Clin Oncol. 2020;38:1849–63.CrossRefPubMed
32.
go back to reference Gradishar WJ, Moran MS, Abraham J, Aft R, Agnese D, Allison KH, Blair SL, Burstein HJ, Dang C, Elias AD, et al. NCCN Guidelines(R) Insights: Breast Cancer, Version 4.2021. J Natl Compr Canc Netw. 2021;19:484–93.CrossRefPubMed Gradishar WJ, Moran MS, Abraham J, Aft R, Agnese D, Allison KH, Blair SL, Burstein HJ, Dang C, Elias AD, et al. NCCN Guidelines(R) Insights: Breast Cancer, Version 4.2021. J Natl Compr Canc Netw. 2021;19:484–93.CrossRefPubMed
33.
go back to reference Auvinen A, Curtis RE, Ron E. Risk of subsequent cancer following breast cancer in men. J Natl Cancer Inst. 2002;94:1330–2.CrossRefPubMed Auvinen A, Curtis RE, Ron E. Risk of subsequent cancer following breast cancer in men. J Natl Cancer Inst. 2002;94:1330–2.CrossRefPubMed
35.
go back to reference Tahmassebi A, Wengert GJ, Helbich TH, Bago-Horvath Z, Alaei S, Bartsch R, Dubsky P, Baltzer P, Clauser P, Kapetas P, et al. Impact of machine learning with multiparametric magnetic resonance imaging of the breast for early prediction of response to neoadjuvant chemotherapy and survival outcomes in breast cancer patients. Invest Radiol. 2019;54:110–7.CrossRefPubMedPubMedCentral Tahmassebi A, Wengert GJ, Helbich TH, Bago-Horvath Z, Alaei S, Bartsch R, Dubsky P, Baltzer P, Clauser P, Kapetas P, et al. Impact of machine learning with multiparametric magnetic resonance imaging of the breast for early prediction of response to neoadjuvant chemotherapy and survival outcomes in breast cancer patients. Invest Radiol. 2019;54:110–7.CrossRefPubMedPubMedCentral
36.
go back to reference Li J, Zhou Z, Dong J, Fu Y, Li Y, Luan Z, Peng X. Predicting breast cancer 5-year survival using machine learning: a systematic review. PLoS One. 2021;16:e0250370.CrossRefPubMedPubMedCentral Li J, Zhou Z, Dong J, Fu Y, Li Y, Luan Z, Peng X. Predicting breast cancer 5-year survival using machine learning: a systematic review. PLoS One. 2021;16:e0250370.CrossRefPubMedPubMedCentral
37.
go back to reference Zhou CM, Xue Q, Wang Y, Tong J, Ji M, Yang JJ. Machine learning to predict the cancer-specific mortality of patients with primary non-metastatic invasive breast cancer. Surg Today. 2021;51:756–63.CrossRefPubMed Zhou CM, Xue Q, Wang Y, Tong J, Ji M, Yang JJ. Machine learning to predict the cancer-specific mortality of patients with primary non-metastatic invasive breast cancer. Surg Today. 2021;51:756–63.CrossRefPubMed
38.
go back to reference Lo Gullo R, Eskreis-Winkler S, Morris EA, Pinker K. Machine learning with multiparametric magnetic resonance imaging of the breast for early prediction of response to neoadjuvant chemotherapy. Breast. 2020;49:115–22.CrossRefPubMed Lo Gullo R, Eskreis-Winkler S, Morris EA, Pinker K. Machine learning with multiparametric magnetic resonance imaging of the breast for early prediction of response to neoadjuvant chemotherapy. Breast. 2020;49:115–22.CrossRefPubMed
39.
go back to reference Wang D, Yang L, Yang Y, Chen M, Yang H. Nomogram for predicting distant metastasis of male breast cancer: a SEER population-based study. Medicine (Baltimore). 2022;101:e30978.CrossRefPubMed Wang D, Yang L, Yang Y, Chen M, Yang H. Nomogram for predicting distant metastasis of male breast cancer: a SEER population-based study. Medicine (Baltimore). 2022;101:e30978.CrossRefPubMed
Metadata
Title
The prediction of distant metastasis risk for male breast cancer patients based on an interpretable machine learning model
Authors
Xuhai Zhao
Cong Jiang
Publication date
01-12-2023
Publisher
BioMed Central
Published in
BMC Medical Informatics and Decision Making / Issue 1/2023
Electronic ISSN: 1472-6947
DOI
https://doi.org/10.1186/s12911-023-02166-8

Other articles of this Issue 1/2023

BMC Medical Informatics and Decision Making 1/2023 Go to the issue