Top

Published in:

01-12-2020 | Prostate Cancer | Imaging Informatics and Artificial Intelligence

Machine learning for the identification of clinically significant prostate cancer on MRI: a meta-analysis

Authors: Renato Cuocolo, Maria Brunella Cipullo, Arnaldo Stanzione, Valeria Romeo, Roberta Green, Valeria Cantoni, Andrea Ponsiglione, Lorenzo Ugga, Massimo Imbriaco

Published in: European Radiology | Issue 12/2020

Abstract

Objectives

The aim of this study was to systematically review the literature and perform a meta-analysis of machine learning (ML) diagnostic accuracy studies focused on clinically significant prostate cancer (csPCa) identification on MRI.

Methods

Multiple medical databases were systematically searched for studies on ML applications in csPCa identification up to July 31, 2019. Two reviewers screened all papers independently for eligibility. The area under the receiver operating characteristic curves (AUC) was pooled to quantify predictive accuracy. A random-effects model estimated overall effect size while statistical heterogeneity was assessed with the I² value. A funnel plot was used to investigate publication bias. Subgroup analyses were performed based on reference standard (biopsy or radical prostatectomy) and ML type (deep and non-deep).

Results

After the final revision, 12 studies were included in the analysis. Statistical heterogeneity was high both in overall and in subgroup analyses. The overall pooled AUC for ML in csPCa identification was 0.86, with 0.81–0.91 95% confidence intervals (95%CI). The biopsy subgroup (n = 9) had a pooled AUC of 0.85 (95%CI = 0.79–0.91) while the radical prostatectomy one (n = 3) of 0.88 (95%CI = 0.76–0.99). Deep learning ML (n = 4) had a 0.78 AUC (95%CI = 0.69–0.86) while the remaining 8 had AUC = 0.90 (95%CI = 0.85–0.94).

Conclusions

ML pipelines using prostate MRI to identify csPCa showed good accuracy and should be further investigated, possibly with better standardisation in design and reporting of results.

Key Points

• Overall pooled AUC was 0.86 with 0.81–0.91 95% confidence intervals.

• In the reference standard subgroup analysis, algorithm accuracy was similar with pooled AUCs of 0.85 (0.79–0.91 95% confidence intervals) and 0.88 (0.76–0.99 95% confidence intervals) for studies employing biopsies and radical prostatectomy, respectively.

• Deep learning pipelines performed worse (AUC = 0.78, 0.69–0.86 95% confidence intervals) than other approaches (AUC = 0.90, 0.85–0.94 95% confidence intervals).

Available only for authorised users

(2019) EAU Guidelines. Edn. presented at the EAU Annual Congress Barcelona 2019. https://uroweb.org/guideline/prostate-cancer. Accessed 13 May 2019

Turkbey B, Rosenkrantz AB, Haider MA, et al (2019) Prostate Imaging Reporting and Data System Version 2.1: 2019 Update of prostate imaging reporting and data system version 2. Eur Urol 76:340–351. https://doi.org/10.1016/j.eururo.2019.02.033

Cuocolo R, Stanzione A, Ponsiglione A et al (2019) Prostate MRI technical parameters standardization: a systematic review on adherence to PI-RADSv2 acquisition protocol. Eur J Radiol 120:108662. https://doi.org/10.1016/j.ejrad.2019.108662CrossRef

Barkovich EJ, Shankar PR, Westphalen AC (2019) A systematic review of the existing prostate imaging reporting and data system version 2 (PI-RADSv2) literature and subset meta-analysis of PI-RADSv2 categories stratified by Gleason scores. AJR Am J Roentgenol 212:847–854. https://doi.org/10.2214/AJR.18.20571CrossRef

Greer MD, Shih JH, Lay N et al (2019) Interreader variability of prostate imaging reporting and data system version 2 in detecting and assessing prostate cancer lesions at prostate MRI. AJR Am J Roentgenol 212:1197–1205. https://doi.org/10.2214/AJR.18.20536CrossRef

Kasivisvanathan V, Rannikko AS, Borghi M et al (2018) MRI-targeted or standard biopsy for prostate-cancer diagnosis. N Engl J Med 378:1767–1777. https://doi.org/10.1056/NEJMoa1801993CrossRef

Cuocolo R, Cipullo MB, Stanzione A et al (2019) Machine learning applications in prostate cancer magnetic resonance imaging. Eur Radiol Exp 3:35. https://doi.org/10.1186/s41747-019-0109-2CrossRef

Afshar P, Mohammadi A, Plataniotis KN, Oikonomou A, Benali H (2019) From handcrafted to deep-learning-based cancer radiomics: challenges and opportunities. IEEE Signal Process Mag 36:132–160. https://doi.org/10.1109/MSP.2019.2900993

Zhang L, Tang M, Chen S, Lei X, Zhang X, Huan Y (2017) A meta-analysis of use of prostate imaging reporting and data system version 2 (PI-RADS V2) with multiparametric MR imaging for the detection of prostate cancer. Eur Radiol 27:5204–5214. https://doi.org/10.1007/s00330-017-4843-7

10.

Zhen L, Liu X, Yegang C et al (2019) Accuracy of multiparametric magnetic resonance imaging for diagnosing prostate cancer: a systematic review and meta-analysis. BMC Cancer 19:1244. https://doi.org/10.1186/s12885-019-6434-2CrossRef

11.

Whiting PF (2011) QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med 155:529. https://doi.org/10.7326/0003-4819-155-8-201110180-00009CrossRef

12.

Steyerberg EW, Vickers AJ, Cook NR et al (2010) Assessing the performance of prediction models. Epidemiology 21:128–138. https://doi.org/10.1097/EDE.0b013e3181c30fb2CrossRef

13.

Higgins JP, Thompson SG, Deeks JJ, Altman DG (2003) Measuring inconsistency in meta-analyses. BMJ 327:557–560. https://doi.org/10.1136/bmj.327.7414.557

14.

Cleophas TJ, Zwinderman AH (2007) Meta-Analysis. Circulation 115:2870–2875. https://doi.org/10.1161/CIRCULATIONAHA.105.594960CrossRef

15.

The Cochrane Collaboration. Cochrane Handbook for Systematic Reviews of Interventions version 6.0. www.training.cochrane.org/handbook. Accessed 19 Mar 2020

16.

DerSimonian R, Laird N (1986) Meta-analysis in clinical trials. Control Clin Trials 7:177–188. https://doi.org/10.1016/0197-2456(86)90046-2CrossRef

17.

Egger M, Smith GD, Schneider M, Minder C (1997) Bias in meta-analysis detected by a simple, graphical test. BMJ 315:629–634. https://doi.org/10.1136/bmj.315.7109.629CrossRef

18.

R Core Team (2020) R: a language and environment for statistical computing

19.

Abraham B, Nair MS (2019) Computer-aided grading of prostate cancer from MRI images using convolutional neural networks. J Intell Fuzzy Syst 36:2015–2024. https://doi.org/10.3233/JIFS-169913CrossRef

20.

Antonelli M, Johnston EW, Dikaios N et al (2019) Machine learning classifiers can predict Gleason pattern 4 prostate cancer with greater accuracy than experienced radiologists. Eur Radiol 29:4754–4764. https://doi.org/10.1007/s00330-019-06244-2CrossRef

21.

Le MH, Chen JY, Wang L et al (2017) Automated diagnosis of prostate cancer in multi-parametric MRI based on multimodal convolutional neural networks. Phys Med Biol 62:6497–6514. https://doi.org/10.1088/1361-6560/aa7731CrossRef

22.

Sobecki P, Życka-Malesa D, Mykhalevych I, Sklinda K, Przelaskowski A (2018) MRI imaging texture features in prostate lesions classification. In: Eskola H, Väisänen O, Viik J, Hyttinen J (eds) EMBEC & NBC 2017. EMBEC 2017, NBC 2017, IFMBE proceedings, vol 65. Springer, Singapore, pp 827–830

23.

Zhong X, Cao R, Shakeri S et al (2019) Deep transfer learning-based prostate cancer classification using 3 Tesla multi-parametric MRI. Abdom Radiol (NY) 44:2030–2039. https://doi.org/10.1007/s00261-018-1824-5CrossRef

24.

Bonekamp D, Kohl S, Wiesenfarth M et al (2018) Radiomic machine learning for characterization of prostate lesions with MRI: comparison to ADC values. Radiology 289:128–137. https://doi.org/10.1148/radiol.2018173064CrossRef

25.

Chaddad A, Niazi T, Probst S, Bladou F, Anidjar M, Bahoric B (2018) Predicting Gleason score of prostate cancer patients using radiomic analysis. Front Oncol 8:630. https://doi.org/10.3389/fonc.2018.00630

26.

Chen T, Li MJ, Gu YF et al (2019) Prostate cancer differentiation and aggressiveness: assessment with a radiomic-based model vs. PI-RADS v2. J Magn Reson Imaging 49:875–884. https://doi.org/10.1002/jmri.26243CrossRef

27.

Dikaios N, Alkalbani J, Abd-Alazeez M et al (2015) Zone-specific logistic regression models improve classification of prostate cancer on multi-parametric MRI. Eur Radiol 25:2727–2737. https://doi.org/10.1007/s00330-015-3636-0CrossRef

28.

Fehr D, Veeraraghavan H, Wibmer A et al (2015) Automatic classification of prostate cancer Gleason scores from multiparametric magnetic resonance images. Proc Natl Acad Sci U S A 112:E6265–E6273. https://doi.org/10.1073/pnas.1505935112CrossRef

29.

Hectors SJ, Cherny M, Yadav KK et al (2019) Radiomics features measured with multiparametric magnetic resonance imaging predict prostate Cancer aggressiveness. J Urol 202:498–505. https://doi.org/10.1097/JU.0000000000000272CrossRef

30.

Li J, Weng Z, Xu H et al (2018) Support vector machines (SVM) classification of prostate cancer Gleason score in central gland using multiparametric magnetic resonance images: a cross-validated study. Eur J Radiol 98:61–67. https://doi.org/10.1016/j.ejrad.2017.11.001CrossRef

31.

Toivonen J, Montoya Perez I, Movahedi P et al (2019) Radiomics and machine learning of multisequence multiparametric prostate MRI: towards improved non-invasive prostate cancer characterization. PLoS One 14:e0217702. https://doi.org/10.1371/journal.pone.0217702CrossRef

32.

Goel S, Shoag JE, Gross MD et al (2019) Concordance between biopsy and radical prostatectomy pathology in the era of targeted biopsy: a systematic review and meta-analysis. Eur Urol Oncol. https://doi.org/10.1016/j.euo.2019.08.001

33.

Strang B, van der Putten P, van Rijn JN, Hutter F (2018) Don’t rule out simple models prematurely: a large scale benchmark comparing linear and non-linear classifiers in OpenML. In: Duivesteijn W, Siebes A, Ukkonen A (eds) Advances in intelligent data analysis XVII. IDA 2018, Lecture notes in computer science, vol 11191. Springer, Cham, pp 303–315CrossRef

34.

Klambauer G, Unterthiner T, Mayr A, Hochreiter S (2017) Self-normalizing neural networks. In: Proceedings of the 31st international conference on neural information processing systems (NIPS’17). Curran Associates Inc., Red Hook, pp 972–981

35.

European Society of Radiology (ESR) (2019) What the radiologist should know about artificial intelligence – an ESR white paper. Insights Imaging 10:44. https://doi.org/10.1186/s13244-019-0738-2

36.

Cronin P, Kelly AM, Altaee D, Foerster B, Petrou M, Dwamena BA (2018) How to perform a systematic review and meta-analysis of diagnostic imaging studies. Acad Radiol 25:573–593. https://doi.org/10.1016/j.acra.2017.12.007

37.

Lee J, Kim KW, Choi SH, Huh J, Park SH (2015) Systematic review and meta-analysis of studies evaluating diagnostic test accuracy: a practical review for clinical researchers-part II. Statistical methods of meta-analysis. Korean J Radiol 16:1188. https://doi.org/10.3348/kjr.2015.16.6.1188

38.

Peerlings J, Woodruff HC, Winfield JM et al (2019) Stability of radiomics features in apparent diffusion coefficient maps from a multi-centre test-retest trial. Sci Rep 9:4800. https://doi.org/10.1038/s41598-019-41344-5CrossRef

39.

Clark K, Vendt B, Smith K et al (2013) The cancer imaging archive (TCIA): maintaining and operating a public information repository. J Digit Imaging 26:1045–1057. https://doi.org/10.1007/s10278-013-9622-7CrossRef

40.

Lambin P, Leijenaar RTH, Deist TM et al (2017) Radiomics: the bridge between medical imaging and personalized medicine. Nat Rev Clin Oncol 14:749–762. https://doi.org/10.1038/nrclinonc.2017.141CrossRef

41.

Dikaios N, Alkalbani J, Sidhu HS et al (2015) Logistic regression model for diagnosis of transition zone prostate cancer on multi-parametric MRI. Eur Radiol 25:523–532. https://doi.org/10.1007/s00330-014-3386-4CrossRef

42.

Stanzione A, Cocozza S, Cuocolo R, Imbriaco M (2018) Biparametric prostate MR imaging protocol: time to revise PI-RADS version 2? Radiology 287:1082–1082. https://doi.org/10.1148/radiol.2018180292CrossRef

43.

Romeo V, Maurea S, Cuocolo R et al (2018) Characterization of adrenal lesions on unenhanced MRI using texture analysis: a machine-learning approach. J Magn Reson Imaging 48:198–204. https://doi.org/10.1002/jmri.25954CrossRef

44.

Alabousi M, Salameh J-P, Gusenbauer K et al (2019) Biparametric vs multiparametric prostate magnetic resonance imaging for the detection of prostate cancer in treatment-naïve patients: a diagnostic test accuracy systematic review and meta-analysis. BJU Int 124:209–220. https://doi.org/10.1111/bju.14759CrossRef

45.

Zwanenburg A, Leger S, Vallières M, Löck S (2016) Image biomarker standardisation initiative. arXiv:1612.07003

46.

Schwier M, van Griethuysen J, Vangel MG et al (2019) Repeatability of multiparametric prostate MRI radiomics features. Sci Rep 9:9441. https://doi.org/10.1038/s41598-019-45766-zCrossRef

47.

Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP (2002) SMOTE: synthetic minority over-sampling technique. J Artif Intell Res 16:321–357. https://doi.org/10.1613/jair.953CrossRef

48.

Romeo V, Ricciardi C, Cuocolo R et al (2019) Machine learning analysis of MRI-derived texture features to predict placenta accreta spectrum in patients with placenta previa. Magn Reson Imaging 64:71–76. https://doi.org/10.1016/j.mri.2019.05.017CrossRef

49.

Zhang X, Yan L-F, Hu Y-C et al (2017) Optimizing a machine learning based glioma grading system using multi-parametric MRI histogram and texture features. Oncotarget:8. https://doi.org/10.18632/oncotarget.18001

50.

Recht B, Roelofs R, Schmidt L, Shankar V (2019) Do ImageNet classifiers generalize to ImageNet? arXiv:1902.10811

51.

Recht B, Roelofs R, Schmidt L, Shankar V (2018) Do CIFAR-10 classifiers generalize to CIFAR-10? arXiv:1806.00451

52.

Yadav S, Shukla S (2016) Analysis of k-fold cross-validation over hold-out validation on colossal datasets for quality classification. In: 2016 IEEE 6th international conference on advanced computing (IACC). IEEE, pp 78–83

53.

Claesen M, De Moor B (2015) Hyperparameter search in machine learning. arXiv:1502.02127

54.

Rao RB, Fung G, Rosales R (2008) On the dangers of cross-validation. An experimental evaluation. In: Proceedings of the 2008 SIAM international conference on data mining. Society for Industrial and Applied Mathematics, Philadelphia, pp 588–596

55.

Jampathong N, Laopaiboon M, Rattanakanokchai S, Pattanittum P (2018) Prognostic models for complete recovery in ischemic stroke: a systematic review and meta-analysis. BMC Neurol 18:26. https://doi.org/10.1186/s12883-018-1032-5CrossRef

56.

Bluemke DA, Moy L, Bredella MA et al (2020) Assessing radiology research on artificial intelligence: a brief guide for authors, reviewers, and readers—from the radiology editorial board. Radiology 294:487–489. https://doi.org/10.1148/radiol.2019192515CrossRef

Title: Machine learning for the identification of clinically significant prostate cancer on MRI: a meta-analysis
Authors: Renato Cuocolo
Maria Brunella Cipullo
Arnaldo Stanzione
Valeria Romeo
Roberta Green
Valeria Cantoni
Andrea Ponsiglione
Lorenzo Ugga
Massimo Imbriaco
Publication date: 01-12-2020
Publisher: Springer Berlin Heidelberg
Keywords: Prostate Cancer
Magnetic Resonance Imaging
Magnetic Resonance Imaging
Prostate Cancer
Prostatectomy
Prostatectomy
Published in: European Radiology / Issue 12/2020
Print ISSN: 0938-7994
Electronic ISSN: 1432-1084
DOI: https://doi.org/10.1007/s00330-020-07027-w

At a glance: The STEP trials

Springer Medicine

Machine learning for the identification of clinically significant prostate cancer on MRI: a meta-analysis

Abstract

Objectives

Methods

Results

Conclusions

Key Points

At a glance: The STEP trials

Springer Medicine

Abstract

Objectives

Methods

Results

Conclusions

Key Points

Please log in to get access to this content

Other articles of this Issue 12/2020

Intimate partner violence crisis in the COVID-19 pandemic: how can radiologists make a difference?

Infarct growth patterns may vary in acute stroke due to large vessel occlusion and recanalization with endovascular therapy

MR elastography frequency–dependent and independent parameters demonstrate accelerated decrease of brain stiffness in elder subjects

Towards reference values of pericoronary adipose tissue attenuation: impact of coronary artery and tube voltage in coronary computed tomography angiography

Application of DynaCT angiographic reconstruction in balloon pulmonary angioplasty

Bifascicular nature of the flexor digitorum profundus and flexor pollicis longus tendons: forgotten anatomical characteristics rediscovered with US imaging