Skip to main content
Top
Published in: European Radiology 9/2022

Open Access 09-04-2022 | Musculoskeletal

Development and evaluation of machine learning models based on X-ray radiomics for the classification and differentiation of malignant and benign bone tumors

Authors: Claudio E. von Schacky, Nikolas J. Wilhelm, Valerie S. Schäfer, Yannik Leonhardt, Matthias Jung, Pia M. Jungmann, Maximilian F. Russe, Sarah C. Foreman, Felix G. Gassert, Florian T. Gassert, Benedikt J. Schwaiger, Carolin Mogler, Carolin Knebel, Ruediger von Eisenhart-Rothe, Marcus R. Makowski, Klaus Woertler, Rainer Burgkart, Alexandra S. Gersing

Published in: European Radiology | Issue 9/2022

Login to get access

Abstract

Objectives

To develop and validate machine learning models to distinguish between benign and malignant bone lesions and compare the performance to radiologists.

Methods

In 880 patients (age 33.1 ± 19.4 years, 395 women) diagnosed with malignant (n = 213, 24.2%) or benign (n = 667, 75.8%) primary bone tumors, preoperative radiographs were obtained, and the diagnosis was established using histopathology. Data was split 70%/15%/15% for training, validation, and internal testing. Additionally, 96 patients from another institution were obtained for external testing. Machine learning models were developed and validated using radiomic features and demographic information. The performance of each model was evaluated on the test sets for accuracy, area under the curve (AUC) from receiver operating characteristics, sensitivity, and specificity. For comparison, the external test set was evaluated by two radiology residents and two radiologists who specialized in musculoskeletal tumor imaging.

Results

The best machine learning model was based on an artificial neural network (ANN) combining both radiomic and demographic information achieving 80% and 75% accuracy at 75% and 90% sensitivity with 0.79 and 0.90 AUC on the internal and external test set, respectively. In comparison, the radiology residents achieved 71% and 65% accuracy at 61% and 35% sensitivity while the radiologists specialized in musculoskeletal tumor imaging achieved an 84% and 83% accuracy at 90% and 81% sensitivity, respectively.

Conclusions

An ANN combining radiomic features and demographic information showed the best performance in distinguishing between benign and malignant bone lesions. The model showed lower accuracy compared to specialized radiologists, while accuracy was higher or similar compared to residents.

Key Points

• The developed machine learning model could differentiate benign from malignant bone tumors  using radiography with an AUC of 0.90 on the external test set.
• Machine learning models that used radiomic features or demographic information alone performed worse than those that used both radiomic features and demographic information as input, highlighting the importance of building comprehensive machine learning models.
• An artificial neural network that combined both radiomic and demographic information achieved the best performance and its performance was compared to radiology readers on an external test set.
Appendix
Available only for authorised users
Literature
1.
go back to reference Costelloe CM, Madewell JE (2013) Radiography in the initial diagnosis of primary bone tumors. AJR Am J Roentgenol 200:3–7CrossRef Costelloe CM, Madewell JE (2013) Radiography in the initial diagnosis of primary bone tumors. AJR Am J Roentgenol 200:3–7CrossRef
2.
go back to reference Miller TT (2008) Bone tumors and tumorlike conditions: analysis with conventional radiography. Radiology 246:662–674CrossRef Miller TT (2008) Bone tumors and tumorlike conditions: analysis with conventional radiography. Radiology 246:662–674CrossRef
3.
go back to reference Lalam R, Bloem JL, Noebauer-Huhmann IM et al (2017) ESSR consensus document for detection, characterization, and referral pathway for tumors and tumorlike lesions of bone. Semin Musculoskelet Radiol 21:630–647CrossRef Lalam R, Bloem JL, Noebauer-Huhmann IM et al (2017) ESSR consensus document for detection, characterization, and referral pathway for tumors and tumorlike lesions of bone. Semin Musculoskelet Radiol 21:630–647CrossRef
4.
go back to reference Lodwick GS (1964) Radiographic diagnosis and grading of bone tumors, with comments on computer evaluation. Proc Natl Cancer Conf 5:369–380PubMed Lodwick GS (1964) Radiographic diagnosis and grading of bone tumors, with comments on computer evaluation. Proc Natl Cancer Conf 5:369–380PubMed
5.
go back to reference Caracciolo JT, Temple HT, Letson GD, Kransdorf MJ (2016) A modified Lodwick-Madewell grading system for the evaluation of lytic bone lesions. AJR Am J Roentgenol 207:150–156CrossRef Caracciolo JT, Temple HT, Letson GD, Kransdorf MJ (2016) A modified Lodwick-Madewell grading system for the evaluation of lytic bone lesions. AJR Am J Roentgenol 207:150–156CrossRef
6.
go back to reference Nascimento D, Suchard G, Hatem M, de Abreu A (2014) The role of magnetic resonance imaging in the evaluation of bone tumours and tumour-like lesions. Insights Imaging 5:419–440CrossRef Nascimento D, Suchard G, Hatem M, de Abreu A (2014) The role of magnetic resonance imaging in the evaluation of bone tumours and tumour-like lesions. Insights Imaging 5:419–440CrossRef
7.
go back to reference van Griethuysen JJM, Fedorov A, Parmar C et al (2017) Computational radiomics system to decode the radiographic phenotype. Cancer Res 77:e104–e107CrossRef van Griethuysen JJM, Fedorov A, Parmar C et al (2017) Computational radiomics system to decode the radiographic phenotype. Cancer Res 77:e104–e107CrossRef
8.
go back to reference Li H, Mendel KR, Lan L, Sheth D, Giger ML (2019) Digital mammography in breast cancer: additive value of radiomics of breast parenchyma. Radiology 291:15–20CrossRef Li H, Mendel KR, Lan L, Sheth D, Giger ML (2019) Digital mammography in breast cancer: additive value of radiomics of breast parenchyma. Radiology 291:15–20CrossRef
9.
go back to reference Beig N, Khorrami M, Alilou M et al (2019) Perinodular and intranodular radiomic features on lung CT images distinguish adenocarcinomas from granulomas. Radiology 290:783–792CrossRef Beig N, Khorrami M, Alilou M et al (2019) Perinodular and intranodular radiomic features on lung CT images distinguish adenocarcinomas from granulomas. Radiology 290:783–792CrossRef
10.
go back to reference Gillies RJ, Kinahan PE, Hricak H (2016) Radiomics: images are more than pictures, they are data. Radiology 278:563–577CrossRef Gillies RJ, Kinahan PE, Hricak H (2016) Radiomics: images are more than pictures, they are data. Radiology 278:563–577CrossRef
12.
go back to reference Prinzie A, Van den Poel D (2007) Random multiclass classification: generalizing random forests to random MNL and random NB. In: Wagner R, Revell N, Pernul G (eds) Database and Expert Systems Applications. Springer, Berlin Heidelberg, pp 349–358CrossRef Prinzie A, Van den Poel D (2007) Random multiclass classification: generalizing random forests to random MNL and random NB. In: Wagner R, Revell N, Pernul G (eds) Database and Expert Systems Applications. Springer, Berlin Heidelberg, pp 349–358CrossRef
13.
go back to reference Do BH, Langlotz C, Beaulieu CF (2017) Bone tumor diagnosis using a Naïve Bayesian model of demographic and radiographic features. J Digit Imaging 30:640–647CrossRef Do BH, Langlotz C, Beaulieu CF (2017) Bone tumor diagnosis using a Naïve Bayesian model of demographic and radiographic features. J Digit Imaging 30:640–647CrossRef
14.
go back to reference Choi YS, Ahn SS, Chang JH et al (2020) Machine learning and radiomic phenotyping of lower grade gliomas: improving survival prediction. Eur Radiol 30:3834–3842CrossRef Choi YS, Ahn SS, Chang JH et al (2020) Machine learning and radiomic phenotyping of lower grade gliomas: improving survival prediction. Eur Radiol 30:3834–3842CrossRef
15.
go back to reference Weikert T, Francone M, Abbara S et al (2021) Machine learning in cardiovascular radiology: ESCR position statement on design requirements, quality assessment, current applications, opportunities, and challenges. Eur Radiol 31:3909–3922CrossRef Weikert T, Francone M, Abbara S et al (2021) Machine learning in cardiovascular radiology: ESCR position statement on design requirements, quality assessment, current applications, opportunities, and challenges. Eur Radiol 31:3909–3922CrossRef
16.
go back to reference Fedorov A, Beichel R, Kalpathy-Cramer J et al (2012) 3D Slicer as an image computing platform for the Quantitative Imaging Network. Magn Reson Imaging 30:1323–1341CrossRef Fedorov A, Beichel R, Kalpathy-Cramer J et al (2012) 3D Slicer as an image computing platform for the Quantitative Imaging Network. Magn Reson Imaging 30:1323–1341CrossRef
18.
go back to reference Howard J, Gugger S (2020) Fastai: a layered API for deep learning. Information 11:108CrossRef Howard J, Gugger S (2020) Fastai: a layered API for deep learning. Information 11:108CrossRef
20.
go back to reference Robin X, Turck N, Hainard A et al (2011) pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinform 12:77–77CrossRef Robin X, Turck N, Hainard A et al (2011) pROC: an open-source package for R and S+ to analyze and compare ROC curves. BMC Bioinform 12:77–77CrossRef
21.
go back to reference Gitto S, Cuocolo R, Albano D et al (2020) MRI radiomics-based machine-learning classification of bone chondrosarcoma. Eur J Radiol 128:109043CrossRef Gitto S, Cuocolo R, Albano D et al (2020) MRI radiomics-based machine-learning classification of bone chondrosarcoma. Eur J Radiol 128:109043CrossRef
22.
go back to reference Mattonen SA, Davidzon GA, Benson J et al (2019) Bone marrow and tumor radiomics at 18F-FDG PET/CT: impact on outcome prediction in non-small cell lung cancer. Radiology 293:451–459CrossRef Mattonen SA, Davidzon GA, Benson J et al (2019) Bone marrow and tumor radiomics at 18F-FDG PET/CT: impact on outcome prediction in non-small cell lung cancer. Radiology 293:451–459CrossRef
23.
go back to reference Reinus WR, Wilson AJ, Kalman B, Kwasny S (1994) Diagnosis of focal bone lesions using neural networks. Invest Radiol 29:606–611 Reinus WR, Wilson AJ, Kalman B, Kwasny S (1994) Diagnosis of focal bone lesions using neural networks. Invest Radiol 29:606–611
24.
go back to reference Kahn CE Jr, Laur JJ, Carrera GF (2001) A Bayesian network for diagnosis of primary bone tumors. J Digit Imaging 14:56–57CrossRef Kahn CE Jr, Laur JJ, Carrera GF (2001) A Bayesian network for diagnosis of primary bone tumors. J Digit Imaging 14:56–57CrossRef
25.
go back to reference Stiller CA, Trama A, Serraino D et al (2013) Descriptive epidemiology of sarcomas in Europe: report from the RARECARE project. Eur J Cancer 49:684–695CrossRef Stiller CA, Trama A, Serraino D et al (2013) Descriptive epidemiology of sarcomas in Europe: report from the RARECARE project. Eur J Cancer 49:684–695CrossRef
26.
go back to reference Hashimoto K, Nishimura S, Oka N, Akagi M (2020) Clinical features and outcomes of primary bone and soft tissue sarcomas in adolescents and young adults. Mol Clin Oncol 12:358–364PubMedPubMedCentral Hashimoto K, Nishimura S, Oka N, Akagi M (2020) Clinical features and outcomes of primary bone and soft tissue sarcomas in adolescents and young adults. Mol Clin Oncol 12:358–364PubMedPubMedCentral
Metadata
Title
Development and evaluation of machine learning models based on X-ray radiomics for the classification and differentiation of malignant and benign bone tumors
Authors
Claudio E. von Schacky
Nikolas J. Wilhelm
Valerie S. Schäfer
Yannik Leonhardt
Matthias Jung
Pia M. Jungmann
Maximilian F. Russe
Sarah C. Foreman
Felix G. Gassert
Florian T. Gassert
Benedikt J. Schwaiger
Carolin Mogler
Carolin Knebel
Ruediger von Eisenhart-Rothe
Marcus R. Makowski
Klaus Woertler
Rainer Burgkart
Alexandra S. Gersing
Publication date
09-04-2022
Publisher
Springer Berlin Heidelberg
Published in
European Radiology / Issue 9/2022
Print ISSN: 0938-7994
Electronic ISSN: 1432-1084
DOI
https://doi.org/10.1007/s00330-022-08764-w

Other articles of this Issue 9/2022

European Radiology 9/2022 Go to the issue