Skip to main content
Top

Open Access 03-02-2025 | Transcatheter Aortic Valve Implantation | Original Article

Multi-modal dataset creation for federated learning with DICOM-structured reports

Authors: Malte Tölle, Lukas Burger, Halvar Kelm, Florian André, Peter Bannas, Gerhard Diller, Norbert Frey, Philipp Garthe, Stefan Groß, Anja Hennemuth, Lars Kaderali, Nina Krüger, Andreas Leha, Simon Martin, Alexander Meyer, Eike Nagel, Stefan Orwat, Clemens Scherer, Moritz Seiffert, Jan Moritz Seliger, Stefan Simm, Tim Friede, Tim Seidler, Sandy Engelhardt

Published in: International Journal of Computer Assisted Radiology and Surgery

Login to get access

Abstract

Purpose Federated training is often challenging on heterogeneous datasets due to divergent data storage options, inconsistent naming schemes, varied annotation procedures, and disparities in label quality. This is particularly evident in the emerging multi-modal learning paradigms, where dataset harmonization including a uniform data representation and filtering options are of paramount importance.
Methods DICOM-structured reports enable the standardized linkage of arbitrary information beyond the imaging domain and can be used within Python deep learning pipelines with highdicom. Building on this, we developed an open platform for data integration with interactive filtering capabilities, thereby simplifying the process of creation of patient cohorts over several sites with consistent multi-modal data.
Results In this study, we extend our prior work by showing its applicability to more and divergent data types, as well as streamlining datasets for federated training within an established consortium of eight university hospitals in Germany. We prove its concurrent filtering ability by creating harmonized multi-modal datasets across all locations for predicting the outcome after minimally invasive heart valve replacement. The data include imaging and waveform data (i.e., computed tomography images, electrocardiography scans) as well as annotations (i.e., calcification segmentations, and pointsets), and metadata (i.e., prostheses and pacemaker dependency).
Conclusion Structured reports bridge the traditional gap between imaging systems and information systems. Utilizing the inherent DICOM reference system arbitrary data types can be queried concurrently to create meaningful cohorts for multi-centric data analysis. The graphical interface as well as example structured report templates are available at https://​github.​com/​Cardio-AI/​fl-multi-modal-dataset-creation.
Appendix
Available only for authorised users
Footnotes
9
TID3802 Cardiovascular PatientHistory, TID3829 Problem Properties, TID3830 Procedure Properties, TID3831 Medical Device Use, TID 3704 Patient Characteristics for ECG, TID3702 Prior ECG Study, TID3708 ECG Waveform Information, TID3715 ECG Measurement Source, TID3713 ECG Global Measurements, TID3714 ECG Lead Measurements, TID3717 ECG Qualitative Analysis, TID3719 ECG Summary, TID3700 ECG Report, TID2002 Report Narrative Code, TID2000 Basic Diagnostic Imaging Report.
 
Literature
5.
go back to reference Fedorov A, Longabaugh WJR, Pot D, Clunie DA, Pieper SD, Gibbs DL, Bridge C, Herrmann MD, Homeyer A, Lewis R, Aerts HJWL, Krishnaswamy D, Thiriveedhi VK, Ciausu C, Schacherer DP, Bontempi D, Pihl T, Wagner U, Farahani K, Kim E et al (2023) National cancer institute imaging data commons: toward transparency, reproducibility, and scalability in imaging artificial intelligence. Radiographics. https://doi.org/10.1148/rg.230180CrossRefPubMed Fedorov A, Longabaugh WJR, Pot D, Clunie DA, Pieper SD, Gibbs DL, Bridge C, Herrmann MD, Homeyer A, Lewis R, Aerts HJWL, Krishnaswamy D, Thiriveedhi VK, Ciausu C, Schacherer DP, Bontempi D, Pihl T, Wagner U, Farahani K, Kim E et al (2023) National cancer institute imaging data commons: toward transparency, reproducibility, and scalability in imaging artificial intelligence. Radiographics. https://​doi.​org/​10.​1148/​rg.​230180CrossRefPubMed
10.
go back to reference Landman B, Xu Z, Igelsias J, Styner M, Langerak T, Klein A (2015) Multi-atlas labeling beyond the cranial vault-workshop and challenge. In: MICCAI multi-atlas labeling beyond the cranial vault-workshop and challenge. https://doi.org/10.7303/SYN3193805 Landman B, Xu Z, Igelsias J, Styner M, Langerak T, Klein A (2015) Multi-atlas labeling beyond the cranial vault-workshop and challenge. In: MICCAI multi-atlas labeling beyond the cranial vault-workshop and challenge. https://​doi.​org/​10.​7303/​SYN3193805
12.
go back to reference Martín-Isla C, Campello VM, Izquierdo C, Kushibar K, Sendra-Balcells C, Gkontra P, Sojoudi A, Fulton MJ, Arega TW, Punithakumar K, Li L, Sun X, Al Khalil Y, Liu D, Jabbar S, Queirós S, Galati F, Mazher M, Gao Z, Beetz M et al (2023) Deep learning segmentation of the right ventricle in cardiac MRI: the M &Ms challenge. IEEE J Biomed Health Inform 27(7):3302–3313. https://doi.org/10.1109/JBHI.2023.3267857CrossRefPubMed Martín-Isla C, Campello VM, Izquierdo C, Kushibar K, Sendra-Balcells C, Gkontra P, Sojoudi A, Fulton MJ, Arega TW, Punithakumar K, Li L, Sun X, Al Khalil Y, Liu D, Jabbar S, Queirós S, Galati F, Mazher M, Gao Z, Beetz M et al (2023) Deep learning segmentation of the right ventricle in cardiac MRI: the M &Ms challenge. IEEE J Biomed Health Inform 27(7):3302–3313. https://​doi.​org/​10.​1109/​JBHI.​2023.​3267857CrossRefPubMed
15.
go back to reference Oquab M, Darcet T, Moutakanni T, Vo HV, Szafraniec M, Khalidov V, Fernandez P, HAZIZA D, Massa F, El-Nouby A, Assran M, Ballas N, Galuba W, Howes R, Huang PY, Li SW, Misra I, Rabbat M, Sharma V, Synnaeve G, et al (2024) DINOv2: learning robust visual features without supervision. Trans Mach Learn Res. https://doi.org/10.48550/arXiv.2304.07193 Oquab M, Darcet T, Moutakanni T, Vo HV, Szafraniec M, Khalidov V, Fernandez P, HAZIZA D, Massa F, El-Nouby A, Assran M, Ballas N, Galuba W, Howes R, Huang PY, Li SW, Misra I, Rabbat M, Sharma V, Synnaeve G, et al (2024) DINOv2: learning robust visual features without supervision. Trans Mach Learn Res. https://​doi.​org/​10.​48550/​arXiv.​2304.​07193
17.
18.
go back to reference Riepenhausen S, Blumenstock M, Niklas C, Hegselmann S, Neuhaus P, Meidt A, Püttmann C, Storck M, Ganzinger M, Varghese J, Dugas M (2024) Europe’s largest research infrastructure for curated medical data models with semantic annotations. Methods Inf Med EFirst. https://doi.org/10.1055/s-0044-1786839CrossRef Riepenhausen S, Blumenstock M, Niklas C, Hegselmann S, Neuhaus P, Meidt A, Püttmann C, Storck M, Ganzinger M, Varghese J, Dugas M (2024) Europe’s largest research infrastructure for curated medical data models with semantic annotations. Methods Inf Med EFirst. https://​doi.​org/​10.​1055/​s-0044-1786839CrossRef
21.
go back to reference Scherer J, Nolden M, Kleesiek J, Metzger J, Kades K, Schneider V, Bach M, Sedlaczek O, Bucher AM, Vogl TJ, Grünwald F, Kühn JP, Hoffmann RT, Kotzerke J, Bethge O, Schimmöller L, Antoch G, Müller HW, Daul A, Nikolaou K et al (2020) Joint imaging platform for federated clinical data analytics. JCO Clin Cancer Inf 4:1027–1038. https://doi.org/10.1200/CCI.20.00045CrossRef Scherer J, Nolden M, Kleesiek J, Metzger J, Kades K, Schneider V, Bach M, Sedlaczek O, Bucher AM, Vogl TJ, Grünwald F, Kühn JP, Hoffmann RT, Kotzerke J, Bethge O, Schimmöller L, Antoch G, Müller HW, Daul A, Nikolaou K et al (2020) Joint imaging platform for federated clinical data analytics. JCO Clin Cancer Inf 4:1027–1038. https://​doi.​org/​10.​1200/​CCI.​20.​00045CrossRef
22.
go back to reference Seidler T, Tölle M, André F, Bannas P, Frey N, Friedrich S, Groß S, Hennemuth A, Krüger N, Leha A, Martin S, Meyer A, Nagel E, Orwat S, Scherer C, Simm TStefan Friede, Engelhardt S, (2022) Federated learning of TAVI outcomes (FLOTO)—a collaborative multi-center deep learning initiative. Clin Res Cardiol. https://doi.org/10.1007/s00392-022-02002-5 Seidler T, Tölle M, André F, Bannas P, Frey N, Friedrich S, Groß S, Hennemuth A, Krüger N, Leha A, Martin S, Meyer A, Nagel E, Orwat S, Scherer C, Simm TStefan Friede, Engelhardt S, (2022) Federated learning of TAVI outcomes (FLOTO)—a collaborative multi-center deep learning initiative. Clin Res Cardiol. https://​doi.​org/​10.​1007/​s00392-022-02002-5
25.
go back to reference Tölle M, Garthe P, Scherer C, Seliger JM, Leha A, Krüger N, Simm S, Martin S, Eble S, Kelm H, Bednorz M, André F, Bannas P, Diller G, Frey N, Groß S, Hennemuth A, Kaderali L, Meyer A, Nagel E, et al (2024) Real World Federated Learning with a Knowledge Distilled Transformer for Cardiac CT Imaging.npj Digital Medicine https://doi.org/10.1038/s41746-025-01434-3 Tölle M, Garthe P, Scherer C, Seliger JM, Leha A, Krüger N, Simm S, Martin S, Eble S, Kelm H, Bednorz M, André F, Bannas P, Diller G, Frey N, Groß S, Hennemuth A, Kaderali L, Meyer A, Nagel E, et al (2024) Real World Federated Learning with a Knowledge Distilled Transformer for Cardiac CT Imaging.npj Digital Medicine https://​doi.​org/​10.​1038/​s41746-025-01434-3
26.
go back to reference Wasserthal J, Breit HC, Meyer MT, Pradella M, Hinck D, Sauter AW, Heye T, Boll D, Cyriac J, Yang S, Bach M, Segeroth M (2023) TotalSegmentator: robust segmentation of 104 anatomical structures in CT images. Radiol Artif Intell. https://doi.org/10.1148/ryai.230024 Wasserthal J, Breit HC, Meyer MT, Pradella M, Hinck D, Sauter AW, Heye T, Boll D, Cyriac J, Yang S, Bach M, Segeroth M (2023) TotalSegmentator: robust segmentation of 104 anatomical structures in CT images. Radiol Artif Intell. https://​doi.​org/​10.​1148/​ryai.​230024
Metadata
Title
Multi-modal dataset creation for federated learning with DICOM-structured reports
Authors
Malte Tölle
Lukas Burger
Halvar Kelm
Florian André
Peter Bannas
Gerhard Diller
Norbert Frey
Philipp Garthe
Stefan Groß
Anja Hennemuth
Lars Kaderali
Nina Krüger
Andreas Leha
Simon Martin
Alexander Meyer
Eike Nagel
Stefan Orwat
Clemens Scherer
Moritz Seiffert
Jan Moritz Seliger
Stefan Simm
Tim Friede
Tim Seidler
Sandy Engelhardt
Publication date
03-02-2025
Publisher
Springer International Publishing
Published in
International Journal of Computer Assisted Radiology and Surgery
Print ISSN: 1861-6410
Electronic ISSN: 1861-6429
DOI
https://doi.org/10.1007/s11548-025-03327-y