Published in: Journal of Digital Imaging 6/2018

Open Access 01-12-2018

Reengineering Workflow for Curation of DICOM Datasets

Authors: William Bennett, Kirk Smith, Quasar Jarosz, Tracy Nolan, Walter Bosch

Published in: Journal of Imaging Informatics in Medicine | Issue 6/2018

Abstract

Reusable, publicly available data is a pillar of open science and rapid advancement of cancer imaging research. Sharing data from completed research studies not only saves research dollars required to collect data, but also helps ensure that studies are both replicable and reproducible. The Cancer Imaging Archive (TCIA) is a global shared repository for imaging data related to cancer. Ensuring the consistency, scientific utility, and anonymity of data stored in TCIA is of utmost importance. As the rate of submission to TCIA has been increasing, both in volume and complexity of DICOM objects stored, the process of curation of collections has become a bottleneck in acquisition of data. In order to increase the rate of curation of image sets, improve the quality of the curation, and better track the provenance of changes made to submitted DICOM image sets, a custom set of tools was developed, using novel methods for the analysis of DICOM data sets. These tools are written in the programming language Perl, use the open-source database PostgreSQL, make use of the Perl DICOM routines in the open-source package Posda, and incorporate DICOM diagnostic tools from other open-source packages, such as dicom3tools. These tools are referred to as the “Posda Tools.” The Posda Tools are open source and available via git at https://github.com/UAMS-DBMI/PosdaTools.
In this paper, we briefly describe the Posda Tools and discuss the novel methods employed by these tools to facilitate rapid analysis of DICOM data, including the following: (1) use of a database schema that is more permissive than, and differently normalized from, traditional DICOM databases; (2) automatic integrity checks performed on a bulk basis; (3) application of revisions to DICOM datasets on a bulk basis, either through a web-based interface or via command-line executable Perl scripts; (4) tracking of all such edits in a revision tracker, so that they may be rolled back; (5) a UI for inspecting the results of such edits to verify that they are what was intended; (6) identification of DICOM Studies, Series, and SOP instances using “nicknames” that are persistent and have well-defined scope, which makes reported DICOM errors easier to express and manage; and (7) rapid identification of potential duplicate DICOM datasets by pixel data, which can be used, e.g., to identify submission subjects that may relate to the same individual, without identifying the individual.
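One way to picture the duplicate detection by pixel data mentioned in the abstract is to group instances by a digest of their pixel bytes. The sketch below is illustrative only and is not taken from the Posda Tools: the abstract does not say which digest is used or how pixel data is extracted, so the choice of SHA-256, the function names `pixel_digest` and `find_pixel_duplicates`, and the sample identifiers are all assumptions. It is written in Python for brevity, whereas the tools themselves are written in Perl.

```python
import hashlib
from collections import defaultdict


def pixel_digest(pixel_bytes: bytes) -> str:
    """Return a hex digest of raw pixel data (SHA-256 is an assumption)."""
    return hashlib.sha256(pixel_bytes).hexdigest()


def find_pixel_duplicates(instances):
    """Group instance identifiers that share byte-identical pixel data.

    `instances` is an iterable of (instance_id, pixel_bytes) pairs.
    Returns a sorted list of sorted id groups, keeping only groups
    that contain more than one instance.
    """
    by_digest = defaultdict(list)
    for instance_id, pixel_bytes in instances:
        by_digest[pixel_digest(pixel_bytes)].append(instance_id)
    return sorted(sorted(ids) for ids in by_digest.values() if len(ids) > 1)


# Hypothetical input: two submissions carry the same pixels under different ids.
sample = [
    ("subject_A/img_1", b"\x00\x01\x02\x03"),
    ("subject_B/img_7", b"\x00\x01\x02\x03"),
    ("subject_C/img_2", b"\x10\x11\x12\x13"),
]
duplicates = find_pixel_duplicates(sample)
```

Because the comparison uses only the digest, two subjects can be flagged as potentially the same individual without consulting any identifying header fields. A byte-level digest of this kind only catches exact matches; re-encoded or resampled images would need a different comparison.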
Metadata
Title
Reengineering Workflow for Curation of DICOM Datasets
Authors
William Bennett
Kirk Smith
Quasar Jarosz
Tracy Nolan
Walter Bosch
Publication date
01-12-2018
Publisher
Springer International Publishing
Published in
Journal of Imaging Informatics in Medicine / Issue 6/2018
Print ISSN: 2948-2925
Electronic ISSN: 2948-2933
DOI
https://doi.org/10.1007/s10278-018-0097-4
