Skip to main content
Top
Published in: Journal of Digital Imaging 4/2017

Open Access 01-08-2017

Medical Image Data and Datasets in the Era of Machine Learning—Whitepaper from the 2016 C-MIMI Meeting Dataset Session

Authors: Marc D. Kohli, Ronald M. Summers, J. Raymond Geis

Published in: Journal of Imaging Informatics in Medicine | Issue 4/2017

Login to get access

Abstract

At the first annual Conference on Machine Intelligence in Medical Imaging (C-MIMI), held in September 2016, a conference session on medical image data and datasets for machine learning identified multiple issues. The common theme from attendees was that everyone participating in medical image evaluation with machine learning is data starved. There is an urgent need to find better ways to collect, annotate, and reuse medical imaging data. Unique domain issues with medical image datasets require further study, development, and dissemination of best practices and standards, and a coordinated effort among medical imaging domain experts, medical imaging informaticists, government and industry data scientists, and interested commercial, academic, and government entities. High-level attributes of reusable medical image datasets suitable to train, test, validate, verify, and regulate ML products should be better described. NIH and other government agencies should promote and, where applicable, enforce, access to medical image datasets. We should improve communication among medical imaging domain experts, medical imaging informaticists, academic clinical and basic science researchers, government and industry data scientists, and interested commercial entities.
Literature
1.
go back to reference Greenspan H, van Ginneken B, Summers RM: Guest editorial deep learning in medical imaging: overview and future promise of an exciting new technique. IEEE Transactions on Medical Imaging 35:1153–1159, 2016. doi: 10.1109/TMI.2016.2553401 CrossRef Greenspan H, van Ginneken B, Summers RM: Guest editorial deep learning in medical imaging: overview and future promise of an exciting new technique. IEEE Transactions on Medical Imaging 35:1153–1159, 2016. doi: 10.​1109/​TMI.​2016.​2553401 CrossRef
2.
3.
8.
go back to reference Gueld MO, Kohnen M, Keysers D, et al: Quality of DICOM header information for image categorization. pp 280–287,2002 Gueld MO, Kohnen M, Keysers D, et al: Quality of DICOM header information for image categorization. pp 280–287,2002
18.
go back to reference Roosen J, Frans E, Wilmer A et al.: Comparison of premortem clinical diagnoses in critically ill patients and subsequent autopsy findings. Mayo Clin Proc 75:562–567, 2000CrossRefPubMed Roosen J, Frans E, Wilmer A et al.: Comparison of premortem clinical diagnoses in critically ill patients and subsequent autopsy findings. Mayo Clin Proc 75:562–567, 2000CrossRefPubMed
22.
go back to reference Borst J, Marquering HA, Kappelhof M et al.: Diagnostic accuracy of 4 commercially available semiautomatic packages for carotid artery stenosis measurement on CTA. American Journal of Neuroradiology 36:1978–1987, 2015. doi: 10.3174/ajnr.A4400 CrossRefPubMed Borst J, Marquering HA, Kappelhof M et al.: Diagnostic accuracy of 4 commercially available semiautomatic packages for carotid artery stenosis measurement on CTA. American Journal of Neuroradiology 36:1978–1987, 2015. doi: 10.​3174/​ajnr.​A4400 CrossRefPubMed
23.
go back to reference Zussman BM, Boghosian G, Gorniak RJ et al.: The relative effect of vendor variability in CT perfusion results: a method comparison study. American Journal of Roentgenology 197:468–473, 2011. doi: 10.2214/AJR.10.6058 CrossRefPubMed Zussman BM, Boghosian G, Gorniak RJ et al.: The relative effect of vendor variability in CT perfusion results: a method comparison study. American Journal of Roentgenology 197:468–473, 2011. doi: 10.​2214/​AJR.​10.​6058 CrossRefPubMed
25.
26.
go back to reference Caballero Y, Bello R, Taboada A, et al: A new measure based in the rough set theory to estimate the training set quality. In: 2006 Eighth International Symposium on Symbolic and Numeric Algorithms for Scientific Computing,2006, pp 133–140 Caballero Y, Bello R, Taboada A, et al: A new measure based in the rough set theory to estimate the training set quality. In: 2006 Eighth International Symposium on Symbolic and Numeric Algorithms for Scientific Computing,2006, pp 133–140
28.
go back to reference Deng J, Dong W, Socher R, et al: ImageNet: a large-scale hierarchical image database. CVPR09,2009 Deng J, Dong W, Socher R, et al: ImageNet: a large-scale hierarchical image database. CVPR09,2009
29.
go back to reference Yosinski J, Clune J, Bengio Y, Lipson H: How transferable are features in deep neural networks? arXiv:1411.1792 [cs],2014 Yosinski J, Clune J, Bengio Y, Lipson H: How transferable are features in deep neural networks? arXiv:1411.1792 [cs],2014
31.
35.
Metadata
Title
Medical Image Data and Datasets in the Era of Machine Learning—Whitepaper from the 2016 C-MIMI Meeting Dataset Session
Authors
Marc D. Kohli
Ronald M. Summers
J. Raymond Geis
Publication date
01-08-2017
Publisher
Springer International Publishing
Published in
Journal of Imaging Informatics in Medicine / Issue 4/2017
Print ISSN: 2948-2925
Electronic ISSN: 2948-2933
DOI
https://doi.org/10.1007/s10278-017-9976-3

Other articles of this Issue 4/2017

Journal of Digital Imaging 4/2017 Go to the issue