Awards
Identified for RadioGraphics
Participants
Andriy Fedorov, PhD, Arlington, MA (Presenter) Nothing to Disclose
NCI Imaging Data Commons (IDC) is a cloud-based repository of publicly available cancer imaging data co-located with analysis and exploration tools and resources. IDC is part of the NCI Cancer Research Data Commons (CRDC) infrastructure, which provides secure access to a large, comprehensive, and expanding collection of cancer research data. IDC is built on a combination of the commercially available Google Cloud Platform and open-source components. The DICOM standard is used for representation and communication of the data. The Google Healthcare API enables the use of SQL for DICOM metadata exploration. The IDC portal (https://imaging.datacommons.cancer.gov) supports exploration and visualization of the data and cohort building. As of Spring 2022, IDC hosts over 30 TB of public radiology and digital pathology images and image-derived data. IDC is intended for cloud-based data processing, but data can also be freely downloaded for on-premises analysis. Co-locating persistent data with software tools and compute resources in the cloud enables reproducible analysis workflows. Attendees will learn about the scope and status of IDC (e.g., new features of the image viewers and IDC portal), accompanying learning resources (e.g., interactive notebooks with reproducible AI workflows applied to the data within IDC), and plans for future development. Live interactive demonstrations will be presented.
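As an illustration of SQL-based DICOM metadata exploration, the following minimal sketch builds a query string that lists the imaging series in one collection for one modality. It only constructs the SQL text and does not connect to any service. The table name `bigquery-public-data.idc_current.dicom_all`, the `collection_id` column, and the example collection identifier are assumptions not stated in this abstract, and `build_series_query` is a hypothetical helper; check the IDC documentation for the current schema before use.

```python
import re

# Assumption: name of the public BigQuery table holding IDC DICOM metadata.
# This is not given in the abstract; verify it against the IDC documentation.
DICOM_TABLE = "bigquery-public-data.idc_current.dicom_all"

# Allow only simple identifiers so values can be embedded in the SQL text safely.
_SAFE_VALUE = re.compile(r"[A-Za-z0-9_]+")


def build_series_query(collection_id: str, modality: str, limit: int = 10) -> str:
    """Return SQL text listing distinct series for one collection and modality.

    Hypothetical helper for illustration only. `collection_id` is an assumed
    column name; PatientID, StudyInstanceUID, SeriesInstanceUID, and Modality
    are standard DICOM attribute keywords.
    """
    for value in (collection_id, modality):
        if not _SAFE_VALUE.fullmatch(value):
            raise ValueError(f"unsupported characters in value: {value!r}")
    if limit <= 0:
        raise ValueError("limit must be a positive integer")
    return (
        "SELECT DISTINCT PatientID, StudyInstanceUID, SeriesInstanceUID\n"
        f"FROM `{DICOM_TABLE}`\n"
        f"WHERE collection_id = '{collection_id}' AND Modality = '{modality}'\n"
        f"LIMIT {int(limit)}"
    )


if __name__ == "__main__":
    # The collection identifier below is an assumed example value.
    print(build_series_query("nsclc_radiomics", "CT", limit=5))
```

The resulting text could be pasted into a SQL console or passed to a query client. Restricting inputs to simple identifiers is a deliberate simplification that keeps the sketch free of external dependencies; a production workflow would more likely use parameterized queries.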
TABLE OF CONTENTS/OUTLINE
Overview of CRDC and IDC; Data curation and the role of The Cancer Imaging Archive; Portal; Viewer; Organization of data; Data versioning; Integration of tools; Use case development; Documentation and user support resources; IDC cloud credit program; Status update and plans for future development.