🔬 Dataset¶
The challenge cohort consists of patients with histologically proven malignant melanoma, lymphoma or lung cancer as well as negative control patients who were examined by FDG-PET/CT in two large medical centers (University Hospital Tübingen, Germany & University Hospital of the LMU in Munich, Germany).
All PET/CT data within this challenge have been acquired on state-of-the-art PET/CT scanners (Siemens Biograph mCT, mCT Flow and Biograph 64, GE Discovery 690) using standardized protocols following international guidelines. CT as well as PET data are provided as 3D volumes consisting of stacks of axial slices. Data provided as part of this challenge consists of whole-body examinations. Usually, the scan range of these examinations extends from the skull base to the mid-thigh level. If clinically relevant, scans can be extended to cover the entire body including the entire head and legs/feet.
🎥 PET/CT acquisition protocol¶
University Hospital Tübingen: Patients fasted at least 6 h prior to the injection of approximately 350 MBq 18F-FDG. Whole-body PET/CT images were acquired using a Biograph mCT PET/CT scanner (Siemens, Healthcare GmbH, Erlangen, Germany) and were initiated approximately 60 min after intravenous tracer administration. Diagnostic CT scans of the neck, thorax, abdomen and pelvis (200 reference mAs; 120 kV) were acquired 90 sec after intravenous injection of a contrast agent (90–120 ml Ultravist 370, Bayer AG). PET Images were reconstructed iteratively (three iterations, 21 subsets) with Gaussian post-reconstruction smoothing (2 mm full width at half-maximum). Slice thickness on contrast-enhanced CT was 2 or 3 mm.
University Hospital of the LMU in Munich: Patients fasted at least 6 h prior to the injection of approximately 250 MBq 18F-FDG. Whole-body PET/CT images were acquired on state-of-the-art PET/CT scanners (Siemens Biograph mCT, mCT Flow and Biograph 64, GE Discovery 690) and were initiated approximately 60 min after intravenous tracer administration. Diagnostic CT scans of the neck, thorax, abdomen and pelvis (100–190 mAs; 120 kV) were acquired 90 sec after weight-adapted intravenous injection of a contrast agent (Ultravist 300, Bayer AG or Imeron 350, Bracco Imaging Deutschland GmbH). PET Images were reconstructed iteratively (three iterations, 21 subsets) with Gaussian post-reconstruction smoothing (2 mm full width at half-maximum). Slice thickness on contrast-enhanced CT was 3 mm.
⌛ Training and test cohort¶
Training cases: 1,014 studies (900 patients)
Test cases (final evaluation): 200 150 studies
Test cases (preliminary evaluation): 5 studies
A case (training or test case) consists of one 3D whole body FDG-PET volume, one corresponding 3D whole body CT volume and one 3D binary mask of manually segmented tumor lesions on FDG-PET of the size of the PET volume. CT and PET were acquired simultaneously on a single PET/CT scanner in one session; thus PET and CT are anatomically aligned up to minor shifts due to physiological motion.¶
Training set¶
Training data consists of 1,014 studies acquired at the University Hospital Tübingen and is made publicly available on TCIA (as DICOM, NiFTI and HDF5 files).
Preliminary download links (for convenience):
NiFTI data: http://data.midaslab.org/autoPET/data/nifti.zipClinical data: http://data.midaslab.org/autoPET/data/autoPETmeta.csv
If you experience issues with the download link try using wget or copy
the link into the browser address line.
The dataset is provided under CC BY license, i.e. redistribution,
remixing, adapting, and building upon this work, even commercially, is
allowed as long as the original creation is credited.
Update (June 3rd): Data are now available on TCIA and can be downloaded from there in
the DICOM format. After download, you can convert the DICOM files to
e.g. the NIfTI format using scripts
provided here.
If you use this data, please cite:
Gatidis S, Kuestner T. A whole-body FDG-PET/CT dataset with manually annotated tumor lesions
(FDG-PET-CT-Lesions) [Dataset]. The Cancer Imaging Archive, 2022. DOI: 10.7937/gkr0-xv29
Preliminary test set¶
For the self-evaluation of participating pipelines, we provide access to a preliminary test set. The preliminary test set uses the same imaging data as the final test set, but consists of 5 studies only.
The access to this preliminary set is restricted and only possible through the docker containers submitted to the challenge, and only available for a limited time during the competition. The purpose of this is that participants can check the sanity of their approaches.
Final test set¶
The final test set is consists of 200 150 studies with 100 studies
originating from the same hospital as the training database (University
Hospital Tübingen) and 100 50 are drawn from a different hospital
(University Hospital of the LMU in Munich) with similar acquisition
protocol to assess algorithm robustness and generalizability.
Data pre-processing and structure¶
In a pre-processing step, the TCIA DICOM files are resampled (CT to PET imaging resolution, i.e. same matrix size) and normalized (PET converted to standardized update values; SUV).
For the challenge, the pre-processed data will be provided in NifTI
format. PET data is standardized by converting image units from activity
counts to standardized uptake values (SUV). We recommend to use the
resampled CT (CTres.nii.gz
) and the PET in SUV (SUV.nii.gz
). The
mask (SEG.nii.gz
) is binary with 1
indicating the lesion. The
training and test database have the following structure.
NiFTI
|--- Patient 1
|--- Study 1
|--- SUV.nii.gz (PET image in SUV)
|--- CTres.nii.gz (CT image resampled to PET)
|--- CT.nii.gz (Original CT image)
|--- SEG.nii.gz (Manual annotations of tumor lesions)
|--- PET.nii.gz (Original PET image as actictivity counts)
|--- Study 2 (Potential 2nd visit of same patient)
|--- ...
|--- Patient 2
|--- ...
Each NiFTI file contains the respective image or the mask.
An example case can be loaded as:
import nibabel as nib
SUV = nib.load(os.path.join(data_root_path, 'PETCT_0af7ffe12a', '08-12-2005-NA-PET-CT Ganzkoerper primaer mit KM-96698', 'SUV.nii.gz'))
where PETCT_0af7ffe12a
is the fully anonymized patient and
08-12-2005-NA-PET-CT Ganzkoerper primaer mit KM-96698
is the
anonymized study (randomly generated study name, date is not reflecting
scan date).
MHA
Please note, that the submission and evaluation interfaces provided by
grand-challenge are working with .mha
data. Hence, you will need to
read the test images in your submission from an .mha
file. Please see
the submission for more information:
import SimpleITK
image =SimpleITK.GetArrayFromImage(SimpleITK.ReadImage(inputImageFileName))
If you want to work on NiFTI or other file formats internally, we also provide some conversion scripts: https://github.com/lab-midas/autoPET
✒ Annotation¶
Two experts annotated training and test data: At the University Hospital Tübingen, a Radiologist with 10 years of experience in Hybrid Imaging and experience in machine learning research annotated all data. At the University Hospital of the LMU in Munich, a Radiologist with 5 years of of experience in Hybrid Imaging and experience in machine learning research annotated all data.
The following annotation protocol was defined:Step 1: Identification of FDG-avid tumor lesions by visual assessment of PET and CT information together with the clinical examination reports.Step 2: Manual free-hand segmentation of identified lesions in axial slices.