ml-workgroup / covid-19-image-repository

Anonymized dataset of COVID-19 cases with a focus on radiological imaging. This includes images (X-ray / CT) with extensive metadata, such as admission, ICU, laboratory, and patient master data.

Adding these images to the covid-chestxray-dataset? #1

Open ieee8023 opened 4 years ago

ieee8023 commented 4 years ago

Hey! This data release is awesome! Thanks for releasing all this data with the detailed metadata!

I have been building this dataset: https://github.com/ieee8023/covid-chestxray-dataset

I would like to merge your images into this dataset. Is this ok with you? Let me know if not. I don't want to steal the thunder of your dataset!

liob commented 4 years ago

I have seen the fantastic COVID ChestXray Dataset. Great work! We deliberately chose a license that explicitly allows remixing (CC BY 3.0). We would love to see our data integrated. Please include:

We will update this repo constantly. The next data upload is expected next Monday.

Currently, this repo only includes low-resolution, low-bitrate PNG files. We also have anonymized high-quality NIfTI files. Unfortunately, these files are too large to upload to the repo. At the moment, I send out Dropbox links (sigh) to any interested party. However, we expect to have a direct download solution shortly. If you are interested, I can send you the NIfTI files.

ieee8023 commented 4 years ago

This is great! Yes, please share the NIfTI files (joseph@josephpcohen.com).

I will write a script so it will automatically merge new releases.

If you gzip the NIfTI files, they are probably very small, right? As nii.gz? I read that the GitHub repo size limit is 100 GB.

Otherwise, check out Academic Torrents. I am using it to distribute COVID CT scans.
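The gzip idea above could look roughly like the following. This is only a minimal sketch using Python's standard library; the helper name `gzip_nifti` and the file names are hypothetical, and how much a given scan actually shrinks depends on its content.

```python
import gzip
import shutil
from pathlib import Path


def gzip_nifti(src: Path) -> Path:
    """Compress e.g. scan.nii to scan.nii.gz next to it and return the new path.

    Hypothetical helper for illustration; the original file is left in place.
    """
    dst = src.with_name(src.name + ".gz")
    # Stream the copy so large scans are not loaded into memory at once.
    with open(src, "rb") as f_in, gzip.open(dst, "wb") as f_out:
        shutil.copyfileobj(f_in, f_out)
    return dst
```

Calling `gzip_nifti(Path("scan.nii"))` would write `scan.nii.gz` alongside the input.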

liob commented 4 years ago

The X-ray files are up to 25 MB (nii.gz). However, we will include CT scans shortly. These files can easily exceed 1 GB. GitHub has a hard limit of 100 MB per file. Academic Torrents sounds interesting. I will take a closer look.
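To see which files would hit the per-file limit mentioned above, a quick check along these lines might help. It is a sketch only: the function name `oversized_files` is hypothetical, and treating "100 MB" as 100 * 1024 * 1024 bytes is an assumption.

```python
from pathlib import Path

# Assumption: "100 MB per file" is taken as 100 * 1024 * 1024 bytes.
PER_FILE_LIMIT = 100 * 1024 * 1024


def oversized_files(root, limit=PER_FILE_LIMIT):
    """Return all files under root whose size exceeds limit, sorted by path."""
    return sorted(
        p for p in Path(root).rglob("*")
        if p.is_file() and p.stat().st_size > limit
    )
```

Running `oversized_files(".")` from the repository root lists the files that would need another distribution channel.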

liob commented 4 years ago

Hello @ieee8023,

We have released version 2.0 of the dataset. Additionally, the unaltered NIfTI files are publicly available.

Best, @liob