ieee8023 / covid-chestxray-dataset

We are building an open database of COVID-19 cases with chest X-ray or CT images.

Recommended datasets for transfer learning #28

Open oplatek opened 4 years ago

oplatek commented 4 years ago

Hi @ieee8023

thank you for maintaining this dataset!

I implemented a PyTorch Lightning wrapper around a DenseNet model for covid-chestxray-dataset.

It is the kickoff of a PyTorch Lightning community project that aims to be a COVID-19 detector (for educational purposes).

Can you recommend datasets, and strategies for making use of the additional data?

I have skimmed https://arxiv.org/pdf/2002.02497.pdf (I will return to it). It seems that resolving the labeling differences and other differences in dataset preparation requires quite a lot of domain expertise. Any tips are appreciated.
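To make the labeling problem concrete, here is a minimal sketch of mapping dataset-specific label names onto one shared label set. Everything in it is an assumption for illustration: the dataset names (`dataset_a`, `dataset_b`), their label vocabularies, and the three target labels are hypothetical and are not taken from the paper or from any real dataset.

```python
from typing import Dict, Iterable, List

# Hypothetical shared label set; a real project would choose its own.
COMMON_LABELS: List[str] = ["covid19", "pneumonia", "normal"]

# Hypothetical per-dataset vocabularies mapped onto the shared labels.
LABEL_MAPS: Dict[str, Dict[str, str]] = {
    "dataset_a": {
        "COVID-19": "covid19",
        "Pneumonia": "pneumonia",
        "No Finding": "normal",
    },
    "dataset_b": {
        "covid": "covid19",
        "bacterial pneumonia": "pneumonia",
        "viral pneumonia": "pneumonia",
        "healthy": "normal",
    },
}


def harmonize(dataset: str, raw_labels: Iterable[str]) -> List[int]:
    """Return a multi-hot vector over COMMON_LABELS for one image."""
    mapping = LABEL_MAPS[dataset]
    vector = [0] * len(COMMON_LABELS)
    for raw in raw_labels:
        common = mapping.get(raw)
        if common is None:
            # No counterpart in the shared set: skip rather than guess.
            continue
        vector[COMMON_LABELS.index(common)] = 1
    return vector
```

For example, `harmonize("dataset_b", ["viral pneumonia"])` marks only the `pneumonia` slot. Deciding which source labels really are equivalent is the part that needs the domain expertise.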

Kind regards

Ondra

PS: I was inspired by #15

PPS: My fork was merged into the PyTorchLightning community project

PPPS: I believe that @borda has already contacted you to say that we may use Slack for longer discussions if needed. The link to the Slack can be found at PL

pkienzle commented 4 years ago

Not a recommendation, but here are some links to datasets that I found:

I also found a list of public medical imaging data collections, not restricted to lung X-rays, at https://www.radrounds.com/profiles/blogs/list-of-open-access-medical-imaging-datasets

ammarchalifah commented 4 years ago

Hi Oplatek and Pkienzle, thank you so much for the question and the answer! I'm currently trying to train a COVID chest X-ray detection model. I have already trained the classifier on a Kaggle dataset. Now I want to train it on COVID, pneumonia, and normal data. I plan to use the COVID data from this repository, but I'm having difficulty finding normal data. Are there any references or suggestions?

Thank you so much!
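Once the three kinds of images have been collected, combining them into one labeled index could look roughly like the sketch below. It is only an illustration under stated assumptions: the class names, the class order, and the file paths in the usage note are hypothetical, and the image lists are assumed to have been gathered from each source beforehand.

```python
from collections import Counter
from typing import Dict, List, Tuple

# Hypothetical class order; the position in this list is the class id.
CLASSES: List[str] = ["normal", "pneumonia", "covid19"]


def build_index(sources: Dict[str, List[str]]) -> List[Tuple[str, int]]:
    """Merge per-class lists of image paths into (path, class_id) pairs."""
    index: List[Tuple[str, int]] = []
    for class_id, name in enumerate(CLASSES):
        # Sort for a reproducible order; a missing class contributes nothing.
        for path in sorted(sources.get(name, [])):
            index.append((path, class_id))
    return index


def class_counts(index: List[Tuple[str, int]]) -> Dict[str, int]:
    """Count samples per class, useful for spotting class imbalance."""
    counts = Counter(class_id for _, class_id in index)
    return {name: counts.get(i, 0) for i, name in enumerate(CLASSES)}
```

Calling `build_index({"covid19": ["c1.png"], "normal": ["n2.png", "n1.png"]})` yields the normal images first, then the COVID image, and `class_counts` on the result reports zero pneumonia samples, which makes a missing or under-represented class easy to notice before training.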

ieee8023 commented 4 years ago

Please check out this paper for a transfer learning approach and tasks to work on (and the clinical workflows that could benefit from tools): http://arxiv.org/abs/2006.11988

Also check out this library for dataloaders for over 7 different datasets: https://github.com/mlmed/torchxrayvision