mlmed / torchxrayvision

TorchXRayVision: A library of chest X-ray datasets and models. Classifiers, segmentation, and autoencoders.
https://mlmed.org/torchxrayvision
Apache License 2.0
857 stars 211 forks source link

Do you have models trained and/or evaluated on Chest ImaGenome? #120

Open PabloMessina opened 1 year ago

PabloMessina commented 1 year ago

Hi, just a very quick question. Chest ImaGenome (https://physionet.org/content/chest-imagenome/1.0.0/) provides very fine-grained labels and bounding boxes for most images in MIMIC-CXR. Do you guys have models trained and/or evaluated using this dataset?

Best regards, Pablo

ieee8023 commented 1 year ago

Currently no. But it looks like a very awesome dataset that we should have. Here is a notebook showing the current datasets with mask information: https://github.com/mlmed/torchxrayvision/blob/master/scripts/xray_masks.ipynb

PabloMessina commented 1 year ago

That's awesome. You already support several datasets with mask and bounding box annotations. The cool thing about Chest ImaGenome is that it comes with bounding boxes for 36 different anatomical locations + very fine-grained scene graphs describing frontal chest X-ray images, for 240K+ images of MIMIC-CXR, so it's a very large scale dataset. You can read more about how they developed the dataset here: https://arxiv.org/pdf/2108.00316.pdf

Here are some papers that have already used Chest ImaGenome, to give you an idea of the things that can be done with it:

ieee8023 commented 1 year ago

Thanks for that info! Do you know how to work with the data already? Can you help me get started with a dataset that loads the masks similar to the existing datasets to prepare a PR?

PabloMessina commented 1 year ago

I've been playing around with the dataset for a while, but I have my own ad-hoc customized ways to load and post-process the data. I can point you to specific sections of my code if that helps though. For example:

PabloMessina commented 1 year ago

This jupyter notebook might be helpful as well: https://github.com/PabloMessina/MedVQA/blob/master/medvqa/datasets/notebooks/Exploring%20Chest%20ImaGenome.ipynb (Note: it's a work in progress)

PabloMessina commented 1 year ago

Hey @ieee8023, just so you know, there is another paper recently published in CVPR 2023 using the Chest ImaGenome dataset: Interactive and Explainable Region-guided Radiology Report Generation: