MIT-LCP / mimic-code

MIMIC Code Repository: Code shared by the research community for the MIMIC family of databases
https://mimic.mit.edu
MIT License
2.43k stars 1.5k forks source link

Large overlap of "AP" and "CHEST (PA and LAT)" images in MIMIC-CXR #1513

Closed maximek3 closed 1 year ago

maximek3 commented 1 year ago

Prerequisites

Description

Hi, I have found that there is an overlap of 22,043 chest X-rays in MIMIC-CXR that have as ViewPosition AP but their PerformedProcedureStepDescription is given as CHEST (PA and LAT). Are those X-rays all the ones that are AP, but not taken with a portable scanner?

Thanks for your help!

alistairewj commented 1 year ago

Hmm interesting. I would instinctively trust the ViewPosition - it's required to be present for all the images. No guarantees as I haven't investigated that subset of images in detail.

maximek3 commented 1 year ago

Thanks for getting back on this.

It seems that >99% of images have either CHEST (PA and LAT) or CHEST (PORTABLE AP) as value for ProcedureCodeSequence_CodeMeaning.

Would it be fair to assume that whenever an image is both AP and CHEST (PA and LAT), it's an AP scan taken with a fixed scanner? And when it's AP and CHEST (PORTABLE AP), it's an AP scan taken with a portable scanner?

alistairewj commented 1 year ago

That's what I would do. And then I'd check ~10-20 records to make sure it's consistent with what I see. For a while we had a model which classified images as PA or AP, since it's a very easy classification problem, so that's an alternative option.

maximek3 commented 1 year ago

Thanks for the pointers, I did this, and here is what I found:

For 20 studies that have both AP and CHEST (PA and LAT):

For 10 studies that have AP and CHEST (PORTABLE AP), 8 clearly indicate in the report that the scans were taken on a portable scanner. For the remaining 2 there is no information excluding this.

All in all, it does seem like the assumptions below is correct :)

Would it be fair to assume that whenever an image is both AP and CHEST (PA and LAT), it's an AP scan taken with a fixed scanner? And when it's AP and CHEST (PORTABLE AP), it's an AP scan taken with a portable scanner?