lindawangg / COVID-Net

COVID-Net Open Source Initiative
Other
1.15k stars 480 forks source link

Missing RSNA images #57

Closed vakker closed 3 years ago

vakker commented 4 years ago

RSNA has ~26k images, but COVIDx has ~13k samples. Could you include some explanation on how the samples are chosen?

lindawangg commented 4 years ago

From the data script, https://github.com/lindawangg/COVID-Net/blob/master/create_COVIDx_v3.ipynb, the normal xrays were taken from one csv file and the pneumonia xrays were taken from another. The normal xrays were all label 'Normal' in the csv file and the pneumonia xrays were all labeled '1' in the other csv file. This is to eliminate the xrays that aren't normal and aren't pneumonia.