qxiaobu / FLANNEL

3 stars 4 forks source link

get_covid_data_dict.py - Incorrect Column Numbers #1

Open beyerch opened 3 years ago

beyerch commented 3 years ago

The column numbers referenced in the get_covid_data_dict.py routine do not seem to align with the metadata.csv file for the covid-chestxray-dataset-master. Perhaps the xray data has changed? Could use some pointers as to what subject_id & disease correspond to.

Is subject_id = offset disease = finding

?

image

qxiaobu commented 3 years ago

I think the metadata.csv has been updated. I used a littile older one as below,

image

I think you can re-write the part to extract and organize data according to the new metadata.csv

beyerch commented 3 years ago

For reference, here are the updated columns based on the data acquired on 4/11/2020 metadata file:

patient_id = row[0] subject_id = row[1] view = row[18] image_name = row[23] disease = row[4] modality = row[19]