wentjun / covid-19-sg

Visualising COVID-19/Coronavirus cases and cluster zones in Singapore
https://covid-tracker.com/
GNU General Public License v3.0
16 stars 4 forks source link

A suggestion to add Gender #5

Open just2610 opened 4 years ago

just2610 commented 4 years ago

Thank you for an excellent dataset. Could I suggest that you look to add gender to the dataset. Linked cases would also be great - so that one could look at building cluster level analysis.

wentjun commented 4 years ago

@just2610 hi! Sure, I would be happy to add in the cluster/linked cases/gender fields. In fact, the case-cluster relationship data is currently a WIP. The branch name is feature/covid-migrant-dorm-support, and here is an example of the sample data (https://raw.githubusercontent.com/wentjun/covid-19-sg/feature/covid-migrant-dorm-support/src/data/covid-sg.json). There are no changes to the other locations related data.

As for case-case links, I am still trying to figure out the schema, but I would be happy to hear any suggestions.

just2610 commented 4 years ago

@wentjun many thanks. Great to hear this. For case links, one schema is to use the Ministry's format

image

There are two separate aspects: specific link and the cluster. links can be chained - and one to many. So a list structure might be useful to capture the depth.

wentjun commented 4 years ago

Yeahh I was definitely thinking of making a list/array of linked cases to each case! That will be my next step.

Am thinking of making a single nested tree/graph structure to visualise the relationship, but I think that will require more planning too. Anyways, regarding the mapping of genders and linked clusters to each case, it will most likely be released tonight, after some tests.

just2610 commented 4 years ago

Excellent! Do let me know if I can help in any way.

just2610 commented 4 years ago

@wentjun also noticed that occasionally the data Is missing the Nationality

image image

See case 6588 - Nationality should be Bangladeshi.

wentjun commented 4 years ago

@just2610 thanks for highlighting that. All of them should be ok by now!

wentjun commented 4 years ago

@just2610 regarding your concern about discharged dates, there are some inconsistencies in the reports by the MOH. I will be dropping them an email to enquire about it.

As for the gender and hospitals, I will clean up the falsy-ish values tonight! Some of them are printed as "0" because the values are not provided, and clearly I did not check them thoroughly :)