Open just2610 opened 4 years ago
@just2610 hi! Sure, I would be happy to add the cluster, linked-cases, and gender fields. In fact, the case-cluster relationship data is currently a WIP. The branch name is feature/covid-migrant-dorm-support, and here is an example of the sample data (https://raw.githubusercontent.com/wentjun/covid-19-sg/feature/covid-migrant-dorm-support/src/data/covid-sg.json). There are no changes to the other location-related data.
As for case-case links, I am still trying to figure out the schema, but I would be happy to hear any suggestions.
@wentjun many thanks. Great to hear this. For case links, one option is to follow the Ministry's format.
There are two separate aspects: the specific link and the cluster. Links can be chained and can be one-to-many, so a list structure might be useful to capture the depth.
Yeah, I was definitely thinking of adding a list/array of linked cases to each case! That will be my next step.
Am thinking of making a single nested tree/graph structure to visualise the relationship, but I think that will require more planning too. Anyways, regarding the mapping of genders and linked clusters to each case, it will most likely be released tonight, after some tests.
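To make the list-per-case idea concrete, here is a rough sketch of how chained links could be walked to recover the depth of each case from a starting case. The field names (`cluster`, `linkedCases`) and the sample records are purely hypothetical and are not taken from covid-sg.json; the traversal is a plain breadth-first search, offered as one possible approach rather than the planned implementation.

```python
from collections import deque

# Hypothetical records keyed by case number. The field names and values
# are assumptions for illustration, not the real covid-sg.json schema.
cases = {
    1: {"cluster": "A", "linkedCases": [2, 3]},
    2: {"cluster": "A", "linkedCases": [4]},
    3: {"cluster": None, "linkedCases": []},
    4: {"cluster": None, "linkedCases": []},
}


def chain_depths(cases, root):
    """Breadth-first walk over linkedCases.

    Returns a dict mapping each reachable case id to its distance
    (number of links) from the root case. Already-visited cases are
    skipped, so circular links do not cause an infinite loop.
    """
    depths = {root: 0}
    queue = deque([root])
    while queue:
        current = queue.popleft()
        for linked in cases.get(current, {}).get("linkedCases", []):
            if linked not in depths:
                depths[linked] = depths[current] + 1
                queue.append(linked)
    return depths
```

Calling `chain_depths(cases, 1)` on the sample above gives the depth of every case reachable from case 1, which is the same information a nested tree view would need.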
Excellent! Do let me know if I can help in any way.
@wentjun I also noticed that the data is occasionally missing the nationality.
See case 6588: the nationality should be Bangladeshi.
@just2610 thanks for highlighting that. All of them should be ok by now!
@just2610 regarding your concern about discharged dates, there are some inconsistencies in the reports by the MOH. I will be dropping them an email to enquire about it.
As for the gender and hospitals, I will clean up the falsy-ish values tonight! Some of them are printed as "0" because the values are not provided, and clearly I did not check them thoroughly :)
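For anyone consuming the data before that cleanup lands, a small sketch of how placeholder values such as "0" could be normalised to missing values. The field names, the set of placeholder strings, and the sample record are assumptions for illustration only; the actual cleanup in the repository may work differently.

```python
# Assumed set of strings that stand in for "value not provided".
PLACEHOLDERS = {"", "0", "-", "N/A"}


def clean_field(value):
    """Return None for missing or placeholder values, else the stripped text."""
    if value is None:
        return None
    text = str(value).strip()
    return None if text in PLACEHOLDERS else text


def clean_case(case, fields=("gender", "hospital", "nationality")):
    """Return a copy of the case with the given (hypothetical) fields cleaned.

    Fields not listed in `fields` are passed through unchanged.
    """
    return {
        key: (clean_field(value) if key in fields else value)
        for key, value in case.items()
    }
```

For example, `clean_case({"id": 1, "gender": "0", "hospital": 0, "nationality": "Bangladeshi"})` keeps `id` and `nationality` as they are and turns the two placeholder fields into `None`.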
Thank you for an excellent dataset. Could I suggest that you add gender to the dataset? Linked cases would also be great, so that one could build cluster-level analysis.