COVID-19-electronic-health-system / Coronalytics

CoronaTracker analytics service
MIT License
5 stars 3 forks source link

Ammend Typos and Expand Mock-Data #15

Open salvolpe opened 4 years ago

salvolpe commented 4 years ago

I'm beginning to preprocess the data and I've noticed that there are a couple of one-letter typos in the labels:

Also, I wanted to expand the data to include:

ngiangre commented 4 years ago

Do we need to update the mock data based on the translations update @pavel-ilin?

pavel-ilin commented 4 years ago

Do we need to update the mock data based on the translations update @pavel-ilin?

Yes we do need to modify mock data. I can start working on that on Monday.

ngiangre commented 4 years ago

@salvolpe can you update on how usable the previous mock data was for your analysis? And recommend any useful modifications? Also update what analysis you did :)

salvolpe commented 4 years ago

@ngiangre @pavel-ilin The mock-data really wasn't useful, unfortunately, because of the limited relationships between the features and near-pure randomness of the construction. I'd love to set up a time to chat this weekend on ways that we make the mock data more robust for a model (that hopefully) works

pavel-ilin commented 4 years ago

I believe that issue is not ready to be closed. In pr #20 only show the structure of the mock data.

Next step will be to add logic of how we generate data and what parameter increase probability of having a covid.