datasets / covid-19

Novel Coronavirus 2019 time series data on cases
https://datahub.io/core/covid-19

Automate keeping data up to date by pulling data from upstream #11

Closed: nirabpudasaini closed this 4 years ago

nirabpudasaini commented 4 years ago

We want to automate collecting the data every day (or even every half-day?). Since the upstream repo is updated at 23:59 GMT (once a day), we can run our update script right after that time, e.g., at 00:00 GMT.

Acceptance criteria

Tasks

Future

larsonreever commented 4 years ago

This will help us a great deal, as we are building a community-based solution to track coronavirus near you. The initiative is named Corona Warriors, and it will be an innovative step toward helping curb the spread. GitHub has lots of good resources which we can leverage. Contributions and partnerships are welcome.

anuveyatsu commented 4 years ago

Mostly done; will close it once the scheduled work succeeds. Note that we're now running the jobs every 6 hours instead of every 24 hours because the upstream repo is not updated on a regular basis.
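
For illustration only, here is a minimal Python sketch of what a 6-hour cadence means in terms of run times. It is not the repo's actual workflow configuration; `next_run` and `INTERVAL_HOURS` are hypothetical names, and the sketch assumes runs are aligned to UTC midnight and that the interval divides 24 evenly. In cron syntax the same cadence would typically be written `0 */6 * * *`.

```python
from datetime import datetime, timedelta, timezone

# Assumed cadence from the comment above; 24 would give the original daily run.
INTERVAL_HOURS = 6


def next_run(now: datetime, interval_hours: int = INTERVAL_HOURS) -> datetime:
    """Return the first interval boundary strictly after `now`, in UTC.

    `now` must be timezone-aware. Boundaries are counted from UTC midnight,
    so interval_hours is assumed to divide 24 evenly.
    """
    now = now.astimezone(timezone.utc)
    midnight = now.replace(hour=0, minute=0, second=0, microsecond=0)
    # Number of whole intervals already elapsed today.
    slots_elapsed = (now - midnight) // timedelta(hours=interval_hours)
    return midnight + timedelta(hours=interval_hours * (slots_elapsed + 1))
```

With a 6-hour interval, an upstream update at 23:59 GMT is picked up by the 00:00 run, and an irregular update later in the day waits at most 6 hours rather than up to 24.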

anuveyatsu commented 4 years ago

FIXED, see https://github.com/datasets/covid-19/actions/runs/61404499

zelima commented 4 years ago

Something is wrong: the last 2 workflows are broken, see https://github.com/datasets/covid-19/actions

anuveyatsu commented 4 years ago

@zelima that was due to empty commits, i.e., no changes in the upstream. I've now fixed it to succeed if there are no changes, see https://github.com/datasets/covid-19/commit/26567f9e71323968943815dce55326c9ea9be94c
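
One common way to avoid this kind of failure is to check for changes before committing. The sketch below is an assumption about the general approach, not a copy of what the linked commit does; `run_git` and `commit_if_changed` are hypothetical helper names. It relies on `git status --porcelain` printing nothing when the working tree is clean.

```python
import subprocess
from typing import Callable, Sequence

Runner = Callable[[Sequence[str]], str]


def run_git(args: Sequence[str]) -> str:
    """Run a git command in the current directory and return its stdout."""
    result = subprocess.run(
        ["git", *args], check=True, capture_output=True, text=True
    )
    return result.stdout


def commit_if_changed(message: str, run: Runner = run_git) -> bool:
    """Stage and commit everything, but only when something changed.

    Returns True if a commit was made, False if there was nothing to commit.
    """
    # `git status --porcelain` prints one line per changed path,
    # and no output at all for a clean working tree.
    if not run(["status", "--porcelain"]).strip():
        return False  # upstream had no new data: succeed without committing
    run(["add", "--all"])
    run(["commit", "-m", message])
    return True
```

The `run` parameter exists only so the logic can be exercised without a real repository; a scheduled job would call `commit_if_changed("Update data")` and treat a `False` result as a successful no-op.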