epiforecasts / covidregionaldata

An interface to subnational and national level COVID-19 data. For all countries supported, this includes a daily time-series of cases. Wherever available we also provide data on deaths, hospitalisations, and tests. National level data is also supported using a range of data sources as well as linelist data and links to intervention data sets.
https://epiforecasts.io/covidregionaldata/
Other
37 stars 18 forks source link

Add covid19 data hub wrapper #338

Closed joseph-palmer closed 3 years ago

joseph-palmer commented 3 years ago

This PR adds a Covid19DataHub class which pulls and cleans data from Covid19 Data hub, specifically files marked as raw data (https://covid19datahub.io/articles/data.html). rawdata-1, rawdata-2 and rawdata-3 get level 1 (country) level 2 (region) and level 3 (subregion) data respectively.

The data collected runs through our generic cleaning and processing steps (of course with cleaning selecting and aligning the correct columns for this process).

The names of regions perhapse need changing and region codes are a little odd. ATM iso_alpha_3 is used for country, iso_level_2 is used for region and iso_numeric is used for subregions. Perhapse we could link these with the codes Google uses in their dataset?

github-actions[bot] commented 3 years ago

👋 Thanks for opening this pull request! Can you please run through the following checklist before requesting review (ticking as complete or if not relevant).

Thank you again for the contribution. If making large scale changes consider using our pre-commit hooks (see the contributing guide) to more easily comply with our guidelines.