GSS-Cogs / family-covid-19

0 stars 0 forks source link

PHE-Coronavirus-COVID-19-in-the-UK #36

Open ajtucker opened 4 years ago

RobThomlinson commented 4 years ago

BE Data Spec Review 10/6/2020

LPerryman commented 4 years ago

FileNotFoundError: [Errno 2] File b'coronavirus-cases_latest.csv' does not exist: b'coronavirus-cases_latest.csv'

RedWalters commented 4 years ago

Might need to go back to BAs, the landing page and dataset has been refactored and one of the two datasets (coronavirus-deaths_latest) has been discontinued. Also the legacy download for coronavirus-cases_latest cannot be downloaded directly from the site due to what seems to be dodgy hosting.

RedWalters commented 4 years ago

"there has been significant change to this dataset and current priority has moved to Towns & High Streets, we've decided to put the tech stage for this on Airtable to "hold", BA stage as "To Document"."

LPerryman commented 4 years ago

Data has been published for cases by specimen date: Main dataset has been taken from here in CSV form for Cases by specimen date, by nation: https://coronavirus.data.gov.uk/cases This has cases by nation for all nations

but a legacy down load has also been used from here: https://coronavirus.data.gov.uk/about-data#cases-by-lower-super-output-area-lsoa https://coronavirus.data.gov.uk/downloads/csv/coronavirus-cases_latest.csv it states on the page that this data will be updated for the foreseeable future This only has cases for England by LTLA, UTLA and Regional

Not clear exactly what legacy data represents but both files have been compared and numbers match up exactly for England so have assumed legacy file is also for specimen by date.

Data still needs to be published for Deaths

LPerryman commented 4 years ago

I have not created a spec for this transform as the CSV is already in datacube format. I have just removed the cumulative and rate data as we can only publish one Measure Type at present, in this case a count of cases.

ajtucker commented 4 years ago

@RobThomlinson , @david-hull , @Tracey-B : landingPage in Airtables has some extraneous ?_ga=, is it needed?

LPerryman commented 4 years ago

After speaking to Dave H landing page is changing to https://coronavirus.data.gov.uk/about-data as this holds historical data for coronavirus cases and will be updated.