GSS-Cogs / family-covid-19

0 stars 0 forks source link

Scraper investigation and implementation for Covid-19. #175

Closed ajtucker closed 3 years ago

ajtucker commented 3 years ago

2d each Shannon & JJ

ajtucker commented 3 years ago

The following datasets still use dataURL in their info.json files:

RedWalters commented 3 years ago

The following datasets still use dataURL in their info.json files:

RedWalters commented 3 years ago

WG-NHS-activity-and-capacity-during-the-coronavirus-COVID-19-pandemic #11 Currently remaining on seed as the seed points to an older ODS file than the most recent, and the most recent file seems to have some form of corruption which breaks the transform.

RedWalters commented 3 years ago

ONS-Number-of-deaths-in-care-homes-notified-to-the-Care-Quality-Commission-England #89 Scraper works but transform script needs updating (New tab in updated dataset)

RedWalters commented 3 years ago

WG-Testing-data-for-coronavirus-COVID-19 #28 Script added to take newest release, however, it has been commented out as the newest release has an issue stopping the file from being transformed Exception: invalid worksheet name "'file://HBA60/HomeB/stats/ETS1/Releases_Bulletins_Publications/Statistical%20Releases/2017/Exclusions/SDR%20XXX_2016%20English.xlsx'#SHCheck_Table03" Will need investigating

RedWalters commented 3 years ago

WG-Attendance-at-local-authority-settings-during-the-coronavirus-COVID-19-pandemic #12 Script added to pull in the most recent dataURL - transform needs updating

RedWalters commented 3 years ago

SG-Coronavirus-Covid-19-additional-data-about-adult-care-homes-in-Scotland #87 Added script to pull the latest dataURL in case of updated dataset

ajtucker commented 3 years ago

@mikeAdamss to look at whether the landing page makes its way through to the eventual metadata in PMD.

mikeAdamss commented 3 years ago

its not, temp scraper overwrites it. Addressing it here: https://github.com/GSS-Cogs/gss-utils/issues/107