openZH / covid_19

COVID19 case numbers of Cantons of Switzerland and Principality of Liechtenstein (FL). The data is updated at best once a day (times of collection and update may vary). Start with the README.
https://www.zh.ch/de/gesundheit/coronavirus/zahlen-fakten-covid-19.zhweb-noredirect.zhweb-cache.html?keywords=covid19&keyword=covid19#/
Creative Commons Attribution 4.0 International
424 stars 176 forks source link

AG: scrap weekly district data #1193

Closed balzamas closed 3 years ago

balzamas commented 4 years ago

Canton AG publishes total numbers by district: https://www.ag.ch/de/themen_1/coronavirus_2/lagebulletins/lagebulletins_1.jsp

The data is in a jpg, which gives quite good result using tesseract as OCR.

I currently download a daily copy of the jpeg to make further analysis, see example: ag_snapshots.tar.gz

Problems a) the publication rythm is not clear. Usually it was Friday, but for example this week they changed the numbers on Wednesday. b) there is no date information in the picture, only in the subtext. e.g. "Kanton Aargau – Inzidenz pro 100'000 Einwohner nach Bezirken seit Start Contact Tracing Center (Conti) 11. Mai 2020 (Stand: 13.10.2020, 08:20 Uhr)"

metaodi commented 3 years ago

@balzamas I implemented this scraper as suggested by you. Thank you! Let's see how stable it works over time.

balzamas commented 3 years ago

Thank you @metaodi ! Here are all the files I fetched, timestamp in the file name:

ag_fetch.tar.gz