iceweasel1 / COVID-19-Germany

All officially confirmed cases of infection in Germany. Listed by federal state, case number, date, county and source.

best data source for fine-grained data from Germany? #10

Open jgehrcke opened 4 years ago

jgehrcke commented 4 years ago

Hey. I found this and it looks great. However, the repo already seems to be out of date. What's the best fine-grained source for data from Germany?

update: https://github.com/jgehrcke/covid-19-germany-gae

luxxyarns commented 4 years ago

It seems nobody official has decided to provide anonymized, detailed data to developers for this incredibly dangerous issue. In South Korea people were able to create useful apps, but it seems we don't get that kind of data. Any ideas?

almereyda commented 4 years ago

The data can be scraped from the PDFs of the official situation report at RKI.

https://www.rki.de/DE/Content/InfAZ/N/Neuartiges_Coronavirus/Situationsberichte/Gesamt.html

https://www.rki.de/DE/Content/InfAZ/N/Neuartiges_Coronavirus/Situationsberichte/Archiv.html

Also wondering where and how open public data is going to be published about this.

Not on https://www.govdata.de/ as far as I can tell.

We can track https://github.com/CSSEGISandData/COVID-19/issues/397 and https://github.com/CSSEGISandData/COVID-19/issues/826 for further discoveries.

jgehrcke commented 4 years ago

Here I explain why the RKI numbers sometimes lag 1-2 days behind what the individual ministries of health in Germany actually publish: https://gehrcke.de/2020/03/covid-19-http-api-for-german-case-numbers/

That blog post also describes how to use an HTTP API for getting the current case count, based on zeit.de data.

gist of it:

$ curl https://covid19-germany.appspot.com/now 2> /dev/null | jq
{
  "current_totals": {
    "cases": 9348,
    "deaths": 25,
    "recovered": 72,
    "tested": "unknown"
  },
  "meta": {
    "contact": "Dr. Jan-Philip Gehrcke, jgehrcke@googlemail.com",
    "source": "zeit.de (aggregates data from individual ministries of health in Germany)",
    "time_source_last_consulted_iso8601": "2020-03-18T00:11:24+00:00",
    "time_source_last_updated_iso8601": "2020-03-17T21:22:00+01:00"
  }
}
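
The two timestamps under `meta` give a rough idea of how fresh the number is. A minimal Python sketch, assuming the response keeps the field names shown above (the helper name `source_lag` is made up for illustration and is not part of the API):

```python
from datetime import datetime, timedelta


def source_lag(meta: dict) -> timedelta:
    """Time between the source's last update and the API's last check of it."""
    consulted = datetime.fromisoformat(meta["time_source_last_consulted_iso8601"])
    updated = datetime.fromisoformat(meta["time_source_last_updated_iso8601"])
    # Both values carry a UTC offset, so the subtraction is timezone-aware.
    return consulted - updated


# Values copied from the sample response above.
sample_meta = {
    "time_source_last_consulted_iso8601": "2020-03-18T00:11:24+00:00",
    "time_source_last_updated_iso8601": "2020-03-17T21:22:00+01:00",
}
print(source_lag(sample_meta))  # prints 3:49:24
```

For the sample above, the source had last been updated a bit under four hours before the API last consulted it.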

Say at a certain moment in time you get the current case count from all the individual ministries of health in Germany. Then you get a credible case count sum for Germany for that moment in time. This is what zeit.de seems to be doing. RKI is not quite doing that.

So: for the current case count in Germany, use the HTTP API above. For historical data use the data in this repo here. I think that's a fair approach.
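
For use from a script rather than the shell, the same request could look roughly like this in Python. This is only a sketch: it assumes the `/now` endpoint is still reachable and returns JSON in the shape of the sample response above, and `extract_totals` and `fetch_now` are illustrative names, not part of the API.

```python
import json
import urllib.request

# Endpoint taken from the curl call above; it may no longer be online.
NOW_URL = "https://covid19-germany.appspot.com/now"


def extract_totals(payload: str) -> dict:
    """Return the numeric fields of `current_totals`, skipping e.g. "unknown"."""
    totals = json.loads(payload)["current_totals"]
    return {key: value for key, value in totals.items() if isinstance(value, int)}


def fetch_now(url: str = NOW_URL, timeout: float = 10.0) -> dict:
    """Fetch the current totals over HTTP (needs network access)."""
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        return extract_totals(resp.read().decode("utf-8"))


# Offline check against the totals from the sample response quoted above.
sample = (
    '{"current_totals": {"cases": 9348, "deaths": 25, '
    '"recovered": 72, "tested": "unknown"}}'
)
print(extract_totals(sample))  # prints {'cases': 9348, 'deaths': 25, 'recovered': 72}
```

Filtering on integer values keeps the `"tested": "unknown"` placeholder from ending up in any arithmetic downstream.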

jgehrcke commented 4 years ago

If you'd like to have a look, here I announce an HTTP API that provides time series data for individual German states: https://gehrcke.de/2020/03/covid-19-http-api-german-states-timeseries/

https://github.com/jgehrcke/covid-19-germany-gae

Feedback welcome!

luxxyarns commented 4 years ago

this is new: https://npgeo-corona-npgeo-de.hub.arcgis.com/datasets/dd4580c810204019a7b8eb3e0b329dd6_0/geoservice?orderBy=AnzahlFall&orderByAsc=false

Fine-grained data for Germany ...