robert-koch-institut / COVID-19-Todesfaelle_in_Deutschland

Im Datensatz 'COVID-19-Todesfälle in Deutschland' werden die Todesfälle in Bezug auf COVID-19 in Deutschland bereitgestellt. Darüber hinaus wird neben der Anzahl der übermittelten Todesfälle der Fall-Verstorbenen-Anteil berechnet. Angaben zum Tod zählen zu den melde- und übermittlungspflichtigen Inhalten. Bei der Ermittlung von Todesfällen und d...
https://robert-koch-institut.github.io/COVID-19-Todesfaelle_in_Deutschland/
Creative Commons Attribution 4.0 International
7 stars 4 forks source link

Covid19 Todesfaelle in WHO #1

Closed dkobak closed 5 days ago

dkobak commented 1 year ago

I am trying to understand the German Covid-19 deaths data in the WHO dataset (https://covid19.who.int/data). WHO dataset contains daily data on Covid deaths across many countries. The German data there is different from the RKI data by "Meldedatum": it appears to be shifted by around 2 weeks earlier:

germany-daily-who-rki

Curiously, the total cumulative sum is the same.

Furthermore, I compared both datasets with the RKI deaths data by "Sterbedatum" (which is only available with weekly resolution). The WHO data appears to be shifted by around 1 week earlier compared to the RKI-Sterbedatum, which does not make any sense to me.

germany-rki-who

I have tried reaching out to WHO to find out the exact source of German data that they use, but unfortunately did not receive any reply. So I thought maybe somebody at RKI knows the origin of the WHO data.

HannesWuensche commented 1 year ago

He @dkobak,

thank you for reaching out to us about the matter. The problem you refer to is described by the WHO:

Caution must be taken when interpreting all data presented, and differences between information products published by WHO, national public health authorities, and other sources using different inclusion criteria and different data cut-off times are to be expected.

In the "Data Sources" section I could not find any explicit reference on how the German data was collected. I forwarded the question to my colleges in charge of the German Covid-19 deaths. Accordingly, we will try to reach out to WHO too.

Some general remarks:

Best regards @HannesWuensche for the Team RKI | Open Data

dkobak commented 1 year ago

Thank you @HannesWuensche.

The blue curves in my figures above used this dataset: https://github.com/robert-koch-institut/COVID-19-Todesfaelle_in_Deutschland/blob/main/COVID-19-Todesfaelle_Deutschland.csv, so I guess I should have named that "Berichtsdatum" instead of "Sterbedatum".

In the "Data Sources" section I could not find any explicit reference on how the German data was collected. I forwarded the question to my colleges in charge of the German Covid-19 deaths. Accordingly, we will try to reach out to WHO too.

That would be great. I am really curious where the WHO gets the German deaths data from, and why does it differ from both RKI-Berichtsdatum and RKI-Sterbedatum. If you manage to find out anything, please let me know.