owid / covid-19-data

Data on COVID-19 (coronavirus) cases, deaths, hospitalizations, tests • All countries • Updated daily by Our World in Data
https://ourworldindata.org/coronavirus
5.66k stars 3.64k forks source link

possible errors in covid and all-cause mortality death counts #1550

Closed czka closed 3 years ago

czka commented 3 years ago

In my script I aggregate daily covid death counts (column new_deaths in owid-covid-data.csv) into weekly or monthly, and substract the aggregated value from the (weekly or monthly, depending on a country) all-cause mortality (deaths_2020_all_ages in excess_mortality.csv).

I'm getting strange/impossible results for few countries at certain dates:

I did my best to rule out any errors in my script (it's quite simple if you want to take a look). Could you help me find out whether the culprit for these singular artifacts on these 7 charts might be due to some errors in OWID's data, or the sources you rely on? Remaining 84 charts for 2020 and all 91 charts for 2021 don't have any such obvious quirks, so I'm guessing these could be errors in the data indeed.

edomt commented 3 years ago

Hi @czka

We source our data on COVID-19 confirmed cases and deaths from the COVID-19 Data Repository by the Center for Systems Science and Engineering (CSSE) at Johns Hopkins University: https://github.com/CSSEGISandData/COVID-19. This page also lists their sources, country by country. We report this data without alterations, but issues can be raised in their GitHub repository: https://github.com/CSSEGISandData/COVID-19/issues

That being said, there are two main caveats to the comparisons you're making here:

czka commented 3 years ago

@edomt Yes, I know where OWID are taking their data from. I was hoping OWID would forward my report as needed. Thanks for your interest though.

all the excess deaths counted during a period don't have to be COVID deaths

Are you telling me this because you are under impression my charts claim otherwise? This wasn't my intention.

excess mortality can sometimes be lower than the official number of COVID deaths

I'm confused why you mention excess mortality again. I'm not showing excess mortality as such on my charts. I'm showing: all-cause mortality (solid black), all-cause mortality minus covid mortality (dashed black) and all-cause mortality in previous years as a background/reference for these.