Open liuyanguu opened 3 years ago
Thanks @liuyanguu for reporting. It would seem some of these artifacts have different causes, a mix of inconsistent sources, and surely a few manual data entry errors (Jordan). To pick out the manual entry errors, jumps in daily fractions where one age pops in the opposite direction of the others serves as a good indicator. If there is a daily spike but no change in age-specific fractions, then it's a potential error in the registered total. In that case, scaling to an external consistent series of totals would cure it (we can do this internally once it's identified). Kazahkstan looks like it needs its own investigation. Let's leave this issue open as cases are addressed. cc-ing @jessicadonzowa @kikeacosta
I just realized this interesting issue since my colleagues were asking for time series of cases/deaths from the database. Because the database was built in an accumulative way, it is possible to get time series for some countries that (luckily) have frequently data updates.
This is not common as data for most countries are quite consistent. But happened in several countries like the deaths of Indonesia. I plotted the deaths against different update dates and they should monotonically increase as in most countries. I assume this bump is caused by a change in the data source?
The effect is less obvious if zoom out to all age groups:
Just to give some other examples: