djay / covidthailand

Thailand Covid testing and case data gathered and combined from various sources for others to download or view

Abnormal Vaccinations Data by Regions after Oct 4 2021 #137

Closed RuijingZ closed 2 years ago

RuijingZ commented 2 years ago

Hello,

I am also trying to look at vaccinations by regions from this path: https://raw.githubusercontent.com/wiki/djay/covidthailand/vaccinations.csv

I found that the data after 10/4/2021 may be wrong. For example, 'Vac Given 1 %' for Bangkok is supposed to be below 100, but it shows as 8001760 on 10/4/2021. I guess some columns might have shifted and now hold values that belong to other fields. Could you take a look at it? Thank you again!

reduxionist commented 2 years ago

Yes, that definitely looks like some column shifting going on! Thanks for bringing it to our attention. There have been a number of data issues to deal with today, some of which we are still working on, so I can't give you an ETA yet for when we will get to look at this one. I might have time this weekend, however...

RuijingZ commented 2 years ago

Thanks for looking into it! I totally understand that it takes time to maintain such a large dataset. Next week is fine, thanks again!

djay commented 2 years ago

@RuijingZ if you find any other days where the data looks wrong, please list the specific days where the problem started and stopped, so we can add tests for those days and get it fixed. The same goes for any data in the existing test days that looks wrong (see https://github.com/djay/covidthailand/tree/main/tests/vaccination_tables ). For example, it seems like https://github.com/djay/covidthailand/blob/main/tests/vaccination_tables/1633505880456.2021-10-05.json is wrong, based on what you noticed above.
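
One rough way to find where a problem starts and stops is to scan the CSV for percentage values outside a plausible range and report the first and last affected dates. The snippet below is only a sketch, not code from this repo: the `Date` and `Province` column names, the sample rows, and the `suspicious_dates` helper are assumptions made for illustration, and the 0 to 100 bounds are a guess that may need loosening. Only the 'Vac Given 1 %' column name and the 8001760 value come from this issue.

```python
import csv
import io

# Made-up sample rows for illustration only; the real file is vaccinations.csv.
# "Date" and "Province" are assumed column names; "Vac Given 1 %" is the
# column reported in this issue.
SAMPLE = """Date,Province,Vac Given 1 %
2021-10-03,Bangkok,98.5
2021-10-04,Bangkok,8001760
2021-10-05,Bangkok,8012345
2021-10-06,Bangkok,99.1
"""


def suspicious_dates(rows, column="Vac Given 1 %", low=0.0, high=100.0):
    """Return the sorted dates on which `column` falls outside [low, high]."""
    bad = set()
    for row in rows:
        raw = (row.get(column) or "").strip()
        if not raw:
            continue  # missing value, nothing to check
        try:
            value = float(raw)
        except ValueError:
            bad.add(row["Date"])  # non-numeric text in a numeric column
            continue
        if not low <= value <= high:
            bad.add(row["Date"])
    return sorted(bad)


dates = suspicious_dates(csv.DictReader(io.StringIO(SAMPLE)))
if dates:
    print(f"out-of-range values from {dates[0]} to {dates[-1]}")
```

To run this against the real data, replace `io.StringIO(SAMPLE)` with an open handle to a downloaded copy of vaccinations.csv, after checking that the column names match.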

djay commented 2 years ago

@RuijingZ this should now be fixed. If you notice any other specific dates where the reports have different data than the CSV, please raise a new ticket with an example date.