Closed TeeZA42 closed 3 years ago
This also affects: covid19za_provincial_cumulative_timeline_deaths.csv covid19za_provincial_cumulative_timeline_recoveries.csv
for dates 08-01-2021, 07-01-2021, 06-01-2021
@lrossouw Your "Scrape & update cumulative provincial data" is adding incorrect data.
I have a number of questions:
I notice the file naming convention changed from: https://www.nicd.ac.za/latest-confirmed-cases-of-covid-19-in-south-africa-05- jan-2021 to: https://www.nicd.ac.za/latest-confirmed-cases-of-covid-19-in-south-africa-06- january-20210
So maybe that effected the automation
I have fixed the stats by deleting the rows for 2020-01-08 and correcting the numbers for 2020-01-07. Hopefully, no automatic script is going to over write those now.
Re-opening issue as I still think there is an issue with possible an automation script of @lrossouw
Sorry guys I was just investigating. I will fix. It's indeed automated.
The error slipped in because the URL used to publish was changed. It is usually following this format: https://www.nicd.ac.za/latest-confirmed-cases-of-covid-19-in-south-africa-05-jan-2021/
But on the 6th it wasn't picking up because the format had changed to https://www.nicd.ac.za/latest-confirmed-cases-of-covid-19-in-south-africa-06-january-20210/
Note the written out january
.
On the 6th I manually changed the URL my script checks to the one shown above, so that it could capture the data,, but forgot to change it back. Which meant that it kept checking that URL and using it to produce data for the additional days.
I have several checks in place:
Posting exactly the same data passes all the checks. This is the first time it committed incorrect data and was due to human error.
I had shared details of this process when I implemented it in #767 when I implemented it.
I've fixed the problem and this can be closed.
Closing given no further comments.
covid19za_provincial_cumulative_timeline_confirmed covid19za_timeline_testing
repeated numbers