Closed Mr0grog closed 3 years ago
Aaaaaaaand now that I finally filed this issue after it failing for a week, they seem to have fixed it. 🙄
This is still worth addressing, though — we’ve seen it before and will probably see it again. The best spot is probably to change utils.parse_datetime()
: https://github.com/sfbrigade/data-covid19-sfbayarea/blob/24269d83237bdbae9c632e71365b79426f6a1e02/covid19_sfbayarea/utils.py#L31-L50
The Santa Clara County data scraper is currently failing because the county’s dataset has a date from 1921:
We’ve seen similar issues before with dates from 1920 as well. They usually get resolved by the county, but it can take several days or more for that to happen, and it’s stupid for the scraper to be broken for that whole time.
I think it’s relatively safe to assume someone accidentally entered 2-digit year in some form where a 4-digit one was required, so we can safely change these to 2020 or 2021.