sfbrigade / data-covid19-sfbayarea

Manual and automated processes of sourcing data for the stop-covid19-sfbayarea project
MIT License
8 stars 10 forks source link

HOTFIX: handle repeat rows in Solano age groups #161

Closed Mr0grog closed 3 years ago

Mr0grog commented 3 years ago

The last day of Solano's data seems to be getting repeated in their dataset (every row is repeated once, totally verbatim, as far as I can tell). I'm not sure why that's happening (it's there that way in the source data if you go look on the web), but it seems benign, so I've just added a fix here to de-duplicate the age groups (we just take the last value we have for each group). This already happens naturally for race/ethnicity and gender (since we build dicts instead of lists), so there's no need to fix those.

Mr0grog commented 3 years ago

Rebased on master.

@benghancock it looks like it’s working for me again now. How about you?

benghancock commented 3 years ago

Yep, working now! Good to go.