owid / etl

A compute graph for loading and transforming OWID's data
https://docs.owid.io/projects/etl
MIT License

Use `groupby(..., observed=True)` by default #3182

Closed Marigold closed 2 months ago

Marigold commented 2 months ago

When grouping by categorical columns, omitting `observed=True` creates groups for categories that have no records. This is unintuitive and causes tons of headaches. We set `observed=True` manually pretty much everywhere, so it'd be nice if this were the default.
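To make the problem concrete, here is a minimal sketch with made-up data (the column names and values are illustrative, not taken from any ETL step). It passes `observed` explicitly in both calls so the result does not depend on the installed pandas version:

```python
import pandas as pd

# Hypothetical example data: "Peru" is a declared category but has no rows.
df = pd.DataFrame(
    {
        "country": pd.Categorical(
            ["France", "France", "Kenya"],
            categories=["France", "Kenya", "Peru"],
        ),
        "value": [1, 2, 3],
    }
)

# observed=False keeps a group for every declared category, even unused ones.
with_unobserved = df.groupby("country", observed=False)["value"].sum()

# observed=True keeps only the categories that actually appear in the data.
only_observed = df.groupby("country", observed=True)["value"].sum()

print(with_unobserved)
print(only_observed)
```

With `observed=False` the result contains a row for "Peru" even though no record has that country; with `observed=True` it does not.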

pandas also emits a `FutureWarning` saying that the default value of `observed` will change to `True` in a future version.
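One possible shape for this, as a rough sketch only: a small wrapper that fills in `observed=True` unless the caller says otherwise. The name `groupby_observed` is hypothetical and not an existing function in this repo or in pandas; where such a default should actually live is up for discussion.

```python
import pandas as pd


def groupby_observed(df: pd.DataFrame, by, **kwargs):
    """Hypothetical helper: like DataFrame.groupby, but observed=True by default."""
    # Only set the default; an explicit observed=False from the caller still wins.
    kwargs.setdefault("observed", True)
    return df.groupby(by, **kwargs)


# Illustrative data: "Peru" is a declared category with no rows.
example = pd.DataFrame(
    {
        "country": pd.Categorical(["France", "Kenya"], categories=["France", "Kenya", "Peru"]),
        "value": [1, 2],
    }
)

default_sizes = groupby_observed(example, "country").size()
explicit_sizes = groupby_observed(example, "country", observed=False).size()
```

Here `default_sizes` has one entry per country present in the data, while `explicit_sizes` also includes the empty "Peru" group.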