Closed jmoralez closed 2 months ago
Uses pd.Series.value_counts(dropna=False, sort=False) instead of df.groupby(col, observed=True).size() which is over 2x faster.
pd.Series.value_counts(dropna=False, sort=False)
df.groupby(col, observed=True).size()
Check out this pull request on
See visual diffs & provide feedback on Jupyter Notebooks.
Powered by ReviewNB
Uses
pd.Series.value_counts(dropna=False, sort=False)
instead ofdf.groupby(col, observed=True).size()
which is over 2x faster.