davesgonechina closed this issue 3 years ago
Hi @davesgonechina, thanks for reporting this issue! This looks like a problem with how we're handling LuxSeriesGroupBy. I believe Lux handles the case of df.groupby("some_column")["some_other_column"], but there seems to be an issue when describe is called on the result (since the return type is a Pandas dataframe). We will look into this and get back to you once we pin down the bug.
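For anyone following along, the return-type point can be checked in plain pandas, without Lux installed. This is only a minimal sketch: the small frame below is made-up stand-in data (just the column names are borrowed from the college dataset used later in this thread), and it says nothing about how Lux wraps these objects internally.

```python
import pandas as pd

# Made-up stand-in data (an assumption for illustration); only the column
# names mirror the college dataset referenced in this thread.
df = pd.DataFrame({
    "FundingModel": ["Public", "Public", "Private", "Private"],
    "AdmissionRate": [0.60, 0.80, 0.30, 0.50],
})

# Selecting one column from a groupby gives a SeriesGroupBy object...
grouped = df.groupby("FundingModel")["AdmissionRate"]

# ...but describe() on it returns a DataFrame: one row per group,
# one column per summary statistic.
summary = grouped.describe()

print(type(grouped).__name__)
print(type(summary).__name__)
print(list(summary.columns))
```

So a wrapper around the grouped series has to cope with a method whose result is a frame rather than a series, which is consistent with the suspicion above.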
Related, it appears describe() doesn't work with Lux at all?
Using df = pd.read_csv("https://raw.githubusercontent.com/lux-org/lux-datasets/master/data/college.csv"), df will yield a Lux button in my Jupyter notebook, but df.describe() is just a plain Pandas data frame with no Lux button.
Hi @davesgonechina, we previously disabled df.describe since the visualizations were not very meaningful.
I've added the Lux capabilities back to df.describe for consistency. I've also fixed the bug related to groupby describe:
df = pd.read_csv("https://raw.githubusercontent.com/lux-org/lux-datasets/master/data/college.csv")
df.groupby("FundingModel")["AdmissionRate"].describe()
We will look into displaying more useful recommendations (e.g. box plots) for df.describe in the future.
These changes will get merged in soon with our latest release. Let us know if this fixes the issue that you're seeing!
https://github.com/lux-org/lux/blob/65fa234143a57adcf25414713c85deebe3f2c5cd/lux/core/groupby.py#L71
Running df.groupby("some_column")["some_other_column"].describe() in a Jupyter notebook results in:
At first I thought this might be a Jupyter problem, but increasingly I suspect it is related to how LuxSeriesGroupBy is handled?