cmudig / AutoProfiler

Automatically profile dataframes in the Jupyter sidebar
BSD 3-Clause "New" or "Revised" License
339 stars 11 forks source link

Plot bivariate charts #95

Closed willeppy closed 1 year ago

willeppy commented 1 year ago

Want ability to plot multiple columns together rather than only univariate information. This is very common task so would see lots of use, however needs careful design so as not to violate the minimal nature of AutoProfiler currently.

Example

If I have dataframe with columns: [x, y]. I want to be able to plot x vs y

willeppy commented 1 year ago

One potential design is for all columns in a dataframe you can use a dropdown, or dragging, to combine with other columns. Then this plot reactively updates so long as the input columns are still in the dataframe. This specification would happen entirely through the UI

willeppy commented 1 year ago

Another option is to do so programatically:

import digautoprofiler
digautoprofiler.plot(df['x'], df['y'])

This seems worse than the first option at first glance

willeppy commented 1 year ago

Putting here for tracking, but I think out of scope for now is considering 3D data. For instance, I have, x, y, z data that I want to plot together. I think for initial version we will limit to just bivariate