The goal of this pull request is to allow metadata parameters to be kept while doing group by operations.
It can be used for example when having a plot pipeline, to store information about plot configuration until we show it (with the spirit of ggplot/plotly pipelines in R)
The _group_by property could then also be registered as a dataframe metadata, which would simplify the copy of dataframes in dfply.
The goal of this pull request is to allow metadata parameters to be kept while doing group by operations.
It can be used for example when having a plot pipeline, to store information about plot configuration until we show it (with the spirit of ggplot/plotly pipelines in R)
The
_group_by
property could then also be registered as a dataframe metadata, which would simplify the copy of dataframes in dfply.More about
_metadata
in pandas: https://pandas.pydata.org/pandas-docs/stable/internals.html#define-original-properties