-
## ✨ Feature Request
Return a "long" DataFrame which retains all the metadata of the cube (either by default or using kwarg `table="long"`).
## Motivation
Currently `iris.pandas.as_data_frame` tu…
-
I am new to your project and want to write a strategy and start backtesting. I have already seen the comment in the backtesting documentation that the documentation isn't up to date at the moment. I r…
-
I have recently started to use Modin to accelerate operations when using Pandas DataFrames with Dask. I like the idea that by just changing a single line of code in my notebook, I can get the most fro…
-
**Motivation: Why do you think this is important?**
Flytekit should support Vaex as a pandas alternative for the FlyteSchema object.
https://github.com/vaexio/vaex
Vaex has great performance on a sin…
-
Many DataFrame methods ([now including `__getitem__`](https://github.com/pydata/pandas/issues/11485)) accept callables that take the DataFrame as input, e.g., `df[lambda x: x.sepal_length > 3]`.
Howe…
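
For readers unfamiliar with the callable form mentioned above, here is a minimal sketch of how it behaves in pandas. The sample data and the `ratio` column are hypothetical and only serve the illustration; the `sepal_length` column name follows the example in the text.

```python
import pandas as pd

# Hypothetical sample data; only the column name comes from the example above.
df = pd.DataFrame(
    {"sepal_length": [2.5, 3.1, 4.7], "species": ["a", "b", "c"]}
)

# The callable receives the DataFrame and returns a boolean mask,
# which __getitem__ then uses to select rows.
selected = df[lambda x: x.sepal_length > 3]

# Equivalent explicit form, which needs a name for the DataFrame.
explicit = df[df.sepal_length > 3]

# The callable form is mostly useful in method chains, where the
# intermediate DataFrame has no name to refer to.
chained = (
    df.assign(ratio=lambda x: x.sepal_length / x.sepal_length.max())
    [lambda x: x.ratio > 0.6]
)

print(selected)
print(chained)
```

Both `selected` and `explicit` hold the same rows; the chained variant filters on a column that only exists inside the chain.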
-
Pandas DataFrames containing text columns are expensive to serialize. This affects dask.dataframe performance in multiprocessing or distributed settings.
### Pickle is expensive
In particular the cu…
-
**Describe the issue**:
pytest is failing in some unit tests.
**Environment**:
- Dask version:
- Python version: 3.8.15
- Operating System: Linux x86/64
- Install method (conda, pip, source): VC…
-
### Description
In the Ray Dataset implementation of [`to_dask()`](https://docs.ray.io/en/latest/_modules/ray/data/dataset.html#Dataset.to_dask), there currently isn't a way to pass in a custom meta …
-
I am trying to use modin to speed up an image processing pipeline that uses pandas dataframes to store images in individual columns together with computed image properties in other columns. I am new t…
-
There appears to be an issue when resetting the index of a `dask.dataframe` after performing a `dd.groupby`, when the resulting Dask DataFrame has a MultiIndex:
```
import pandas as pd
impor…