-
**What happened**:
After I merge two data frames on their index, then `reset_index` and try to access the column that was the index, I get an exception.
**Minimal Complete Verifiable Example**:
…
-
I feel like the question about datframes with distributed arrays comes up a lot. My impression is that we don't know, for sure, if a Dagger array etc. can "just work" as a column in a DataFrame.
I…
-
I would like to discuss how we can make it easier to utilize a cluster of mixed architectures -- focusing on mixing GPU and CPU tasks/workers.
### Background
Currently, a common setup is to ha…
-
**Minimal Code To Reproduce**
```
from fugue import transform
from dask.distributed import Client
client = Client() # without this, dask is not in distributed mode
from fugue_dask import DaskEx…
-
Dask has a tremendous amount of well-written documentation, but I am not sure if it is presented effectively (if anyone has any insight on this I'd love to get some data about it). These are some idea…
-
Just raising an issue to ask if this is still in use and still makes sense to use, sorry to bother
-
Along the lines of #215, it seems like there are quite a few parameter types that would be desirable to pass as inputs (most notably `pd.DataFrame`s) that are simple enough to translate to/from JSON. …
bnaul updated
5 years ago
-
From https://github.com/geopandas/dask-geopandas/issues/78, but I made a small standalone example to illustrate the issue (see below for the full example workflow).
If you have a workflow where you…
-
**What needs doing**
Improve user experience to learn how to do large joins. Customer feedback is that `JoinExternal` suggest that the operator can be used for joins between two large dataframes. In …
-
Currently if `Ensemble.from_dask_dataframe` loads a dataframe with an index with the same label as the column mapper, the following warning is produced:
`dask/dataframe/core.py:5251: UserWarning: N…