-
**Describe the bug**
When an IntCategory axis with growth=False and overflow=False is filled with values that fall outside the defined bins (which should be a valid combination, and is useful in ce…
-
Hello, 😄
I'm a data scientist/data engineer working on a specific workflow where I need to process a huge number of documents, so I tried using `distributed.Queue` to write a producer/consumer p…
-
**What happened**:
KilledWorker exception for a rather small computation using dask dataframes.
We schedule a dask computation based on simulated data as part of our integration tests. In our te…
-
The "Graph" or "Task Graph" visualisation on the web UI seems to only show a visualisation when there's a running job on the dask cluster.
Is there a way to make this visualisation persistent? Or t…
-
**What happened**:
Dask scheduler is timing out
**What you expected to happen**:
Return result
**Minimal Complete Verifiable Example**:
```
from xxxxx.distributed.ecscluster import EcsCluste…
-
It seems that stackstac will occasionally hang indefinitely while doing a dataset read:
![image](https://user-images.githubusercontent.com/22163508/123011187-efaeec80-d38d-11eb-9c35-f5f4a2866f23.png)…
-
I am trying to create a Dask cluster using this code:
https://github.com/frenchlam/dask_CDSW/blob/master/03_Dask_ML-LargeDS.ipynb
But it gives me this error when starting up the cluster:
Timed out t…
-
In dask-cuda we [benchmark how Dask+RAPIDS (cuDF)](https://medium.com/rapids-ai/high-performance-python-communication-with-ucx-py-221ac9623a6a) performs merges with a combination of accelerated comput…
-
The `locate_unlinked()` function is a bottleneck for various analyses. It should be possible to write a multi-threaded implementation using dask.
-
`max_nbytes` sets the threshold that triggers memmapping of numpy arrays in `joblib` for process-based backends.
The `dask` backend also has a data transfer optimization feature (auto scattering), th…