-
### What is your issue?
Chaining many processing on a huge (200GB+) dask-backed datasets lead to huge graphs (500MB+) being passed. More data, bigger the graph is, to the point where the graph is s…
-
I'm loading [a big bunch of tomographic data](https://github.com/habi/laminitis), crop out small regions, rechunk those with `chunks='auto'` and want to save them to disk with `.to_zarr`.
`dask.__v…
habi updated
22 hours ago
-
This is the same issue as https://github.com/rapidsai/dask-cuda/issues/1408 . Cross-posting here as it's more related to cuDF instead of `dask-cuda`.
The following snippet works with `DASK_DATAFRAME_…
-
https://github.com/dmlc/xgboost/actions/runs/11753771153/job/32747003155
```
E distributed.client.FutureCancelledError: ('_argmax-06657a445bd2e0d811c6ff48d5860817', 24) cance…
-
## Description
I am experiencing an issue when trying to reproject a `dask_geopandas.GeoSeries` or `dask_geopandas.GeoDataFrame` to a new CRS. The `set_crs` and `to_crs` methods return an `AttributeE…
dluks updated
2 weeks ago
-
Icechunk does support distributed writing of Arrays. However, currently Icechunk does not allow writing arrays via `dask.array.store` with distributed or multiprocessing schedulers.
This is becaus…
-
**Is your feature request related to a problem? Please describe.**
Current documentation shows small examples. Working on real large datasets varies in some ways and has specific needs:
- limiting Da…
-
### What kind of feature would you like to request?
Additional function parameters / changed functionality / changed defaults?
### Please describe your wishes
Meta issue tracking scanpy functions w…
-
Hey, I have been trying to build the docker image for autogluon for a while now but I keep running into missing requirements.
The current one seems to be missing dask.compatibility. I tried installi…
-
### Checklist
- [x] I added a descriptive title
- [x] I searched open requests and couldn't find a duplicate
### What is the idea?
Hello folks,
I've been maintaining the github.com/GoogleCloudData…