-
**Describe the issue**:
I understand your [documentation](https://docs.dask.org/en/stable/generated/dask.dataframe.merge.html) suggests `merge` allows a `validate` argument, like the `pandas` […
-
Hello,
As discussed in [this topic on Dask's forum](https://dask.discourse.group/t/dask-ml-kmeans-optimization/3501), my colleague and I compared, in a distributed environment, the `dask-ml` implemen…
-
# Behaviors of Flowcept CLI:
### 1. `$> flowcept config`
This will create a new settings file in `~/.flowcept/settings.yaml` with the default settings. A message will be printed to make the user awa…
-
Unless I missed it, there is no way to sort or argsort dask arrays. Is this currently a work in progress?
-
I wonder if the use of dask is sufficiently complex that it warrants its own section? Using this issue to document lesser-known tools/hacks that could potentially be included:
- [dask Active Memory…
-
## Use Case
Ability to create and use Dask-Gateway (https://gateway.dask.org/) clusters within dagster.
Dask-Gateway provides a useful way of centralizing the creation of dask clusters while still…
-
The `dask.delayed` decorator should have type annotations.
```py
from dask import delayed


@delayed(pure=True)
def well_typed_function() -> int:
    return 1
```
mypy: Untyped decorator make…
-
I'm unable to get a simple example using spacy + dask.distributed up and running. In the context of a Jupyter Notebook, I receive this error:
```
OSError: [E050] Can't find model 'en_core_web_lg.v…
-
`aggregate()` does not currently exploit dask partitions and parallelization; doing so may improve performance. We could run some experiments/benchmarks to check.
-
**Describe the bug**
The warning
```
2024-10-11 00:04:31,529 - distributed.worker.memory - WARNING - Unmanaged memory use is high. This may indicate a memory leak or the memory may not be released t…