-
On Linux, Debian Buster, Julia 1.11.1 (started as `julia -p 4`)
Today, I first typed:
```julia
using Base.Threads
```
All good.
Then, by mistake, I typed:
```julia
using Distributef
```
(instead of `…
-
I was looking through a [flaky test report](https://github.com/dask/distributed/runs/6215173657?check_suite_focus=true) and saw this:
```python-traceback
--------------------------- Subprocess s…
-
### Issue Description
Hello,
Let's assume that we have a macvlan network and in that network we have a container with a static IP.
When redeploying the container (rm + run), the new container has t…
-
Has anyone else encountered the same error when training with the frame dataset in parallel? The problem should not be caused by the sequential sampling technique.
```
Epoch 2/30: 0%| …
-
I'm trying to create a WIT interface in which a type (record) reuses another type (record) from wasi-http.
I'm using the wasmCloud dev tools, but the error still occurs in the bindgen macro:
> error: failed …
-
### 🐛 Describe the bug
@IvanKobzarev was like "Dynamo can trace functional collectives, why isn't my code working?" And it turns out they were calling torch.distributed.nn.functional, which is not fu…
-
### 🚀 The feature, motivation and pitch
It is common for folks to want to deploy multiple vLLM instances on a single machine because the machine has several GPUs (commonly 8). …
-
# Checklist
* [ ] I've searched the issue queue to verify this is not a duplicate feature request.
* [ ] I've pasted the output of `kargo version`, if applicable.
* [ ] I've pasted logs, if appli…
-
I am trying out the SageMaker notebooks on AWS. The third notebook, on distributed training with data parallelism, does not work. I believe the problem is that the training process on the 2nd doesn't start up…
-
@ericneiva
The moment fitting integration is not currently working on distributed.
I have made some progress in my fork [distributed_moment_fitting](https://github.com/pmartorell/GridapEmbedde…