-
## š Bug
In our project [Tencent/PatrickStar](https://github.com/Tencent/PatrickStar), we need to create a NCCL comm group and a Gloo comm group in order to utilize both GPU reduce scatter and CPU ā¦
-
Hi,
I'm getting this error when loading my package/scripts on linux server `using Distributed` with multiple cores.
I don't understand why is it happening: `check for typos in package module nameā¦
-
I am running distributed Tensorflow with GRPC protocol on only CPUs. I enabled distributed_all_reduce type of variable update with 'all_reduce_spec = xring':
I am wondering, if this mode is supposā¦
ghost updated
2 years ago
-
### What happened + What you expected to happen
I tried to use the **debugger plug-in on VScode** according to the guidance(https://www.anyscale.com/blog/ray-distributed-debugger), but when I click oā¦
-
# Adding AI primitives to Nostr
### We have built AI primitives for Nostr events including: summarize as bullets, summarize as paragraph, translate, measure sentiment, compare and contrast events, grā¦
-
Trying to work around this https://github.com/Open-Systems-Pharmacology/MoBi/issues/1290
I have created the missing parameters in the SS BB with default values 0. When created the simulation, the vā¦
-
Hi there,
I am encountering this issue while using a NCCL backend.
Seems like dist.barrier() is the problem.
```
File "maindams.py", line 928, in parse_args
dist.barrier()
File "/opt/conda/eā¦
-
I tried to create a MWE that was closer to the actual workflow I'm working with. I'm guessing the errors occurring here are related to #437 (one of the four reported errors below is the same as the liā¦
-
### What happened?
I have a large-ish weather dataset (mirrored version of a subset of WeatherBench2 data), which is stored on-disk as a collection of Zarr datasets, segregated by time. Because prā¦
-
When I run the test on my system, it works fine. The nodes are running on debian stable.
```
work@voyager:~/random/funkload$ fl-run-bench test_example.com.py example.com.test_example.com --distributeā¦