-
### Describe the bug
When I run the training, I get this error.
### Reproduction
Run `accelerate config`:
```
compute_environment: LOCAL_MACHINE
debug: false
distributed_type: FSDP
downcast_bf16: 'n…
-
Error [Shut down address: akka.tcp://127.0.0.1@LocalSystem:17372]
[ akka.remote.ShutDownAssociation: Shut down akka.tcp://127.0.0.1@LocalSystem:17372]
Caused by: akka.remote.transport.Transport$Inval…
-
I would like to suggest something for co-distribution pacts or international sub-distribution, **JUST** in case it hasn't happened already. If the studio is not placed under the same (partial) owners…
-
Same as above
-
The test `distributed/_tensor/test_convolution_ops.py::DistConvolutionOpsTest::test_depthwise_convolution` has been regularly failing on distributed builds, but our CI tools don't realize that this wa…
-
### 🐛 Describe the bug
Hello, I am working on a project where I need to use multiple consecutive instances of DistributedDataParallel (DDP) within the same torch.distributed environment. In my scen…
-
Currently, rewards do not work as intended when applied to a range of markets, such as all markets with the same settlement asset.
Users on long-tail markets receive almost no rewards compare…
-
The current MOC ESI environment has an Ubuntu 22.04 image available, but not a 24.04 one.
The [disk image builder documentation](https://docs.openstack.org/ironic/latest/user/creating-images.html) d…
-
### Benefit
UGRC will benefit immensely from a mechanism that allows us to collaborate on datasets with our authoritative stewards. For now, this mechanism is our Enterprise/Portal.
We made some pro…
-
When I generate a variable font, the `public.fontInfo['familyName']` doesn't seem to be considered during compilation.
I see there has been some work done on the fontTools side. Is there still some w…