-
## Proposed refactor
Some of the steps/workflows are repetitive across workflows/repositories, which makes it very time-consuming to make sure that all of our workflow files are updated in the same w…
-
## Proposed refactor
Tests written for the `horovod` strategy might be outdated, as most of them were written ~2 years ago.
- A lot of tests just check if the `horovod` run finished without any erro…
-
Oleh Rybkin et al. [Plan2Explore: Active Model-Building for Self-Supervised Visual Reinforcement Learning](https://bair.berkeley.edu/blog/2020/10/06/plan2explore/).
Sekar R, et al. [Planning to expl…
-
## 🚀 Feature
We need a mechanism to set the epoch on the distributed sampler via `.set_epoch()`.
### Motivation
To correctly handle shuffling with the DistributedSampler in DDP, the PyTorch …
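For context, PyTorch's `DistributedSampler` derives its shuffle order from a seed plus the current epoch, so the order only changes between epochs if `set_epoch()` is called at the start of each one. The sketch below illustrates that mechanism with a toy, standard-library-only stand-in. `ToyDistributedSampler` and `run_epochs` are hypothetical names for illustration; this is not PyTorch's or Lightning's implementation.

```python
import random


class ToyDistributedSampler:
    """Hypothetical stand-in for an epoch-aware, sharded sampler (not PyTorch's class)."""

    def __init__(self, dataset_len, num_replicas, rank, seed=0):
        self.dataset_len = dataset_len
        self.num_replicas = num_replicas
        self.rank = rank
        self.seed = seed
        self.epoch = 0

    def set_epoch(self, epoch):
        # The shuffle below is seeded with seed + epoch, so this call is what
        # allows the ordering to change from one epoch to the next.
        self.epoch = epoch

    def __iter__(self):
        indices = list(range(self.dataset_len))
        # Every rank shuffles with the same seed, so all ranks agree on the
        # permutation and their strided shards do not overlap.
        random.Random(self.seed + self.epoch).shuffle(indices)
        return iter(indices[self.rank::self.num_replicas])


def run_epochs(samplers, num_epochs):
    """Assumed trainer-side loop: set the epoch on every sampler, then iterate."""
    history = []
    for epoch in range(num_epochs):
        for sampler in samplers:
            sampler.set_epoch(epoch)
        history.append([list(sampler) for sampler in samplers])
    return history


if __name__ == "__main__":
    samplers = [ToyDistributedSampler(8, num_replicas=2, rank=r) for r in range(2)]
    for epoch, shards in enumerate(run_epochs(samplers, num_epochs=3)):
        print(epoch, shards)
```

If `set_epoch()` is never called, `self.epoch` stays at 0 and every epoch replays the same permutation, which is the behavior the requested mechanism is meant to avoid.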
-
Hi, I'm trying to build this AUR package, and it appears it is not happy with bazel 5.4.0. I have the dependencies already installed, and it would appear that /opt/rocm/include/rocrand/rocrand_version.h does de…
-
We are using autorest.powershell to generate a module for a non-Azure API.
When running a cmdlet with `-Debug`, the output seems to be missing:
```powershell
Get-ZNUser -Debug
DEBUG: CmdletBeginPro…
-
The build stops with the following error. I am using the newest ROCm packages (5.0.2-1). Any ideas on how to resolve this problem?
```
WARNING: Download from https://storage.googleapis.com/mirror.tensorflow…
-
## Proposed refactor
Deprecate the `Trainer(auto_select_gpus=True|False)` option and just enable it always when `devices=int` gets selected.
### Motivation
- The name is misleading, we don't…
-
### Search before asking
- [X] I had searched in the [issues](https://github.com/apache/dolphinscheduler/issues?q=is%3Aissue) and found no similar feature requirement.
### Description
The K…
-
## 🚀 Feature
### Motivation
Copy-pasting context from here: https://github.com/PyTorchLightning/pytorch-lightning/pull/10408#pullrequestreview-810794634
A generic idea from @ningintheclou…