-
Not sure if this can be resolved, but I wonder if it would be possible to check if the nodes are available before provisioning the cluster network rather than provisioning and then waiting for an erro…
-
### First check
- [X] I added a descriptive title to this issue.
- [X] I used GitHub search to find a similar request and didn't find it 😇
### Describe the issue
There are currently no examp…
-
Suggestions from the meeting today:
- [x] Single central repo, on GitHub, with software versions
- [x] Conda env resolution: Update dependencies, make pinning more flexible if possible
- [x] Add …
-
I tried to **use mim to submit training tasks asynchronously on Slurm**, using the following command:
`mim train mmcls resnet101_b16x8_cifar10.py --launcher slurm --gpus 1 --gpus-per-node 1 --partiti…
-
**Describe the bug**
When the necessary executor isn't installed now, snakemake will fail with an invalid choice message. Sunbeam should provide the proper remediation (go install the proper executor…
-
TurboVNC.repo can no longer be found at https://turbovnc.org/pmwiki/uploads/Downloads/TurboVNC.repo
New link points to https://raw.githubusercontent.com/TurboVNC/repo/main/TurboVNC.repo
This aff…
-
My workflow utilizes instance-level NVME-SSDs as a local scratch disk and therefore I utilize `EphemeralVolume` for `SlurmQueues` . It would be nice to have the total size of the `EphemeralVolume` be…
-
We're getting
eager_rcv.c:83 UCX Assertion `length >= hdr_len' failed
from time to time, consistently for a specific test case, and at least one more.
Sometime prior to that we also see repeate…
-
any example combining nextflow with slurm system?
-
Hello,
I already tried to run several jobs on a cluster. The jobs are running on the server but the output files are always empty. I would be grateful if you could help me,
Thank you in advance.…