-
Hello, I'm getting the same error as issue #56:
panic: runtime error: index out of range [4] with length 4
goroutine 26 [running]:
main.ParseNodeMetrics(0xc000140000, 0x25e, 0x600, 0x87080d)
…
-
Hi - I wonder if someone already tried to used LINDA on singularity and use the BATCH processsing with slurm?
I tried the slurm code below with a R script encompassing the required lines of command…
-
Hello Xiyu,
When I was running AmpliCI and pool all samples, It worked for a small dataset close to 9,000,000 but this error for 60,000,000:
/var/slurm/spool/slurmd/job40856275/slurm_script: li…
-
### Your current environment
```text
The output of `python collect_env.py`
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12…
-
## nf-core
nf-core produces pipelines with gold standard tools for various analyses.
- Workflow manager using `nextflow` language.
- Compatible with SLURM
- Uses Docker, Singularity and conda t…
-
Similar to the current support for slurm, please extend to LSF as well.
-
### Bug description
I'm training LLMs across multiple GPUs on a single node using `Nvidia/NeMo`.
When launching via `python train.py` inside of an allocation I get much worse performance than when l…
-
The workers OOMs a few times during the tokenization of a dataset with very long documents (over 1M chars), but succeed in the end by adjusting batch size of `BatchTokenizer` and just retrying.
@dl…
-
In slurm many scripts use signals to get a notification before the time limit is reached. They use them to create a checkpoint and force a requeue of the job in question. One such example is Lightning…
-
## Description
When running `cg decompress` the resulting fastq files are not automatically added to housekeeper after the decompression slurm job is complete. This is however the original intentio…