-
I am looking for some reporting command similar to Slurm's sreport. From the man page, "sreport is used to generate reports of job usage and cluster utilization for Slurm jobs saved to the Slurm Datab…
-
I recently had some issues running on a slurm-based IB system. The problem was I was using the default launcher for IB when COMM=gasnet, which is `gasnetrun_ibv`. However, that launcher requires you t…
-
make sure all the scheduled slurm jobs in scrontab have an error and output dir instead of using sportbot home dir by default.
among those jobs:
- [x] FTP SYNC (prod, dev)
/hps/software/users/parki…
-
This is very rough draft prototype examples.
Todo:
- [ ] Find out how to specify the slurm output directory
- [x] Figure out how to send individual jobs to maximize thread/core use
- [x] Explore P…
-
I am trying to run a simple pretraining job with nemorun: `nemorun llm pretrain --factory llama3_8b`
However, I see the following error before the training starts:
```
The application appears to have…
-
#### Details
* Slurm Version: 21.08.5
* Python Version: 3.9.20
* Cython Version: 3.0.11
* PySlurm Branch: v21.08.4
* Linux Distribution:Ubuntu 22.04.3 LTS
#### Issue
I am using pyslurm bu…
-
Topic: Integrate Prometheus data into Grafana and create dashboards for Slurm monitoring.
Tasks:
- Add Prometheus as a Data Source: Configure Grafana to use Prometheus.
- Import Dashboards…
-
### What happened + What you expected to happen
Related issue: https://github.com/ray-project/ray/issues/13607
Ray will bypass CPU limits set by SLURM and access all available CPUs. This is a sign…
-
Hello,
{future.mirai} is a dream :) Any chance we could get a minimal working example for getting this to work on a slurm cluster? I am struggling to connect the dots.
Do I need to set up the d…
-
* Test qcloud2-pipeline at the new VM proteomics@proteomics-qc.hpc.crg.es
* https://dokuwiki.linux.crg.es/doku.php?id=sit:nextflow_on_new_cluster