centerforaisafety / cerberus-cluster

HPC cluster code and configurations for running on OCI
Universal Permissive License v1.0

Have the slurm logs save to FSS #87

Open steven-safeai opened 1 year ago

steven-safeai commented 1 year ago

Move the Slurm log files from their default locations, `SlurmctldLogFile=/var/log/slurm/slurmctld.log` and `SlurmdLogFile=/var/log/slurm/slurmd.log`, to FSS (OCI File Storage).
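A minimal sketch of what the `slurm.conf` change could look like. The mount point `/mnt/fss/slurm/logs` is a hypothetical path, not one taken from this repository. Because FSS is shared by all nodes, the sketch uses the `%h` substitution that `SlurmdLogFile` supports so that each compute node writes to its own file rather than all nodes appending to one.

```ini
# slurm.conf (fragment)
# Assumption: FSS is mounted at /mnt/fss on the controller and on every
# compute node; the path below is illustrative only.

# Controller daemon log, written by slurmctld on the controller node.
SlurmctldLogFile=/mnt/fss/slurm/logs/slurmctld.log

# Compute node daemon log. %h is replaced with the hostname of the node
# running slurmd, giving one log file per node on the shared filesystem.
SlurmdLogFile=/mnt/fss/slurm/logs/slurmd.%h.log
```

The log directory would need to exist and be writable by the user that runs `slurmctld` before the daemons are restarted.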

andriy-safe-ai commented 1 year ago

@steven-basart Has this been done?

steven-safeai commented 1 year ago

We might need to move the control daemon's log back to local disk if latency gets too high.

andriy-safe-ai commented 1 year ago

@steven-basart What latency are you referring to?