charmed-hpc / slurm-charms

Juju charms for automating the Day 0 to Day 2 operations of the Slurm workload manager ⚖️🐧
Apache License 2.0

enhancement(slurmctld): Signal children processes #45

Closed: jamesbeedy closed this 5 days ago

jamesbeedy commented 5 days ago

Slurm process tracking is currently not configured to kill child processes of a job. These changes set SignalChildrenProcesses=yes in cgroup.conf to enable this functionality.

Fixes #37
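
For illustration only, the change described above amounts to making sure one line is present in cgroup.conf. The sketch below is not the charm's actual code, which this thread does not show. The helper name `set_cgroup_option` and the sample file contents are hypothetical, and the case-insensitive key match is an assumption about how Slurm reads parameter names.

```python
def set_cgroup_option(conf_text: str, key: str, value: str) -> str:
    """Return conf_text with `key=value` set exactly once.

    An existing entry for `key` is replaced; otherwise the entry is appended.
    Comments and blank lines are passed through unchanged.
    """
    out = []
    written = False
    for line in conf_text.splitlines():
        stripped = line.strip()
        if stripped and not stripped.startswith("#"):
            name = stripped.split("=", 1)[0].strip()
            # Assumption: parameter names are matched case-insensitively.
            if name.lower() == key.lower():
                if not written:
                    out.append(f"{key}={value}")
                    written = True
                continue  # drop any duplicate entries for the same key
        out.append(line)
    if not written:
        out.append(f"{key}={value}")
    return "\n".join(out) + "\n"


# Hypothetical starting contents of cgroup.conf.
example = "CgroupPlugin=autodetect\nConstrainCores=yes\n"
updated = set_cgroup_option(example, "SignalChildrenProcesses", "yes")
print(updated)
```

Calling the helper a second time on its own output leaves the text unchanged, which is convenient when a charm hook may run repeatedly.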

jamesbeedy commented 5 days ago

Closing this until the Slurm charms support the configuration.

NucciTheBoss commented 4 days ago

@jamesbeedy what specifically do you need for the charms to support this configuration in cgroup.conf? Do we need to modify which cgroup version is running on the machine?

jamesbeedy commented 4 days ago

> @jamesbeedy what specifically do you need for the charms to support this configuration in cgroup.conf? Do we need to modify which cgroup version is running on the machine?

Slurm > 23.02

NucciTheBoss commented 4 days ago

Ah, that'll do it. We'll have at least Slurm 23.11 once the fixes for the apt charm library have landed for Noble. That should be some time this week.