SebastianAchilles closed this issue 2 years ago
Magic Castle Slurm RPMs are built with COPR: https://copr.fedorainfracloud.org/coprs/cmdntrf/

slurm-slurmd-20.11.7-1.el8.x86_64 was built 4 months ago against hwloc-devel 1.11.9-3.el8. Unfortunately, CentOS 8 has since replaced hwloc and hwloc-devel with version 2.2.0-1.el8, hence the error you obtained.

I have triggered a rebuild of the slurm-slurmd RPM for CentOS 8 so that it is built against hwloc 2.2.0 instead. This should fix the issue.
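To confirm this kind of mismatch on a node, one option is to compare the installed hwloc version with the libhwloc soname that the slurm-slurmd package and binary expect. The sketch below is only an illustration: `hwloc_sonames` is a hypothetical helper name, and the package names are taken from the comment above. It assumes an RPM-based system where `rpm` and `ldd` are available, and it skips the checks when they are not.

```shell
# Hypothetical helper: print the distinct libhwloc sonames found in the text on stdin.
hwloc_sonames() {
    grep -o 'libhwloc\.so\.[0-9]*' | sort -u
}

# Run the checks only where the tools exist (assumption: an RPM-based node).
if command -v rpm >/dev/null 2>&1; then
    # hwloc version currently installed from the OS repo
    rpm -q hwloc hwloc-devel || true
    # libhwloc soname recorded as a dependency of the installed slurm-slurmd RPM
    rpm -q --requires slurm-slurmd | hwloc_sonames || true
fi

if command -v slurmd >/dev/null 2>&1; then
    # libhwloc soname the slurmd binary asks the dynamic loader for
    ldd "$(command -v slurmd)" | hwloc_sonames || true
fi
```

If the soname the package requires (for example `libhwloc.so.5`) is not the one provided by the installed hwloc, the RPM needs to be rebuilt against the newer library, as described above.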
When I build a cluster with this `main.tf` on JUSUF, building `slurm-slurmd` fails on the GPU node with:

[...]

To build `libhwloc.so.5` I had to use [...] because the `hwloc` from the OS repo was too new. On the CPU node, building `slurm-slurmd` worked directly.

I also tested v11.4 and v11.5, but I got a different error. That is why I am using v11.2 at the moment.