openhpc / ohpc

OpenHPC Integration, Packaging, and Test Repo
http://openhpc.community
Apache License 2.0

rocky 8.9 + warewulf+slurm error detecting hardware #1947

Open luciandf opened 4 months ago

luciandf commented 4 months ago

I am trying to provision some test nodes with Rocky 8.9, Warewulf, and Slurm. I am following the recipe, but when I try to boot the clients, I get this error at the beginning of the boot:

Now Booting Warewulf...

Setting the hostname (name):

Detecting hardware: PCI aliases for this kernel is not available!

Bringing up local loopback network:
Checking for network device: enp6s18 (enp6s18)

ERROR: Network hardware was not recognized!

Then the client reboots.

My setup is as follows: the master node is a VM on a machine running Proxmox. The client is also a VM, on a different machine running Proxmox. This is exactly the setup of our production system, which works fine and was installed with OpenHPC 2 in 2022. We use Rocky 8.5, and now I am trying to test a newer kernel and Rocky 8.9.

Does anyone know what causes this error?