Closed ygao-armada closed 1 month ago
Hey @ygao-armada , i believe we resolved this via slack. If not please do reopen. The resolution was to enable NIC drivers in the kernel menuconfig. This one, https://www.kernelconfig.io/config_mlx5_core?arch=x86&kernelversion=6.6.54 and this one, https://www.kernelconfig.io/config_net_vendor_nvidia?arch=x86&kernelversion=6.6.54.
Thanks!
Hi,
I am facing same issues. Can you suggest, how this was done?
We have Intel NIC E810 on worker nodes and we are facing same issues?
Control plane node is deployed with Broadcom NIC, but worker node has intel NIC isn't working.
Hi,
I am facing same issues. Can you suggest, how this was done?
We have Intel NIC E810 on worker nodes and we are facing same issues?
Control plane node is deployed with Broadcom NIC, but worker node has intel NIC isn't working.
You need to build a custom HookOS with devices (mentioned by Jacob above) included, according to: https://anywhere.eks.amazonaws.com/docs/getting-started/baremetal/customize/bare-custom-hookos/
@ygao-armada Let me have a look at that.
Although, out of curiosity, I believe you got a different resolution. _i believe we resolved this via slack. If not please do reopen. The resolution was to enable NIC drivers in the kernel menuconfig. This one, https://www.kernelconfig.io/config_mlx5_core?arch=x86&kernelversion=6.6.54 and this one, https://www.kernelconfig.io/config_net_vendor_nvidia?arch=x86&kernelversion=6.6.54._
If our issues are same, would you let me know, why your solution is different then mine, just so i can learn more about potential issues, it might occur for our systems.
@akshay8043 oh, thanks for your help, especially from @jacobweinstock , in the custom HookOS, you may need to enable more, here is my list: Mellanox 5th generation network adapters (ConnectX series) core driver Mellanox Technologies Innova support Mellanox 5th generation network adapters (ConnectX series) Ethernet support Mellanox 5th generation network adapters (connectX series) IPoIB offloads support Mellanox Technologies subfunction device support using auxiliary device NVIDIA devices
It works for me already.
I try to create a EKSA bare metal cluster with Dell PowerEdge XE8640, and see the boots logs stuck with this:
And in the Virutal Console of iDRAC, the last messages are:
Expected Behaviour
Current Behaviour
Possible Solution
Steps to Reproduce (for bugs)
1. 2. 3. 4.
Context
Your Environment
Operating System and version (e.g. Linux, Windows, MacOS):
How are you running Tinkerbell? Using Vagrant & VirtualBox, Vagrant & Libvirt, on Packet using Terraform, or give details:
Link to your project or a code example to reproduce issue: