An Azure VM running the 4.15.0-1041-azure kernel cannot install the Mellanox OFED driver (the same issue occurs with driver versions 4.3-*, 4.4-*, and 4.5-*).
Here is part of the log from running ./mlnxofedinstall --force --kernel-only --without-dkms --without-fw-update --with-infiniband-diags --package-install-options -D2 -vv (mlnx_add_kernel_support.sh had been run beforehand to add kernel support):
Below is the list of MLNX_OFED_LINUX packages that you have chosen
(some may have been added by the installer due to package dependencies):
libibumad
libopensm
libibmad
infiniband-diags
ofed-scripts
mlnx-ofed-kernel-utils
mlnx-ofed-kernel-modules
iser-modules
isert-modules
srp-modules
mlnx-nfsrdma-modules
mlnx-rdma-rxe-modules
kernel-mft-modules
knem-modules
This program will install the MLNX_OFED_LINUX package on your machine.
Note that all other Mellanox, OEM, OFED, RDMA or Distribution IB packages will be removed.
Those packages are removed due to conflicts with MLNX_OFED_LINUX, do not reinstall them.
Checking SW Requirements...
Running: dpkg --configure -a --force-all
Running: apt-get install -f
Removing old packages...
Installing new packages
Installing libibumad-43.1.1.MLNX20171122.0eb0969...
Running /usr/bin/dpkg -i --force-confmiss -D2 /var/drivers/mellanox/MLNX_OFED_LINUX-4.3-1.0.1.0-ubuntu16.04-x86_64/DEBS/libibumad_43.1.1.MLNX20171122.0eb0969-0.1.43101_amd64.deb
Installing libopensm-5.0.0.MLNX20180219.c610c42...
Running /usr/bin/dpkg -i --force-confmiss -D2 /var/drivers/mellanox/MLNX_OFED_LINUX-4.3-1.0.1.0-ubuntu16.04-x86_64/DEBS/libopensm_5.0.0.MLNX20180219.c610c42-0.1.43101_amd64.deb
Installing libibmad-1.3.13.MLNX20170511.267a441...
Running /usr/bin/dpkg -i --force-confmiss -D2 /var/drivers/mellanox/MLNX_OFED_LINUX-4.3-1.0.1.0-ubuntu16.04-x86_64/DEBS/libibmad_1.3.13.MLNX20170511.267a441-0.1.43101_amd64.deb
Installing infiniband-diags-5.0.0.MLNX20180124.dfd2235...
Running /usr/bin/dpkg -i --force-confmiss -D2 /var/drivers/mellanox/MLNX_OFED_LINUX-4.3-1.0.1.0-ubuntu16.04-x86_64/DEBS/infiniband-diags_5.0.0.MLNX20180124.dfd2235-0.1.43101_amd64.deb
Installing ofed-scripts-4.3...
Running /usr/bin/dpkg -i --force-confmiss -D2 /var/drivers/mellanox/MLNX_OFED_LINUX-4.3-1.0.1.0-ubuntu16.04-x86_64/DEBS/ofed-scripts_4.3-OFED.4.3.1.0.1_amd64.deb
Installing mlnx-ofed-kernel-utils-4.3...
Running /usr/bin/dpkg -i --force-confnew --force-confmiss -D2 /var/drivers/mellanox/MLNX_OFED_LINUX-4.3-1.0.1.0-ubuntu16.04-x86_64/DEBS/mlnx-ofed-kernel-utils_4.3-OFED.4.3.1.0.1.1.g8509e41.kver.4.15.0-1041-azure_amd64.deb
Installing mlnx-ofed-kernel-modules-4.3...
Running /usr/bin/dpkg -i --force-confnew --force-confmiss -D2 /var/drivers/mellanox/MLNX_OFED_LINUX-4.3-1.0.1.0-ubuntu16.04-x86_64/DEBS/mlnx-ofed-kernel-modules_4.3-OFED.4.3.1.0.1.1.g8509e41.kver.4.15.0-1041-azure_all.deb
Error: mlnx-ofed-kernel-modules installation failed!
Collecting debug info...
See:
/tmp/MLNX_OFED_LINUX.31695.logs/mlnx-ofed-kernel-modules.debinstall.log
Removing newly installed packages...
Running: /usr/sbin/ofed_uninstall.sh --force --keep-mft
The command /usr/bin/dpkg -i --force-confnew --force-confmiss -D2 /var/drivers/mellanox/MLNX_OFED_LINUX-4.3-1.0.1.0-ubuntu16.04-x86_64/DEBS/mlnx-ofed-kernel-modules_4.3-OFED.4.3.1.0.1.1.g8509e41.kver.4.15.0-1041-azure_all.deb itself ran successfully, but the mlnx-ofed-kernel-modules modules still do not show up afterwards. The following commands print nothing:
$ depmod -a
$ lsmod | grep mlnx
The issue started after the Azure VM was automatically upgraded to the 4.15.0-1041-azure kernel.
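A few extra checks might help narrow this down. This is only a sketch with hypothetical helper names (kver_from_deb, modules_for_running_kernel, loaded_mlx_modules), none of which are part of MLNX_OFED. It assumes the rebuilt package keeps the .kver.<kernel>_<arch>.deb naming seen in the log above, and that the Mellanox modules use the usual mlx4_*/mlx5_* names. If that second assumption holds, lsmod | grep mlnx would print nothing even with the drivers loaded, and depmod -a is normally silent on success, so the two empty outputs above may not prove much on their own.

```shell
#!/bin/sh
# Hypothetical diagnostic helpers; names are illustrative only.

# Print the kernel version embedded in a rebuilt package file name,
# i.e. the part between ".kver." and the "_<arch>.deb" suffix.
# Prints nothing if the name has no ".kver." component.
kver_from_deb() {
    basename "$1" | sed -n 's/.*\.kver\.\(.*\)_[a-z0-9]*\.deb$/\1/p'
}

# List files that the installed mlnx-ofed-kernel-modules package placed
# under the running kernel's module tree. Prints nothing if there are none
# or if the package is not installed.
modules_for_running_kernel() {
    dpkg -L mlnx-ofed-kernel-modules 2>/dev/null | grep "^/lib/modules/$(uname -r)/"
}

# List loaded modules whose names start with "mlx" (assumed naming:
# mlx4_core, mlx5_core, and so on).
loaded_mlx_modules() {
    lsmod | grep -E '^mlx'
}
```

Comparing the output of kver_from_deb for the kernel-modules package against uname -r would show whether the package was built for the kernel that is actually running. modules_for_running_kernel would show whether any .ko files were unpacked for it.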