Closed by fwmiller 6 years ago
I changed the order of my commands and I get different output. I'm now getting this warning about the driver not supporting the kernel ABI of 1:
root@arria10:~# rxe_cfg
rdma_rxe module not loaded
Name Link Driver Speed NMTU IPv4_addr RDEV RMTU
eth0 yes st_gmac 1500 10.0.1.26
root@arria10:~# rxe_cfg start
[ 125.217418] rdma_rxe: loaded
[ 125.247695] rdma_rxe: set rxe0 active
[ 125.251445] rdma_rxe: added rxe0 to eth0
libibverbs: Warning: Driver rxe does not support the kernel ABI of 1 (supports 2 to 2) for device /sys/class/infiniband/rxe0
IB device 'rxe0' wasn't found
libibverbs: Warning: Driver rxe does not support the kernel ABI of 1 (supports 2 to 2) for device /sys/class/infiniband/rxe0
IB device 'rxe0' wasn't found
Name Link Driver Speed NMTU IPv4_addr RDEV RMTU
eth0 yes st_gmac 1500 10.0.1.26 rxe0 (?)
root@arria10:~# rxe_cfg add eth0
libibverbs: Warning: Driver rxe does not support the kernel ABI of 1 (supports 2 to 2) for device /sys/class/infiniband/rxe0
IB device 'rxe0' wasn't found
root@arria10:~# rxe_cfg start
[ 141.360121] rdma_rxe: already configured on eth0
sh: line 0: echo: write error: Invalid argument
libibverbs: Warning: Driver rxe does not support the kernel ABI of 1 (supports 2 to 2) for device /sys/class/infiniband/rxe0
IB device 'rxe0' wasn't found
libibverbs: Warning: Driver rxe does not support the kernel ABI of 1 (supports 2 to 2) for device /sys/class/infiniband/rxe0
IB device 'rxe0' wasn't found
Name Link Driver Speed NMTU IPv4_addr RDEV RMTU
eth0 yes st_gmac 1500 10.0.1.26 rxe0 (?)
root@arria10:~# rxe_cfg
libibverbs: Warning: Driver rxe does not support the kernel ABI of 1 (supports 2 to 2) for device /sys/class/infiniband/rxe0
IB device 'rxe0' wasn't found
Name Link Driver Speed NMTU IPv4_addr RDEV RMTU
eth0 yes st_gmac 1500 10.0.1.26 rxe0 (?)
root@arria10:~# ibv_devices
libibverbs: Warning: Driver rxe does not support the kernel ABI of 1 (supports 2 to 2) for device /sys/class/infiniband/rxe0
device node GUID
------ ---------------
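For anyone reading the log above, the warning already says which side is behind: the kernel reports ABI 1 while the userspace rxe provider accepts only 2 to 2. Here is a minimal Python sketch that pulls those numbers out of the warning text. The function names `parse_abi_warning` and `describe` are made up for illustration, and the pattern is written against the message exactly as it appears in this log, so other libibverbs versions may word it differently.

```python
import re

# Pattern matching the warning text as it appears in the log above.
WARNING_RE = re.compile(
    r"Driver (?P<driver>\S+) does not support the kernel ABI of (?P<kernel>\d+) "
    r"\(supports (?P<lo>\d+) to (?P<hi>\d+)\) for device (?P<device>\S+)"
)


def parse_abi_warning(line):
    """Return the ABI numbers found in a libibverbs warning line, or None."""
    match = WARNING_RE.search(line)
    if match is None:
        return None
    return {
        "driver": match.group("driver"),
        "kernel_abi": int(match.group("kernel")),
        "supported": (int(match.group("lo")), int(match.group("hi"))),
        "device": match.group("device"),
    }


def describe(info):
    """Say which side looks older, based only on the parsed numbers."""
    low, high = info["supported"]
    if info["kernel_abi"] < low:
        return "kernel module is older than the userspace provider"
    if info["kernel_abi"] > high:
        return "userspace provider is older than the kernel module"
    return "ABI versions are compatible"


if __name__ == "__main__":
    sample = (
        "libibverbs: Warning: Driver rxe does not support the kernel ABI of 1 "
        "(supports 2 to 2) for device /sys/class/infiniband/rxe0"
    )
    print(describe(parse_abi_warning(sample)))
```

Read this way, the fix is to bring the kernel module and rdma-core to matching versions, either by updating the kernel or by using an rdma-core build that matches it.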
Moving to kernel 4.17, with the most recent rdma-core pulled into the Yocto build, fixed this problem.
I ran into this issue with exactly the same symptoms as above. I am using Ubuntu 16.04. All you need to do is add the location of libibverbs.so to LD_LIBRARY_PATH:
export LD_LIBRARY_PATH=$LD_LIBRARY_PATH:path_to_softroce/rdma-core/build/lib
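If you launch the tools from a script rather than an interactive shell, the same idea can be sketched in Python. This is only an illustration: `with_library_dir` is a hypothetical helper, and unlike the export above it puts the directory first and drops empty entries, on the assumption that you want the freshly built libibverbs.so to win over any other copy already on the path.

```python
import os


def with_library_dir(lib_dir, env=None):
    """Return a copy of env with lib_dir first in LD_LIBRARY_PATH.

    Empty entries are dropped, because the dynamic loader treats an
    empty entry as the current working directory.
    """
    env = dict(os.environ if env is None else env)
    existing = [p for p in env.get("LD_LIBRARY_PATH", "").split(":") if p]
    # Put lib_dir first and avoid listing it twice.
    entries = [lib_dir] + [p for p in existing if p != lib_dir]
    env["LD_LIBRARY_PATH"] = ":".join(entries)
    return env


if __name__ == "__main__":
    # The path is the placeholder from the post above; substitute your own.
    new_env = with_library_dir("path_to_softroce/rdma-core/build/lib")
    print(new_env["LD_LIBRARY_PATH"])
```

The returned dictionary can be passed as `env=` to `subprocess.run(["ibv_devices"], env=new_env)` so that only that one command sees the modified path.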
I believe this is similar to issue #48.
I've got a good build now that comes up with SoftRoCE and seems to partially work. I'm trying to do an rping between the Arria10 and a PC VM. Here's some info. I've loaded all the InfiniBand stuff as modules:
root@arria10:/# lsmod
Module Size Used by
rdma_rxe 102400 0
rdma_ucm 28672 0
rdma_cm 53248 1 rdma_ucm
iw_cm 32768 1 rdma_cm
ib_cm 40960 1 rdma_cm
ib_uverbs 81920 1 rdma_ucm
ib_core 192512 6 rdma_rxe,ib_cm,rdma_cm,ib_uverbs,iw_cm,rdma_ucm
I can see the rxe_cfg stuff:
root@arria10:/# rxe_cfg status
Name Link Driver Speed NMTU IPv4_addr RDEV RMTU
eth0 yes st_gmac 1500 10.0.1.25
However, the RMTU is missing here and when I execute ibv_devices, I don't get an rxe0 device with a GUID like I should.
root@arria10:/# ibv_devices
device node GUID
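One way to narrow this down is to check whether the kernel has registered an RDMA device at all, independent of what ibv_devices prints. The sketch below lists the entries under /sys/class/infiniband, the directory the libibverbs warnings earlier in this thread point at. The helper name `kernel_rdma_devices` is made up for illustration, and the interpretation in the comments is an assumption, not something confirmed in this thread.

```python
import os


def kernel_rdma_devices(sysfs_root="/sys/class/infiniband"):
    """Return the RDMA device names the kernel exposes under sysfs_root."""
    try:
        return sorted(os.listdir(sysfs_root))
    except (FileNotFoundError, NotADirectoryError):
        # Directory absent: no RDMA device is registered.
        return []


if __name__ == "__main__":
    devices = kernel_rdma_devices()
    if "rxe0" in devices:
        # Assumption: the kernel side is up, so an empty ibv_devices
        # listing would point at the userspace libraries instead.
        print("kernel has rxe0; check the userspace rdma-core libraries")
    else:
        # Assumption: the module is loaded but no rxe device has been
        # attached to the network interface yet.
        print("no rxe0 in sysfs; found:", devices)
```

If rxe0 is absent there, attaching it to the interface (the log earlier in the thread uses `rxe_cfg add eth0`) would be the first thing to try.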
Do you know why this would be? Is something missing?
Thanks, FM