Xilinx / open-nic-driver

AMD OpenNIC driver includes the Linux kernel driver
GNU General Public License v2.0

Driver getting stuck on load against u50 #25

Open vaniaprkl opened 2 years ago

vaniaprkl commented 2 years ago

Hi, I am trying to bring up the driver to run a loopback test on the U50. I followed the steps and, with the firmware loaded, ran insmod onic.ko. The command does not return, and I see the following in dmesg:

[ 1450.565564] onic 0000:82:00.0: device is a master PF
[ 1450.565839] onic 0000:82:00.0: Allocated 8 queue vectors
[ 1451.063047] BUG: kernel NULL pointer dereference, address: 0000000000000008
[ 1451.063050] #PF: supervisor read access in kernel mode
[ 1451.063051] #PF: error_code(0x0000) - not-present page
[ 1451.063051] PGD 0 P4D 0
[ 1451.063053] Oops: 0000 [#1] SMP PTI
[ 1451.063056] CPU: 14 PID: 261 Comm: kworker/14:1 Tainted: G          OE    5.4.0-109-generic #123-Ubuntu
[ 1451.063057] Hardware name: Supermicro X10DRH/X10DRH-iT, BIOS 2.0 12/17/2015
[ 1451.063061] Workqueue: events work_for_cpu_fn
[ 1451.063067] RIP: 0010:qdma_invalidate_fmap_ctxt+0x20/0x60 [onic]
[ 1451.063069] Code: 66 d0 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 55 31 c9 31 d2 48 89 e5 48 83 ec 10 65 48 8b 04 25 28 00 00 00 48 89 45 f8 31 c0 <0f> b7 47 08 c7 47 0a 00 00 00 00 48 8d 75 f4 25 ff 07 00 00 c1 e0
[ 1451.063070] RSP: 0018:ffffa43e8e2dfd38 EFLAGS: 00010246
[ 1451.063071] RAX: 0000000000000000 RBX: ffff8e1e244fc8c0 RCX: 0000000000000000
[ 1451.063072] RDX: 0000000000000000 RSI: ffffa43e8e901000 RDI: 0000000000000000
[ 1451.063072] RBP: ffffa43e8e2dfd48 R08: 0000034bebdb23f0 R09: 000000000000000e
[ 1451.063073] R10: ffffa43ea0c40000 R11: ffff8e1e245f68c0 R12: ffff8e0db3fcb000
[ 1451.063074] R13: 0000000000000000 R14: 00000000fffffff0 R15: ffff8e0db3fcb000
[ 1451.063075] FS: 0000000000000000(0000) GS:ffff8e1e3f800000(0000) knlGS:0000000000000000
[ 1451.063076] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 1451.063077] CR2: 0000000000000008 CR3: 0000002ffc60a001 CR4: 00000000003606e0
[ 1451.063078] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[ 1451.063078] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[ 1451.063079] Call Trace:
[ 1451.063083] onic_init_hardware+0x10d/0x820 [onic]
[ 1451.063086] onic_probe+0x22b/0x2c0 [onic]
[ 1451.063089] local_pci_probe+0x48/0x80
[ 1451.063093] ? __schedule+0x2eb/0x740
[ 1451.063095] work_for_cpu_fn+0x1a/0x30
[ 1451.063097] process_one_work+0x1eb/0x3b0
[ 1451.063099] worker_thread+0x21e/0x400
[ 1451.063100] kthread+0x104/0x140
[ 1451.063102] ? process_one_work+0x3b0/0x3b0
[ 1451.063103] ? kthread_park+0x90/0x90
[ 1451.063105] ret_from_fork+0x35/0x40

108anup commented 2 years ago

Potentially: https://github.com/Xilinx/open-nic-driver/issues/28 ?
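
For anyone triaging the trace in the original report: the fault address (0x8) and RDI (0) suggest that qdma_invalidate_fmap_ctxt was called with a NULL first argument and then read a field at offset 8 from it. The marked instruction bytes in the Code: line (<0f> b7 47 08) are consistent with a 16-bit load from 0x8(%rdi). Which pointer in onic_init_hardware was NULL cannot be determined from the log alone. Below is a minimal Python sketch that pulls these fields out of the oops text. The helper name summarize_oops is hypothetical, and the regular expressions assume the x86-64 oops layout shown in the report.

```python
import re

# Three lines copied verbatim from the dmesg output in the report.
OOPS = """\
[ 1451.063047] BUG: kernel NULL pointer dereference, address: 0000000000000008
[ 1451.063067] RIP: 0010:qdma_invalidate_fmap_ctxt+0x20/0x60 [onic]
[ 1451.063072] RDX: 0000000000000000 RSI: ffffa43e8e901000 RDI: 0000000000000000
"""


def summarize_oops(text):
    """Hypothetical helper: return (function, module, fault_addr, rdi).

    Assumes an x86-64 oops where RIP is printed as
    'RIP: <cs>:<symbol>+0x<off>/0x<len> [<module>]'.
    """
    rip = re.search(r"RIP: \w+:(\w+)\+0x[0-9a-f]+/0x[0-9a-f]+ \[(\w+)\]", text)
    fault = re.search(r"address: ([0-9a-f]+)", text)
    rdi = re.search(r"RDI: ([0-9a-f]+)", text)
    return (
        rip.group(1),
        rip.group(2),
        int(fault.group(1), 16),
        int(rdi.group(1), 16),
    )


func, module, fault_addr, rdi = summarize_oops(OOPS)
print(f"{module}:{func} faulted at {fault_addr:#x} with RDI={rdi:#x}")

# On x86-64, RDI carries the first function argument, so RDI == 0 with a
# small fault address points to a field read through a NULL pointer.
if rdi == 0:
    print(f"first argument looks NULL; the read was at offset {fault_addr:#x}")
```

This only summarizes what the log already shows. Finding the root cause still needs the driver source around the call in onic_init_hardware, or the linked issue if it turns out to be the same problem.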