amzn / amzn-drivers

Official AWS drivers repository for Elastic Network Adapter (ENA) and Elastic Fabric Adapter (EFA)
457 stars 176 forks source link

[Bug]: coredump in ena_com.c #323

Open bashars-cp opened 1 month ago

bashars-cp commented 1 month ago

Preliminary Actions

Driver Type

Linux kernel driver for Elastic Network Adapter (ENA)

Driver Tag/Commit

ena - 2.5.0g

Custom Code

No

OS Platform and Distribution

GAIA R81.20 - Based on RHEL 7.9 (3.10.0-1160.15.2cpx86_64)

Bug description

crash -

BT: PID: 13075 TASK: ffff8801398f9540 CPU: 2 COMMAND: "ethtool" . . . [exception RIP: ena_com_indirect_table_get+125] RIP: ffffffff902e300d RSP: ffff880139417c58 RFLAGS: 00010202 RAX: 0000000000000001 RBX: ffffc90000ed5000 RCX: 0000000000000000 RDX: 0000000000000000 RSI: 0000000000000001 RDI: ffff88021a797020 RBP: ffff880139417cb0 R8: ffff880139417b48 R9: 0000000000000000 R10: 0000000000000001 R11: 0000000000000063 R12: ffff8801f61c2400 R13: ffff8801f61c2400 R14: 0000000000000000 R15: ffff880224380000 ORIG_RAX: ffffffffffffffff CS: 0010 SS: 0018

12 [ffff880139417cb8] ena_get_rxfh at ffffffff902dde5e [ena]

13 [ffff880139417cd0] ethtool_get_rxfh_indir at ffffffff816a8e74

14 [ffff880139417d08] dev_ethtool at ffffffff816aaa03

15 [ffff880139417e08] dev_ioctl at ffffffff816bcdbf

16 [ffff880139417e98] compat_sock_ioctl at ffffffff8167f351

17 [ffff880139417f30] compat_sys_ioctl at ffffffff8129944f

18 [ffff880139417f80] sysenter_dispatch at ffffffff817c8361

RIP: 00000000ffffe430 RSP: 00000000fffcd798 RFLAGS: 00000246 RAX: 0000000000000036 RBX: ffffffff817c8361 RCX: 0000000000008946 RDX: 00000000fffcd918 RSI: 0000000009b86008 RDI: 000000000000002d RBP: 0000000000000080 R8: 0000000000000000 R9: 0000000000000000 R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000 R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000 ORIG_RAX: 0000000000000036 CS: 0023 SS: 002b

Reproduction steps

Unexpected, there's no clear way to reproduce this, it happens often though.

Expected Behavior

No crash.

Actual Behavior

crash.

Additional Data

No response

Relevant log output

No response

Contact Details

bashars@checkpoint.com

davidarinzon commented 1 month ago

Hi @bashars-cp

2.5.0g driver is very old, it's from April 2021. There were many changes in the code since then. Please try using a more recent driver.