NetSys / bess

BESS: Berkeley Extensible Software Switch
Other
311 stars 174 forks source link

Page fault on vport delete #845

Open muraliran opened 6 years ago

muraliran commented 6 years ago

This is pretty consistent. If I recall well, this wasn't happening before vport refactor. This is ubuntu 18.04 vbox vm using generic pio.

Cleaned and recompiled all (including kmod) but problem remains.

[ 899.289069] BUG: unable to handle kernel paging request at ffffb185057118c0 [ 899.289095] IP: sn_release_netdev+0x17/0x40 [bess]

Steps to repro

  1. Create docker
  2. Attach vport (stadard bess procedure)
  3. Delete docker instance (stop & rm)
  4. bess hangs

Full trace:


[  829.645939] bess - sn_register_netdev():895 v_mlocal_1: registered - e6:8d:e7:03:44:92 txq 1 rxq 1
[  831.100724] bess - sn_register_netdev():895 v_mlocal_2: registered - 96:f7:38:c9:e2:bf txq 1 rxq 1
[  878.074034] docker0: port 1(veth4f435fe) entered disabled state
[  878.075258] veth3c9ec4b: renamed from eth0
[  878.108181] docker0: port 1(veth4f435fe) entered disabled state
[  878.111852] device veth4f435fe left promiscuous mode
[  878.111855] docker0: port 1(veth4f435fe) entered disabled state
[  878.216952] bess - sn_netdev_destructor():750 v_mlocal_1: releasing netdev...
[  899.289069] BUG: unable to handle kernel paging request at ffffb185057118c0
[  899.289095] IP: sn_release_netdev+0x17/0x40 [bess]
[  899.289106] PGD 217141067 P4D 217141067 PUD 217142067 PMD 20c720067 PTE 0
[  899.289123] Oops: 0000 [#1] SMP PTI
[  899.289132] Modules linked in: veth bess(OE) uio_pci_generic uio ipt_MASQUERADE nf_nat_masquerade_ipv4 nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype iptable_filter xt_conntrack nf_nat nf_conntrack br_netfilter bridge stp llc overlay aufs joydev intel_rapl_perf snd_intel8x0 snd_ac97_codec ac97_bus snd_pcm snd_timer binfmt_misc input_leds serio_raw snd soundcore vboxvideo(CE) ttm mac_hid vboxguest drm_kms_helper drm fb_sys_fops syscopyarea sysfillrect sysimgblt sch_fq_codel ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi ip_tables x_tables autofs4 btrfs zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear
[  899.289297]  hid_generic usbhid hid crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc aesni_intel aes_x86_64 crypto_simd glue_helper cryptd psmouse ahci libahci pata_acpi i2c_piix4 e1000 video
[  899.289341] CPU: 1 PID: 24948 Comm: bessd Tainted: G         C OE    4.15.0-10-generic #11-Ubuntu
[  899.289359] Hardware name: innotek GmbH VirtualBox/VirtualBox, BIOS VirtualBox 12/01/2006
[  899.289377] RIP: 0010:sn_release_netdev+0x17/0x40 [bess]
[  899.289389] RSP: 0018:ffffb18500d17e10 EFLAGS: 00010286
[  899.289401] RAX: 0000000000000000 RBX: ffffb185057118c0 RCX: 00007f8fc0018650
[  899.289416] RDX: ffff8a0411071700 RSI: 0000000000008502 RDI: ffffffffb9fe64e0
[  899.289431] RBP: ffffb18500d17e18 R08: ffff8a040cb63440 R09: 0000000000000013
[  899.289445] R10: ffff8a040cda9f38 R11: 0000000000003246 R12: ffffb185057118c0
[  899.289492] R13: ffff8a040cda9f00 R14: 0000000000008502 R15: 00007f8fc0018650
[  899.289554] FS:  00007f8fce3ff700(0000) GS:ffff8a041fd00000(0000) knlGS:0000000000000000
[  899.289585] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  899.289598] CR2: ffffb185057118c0 CR3: 000000020c7e2001 CR4: 00000000000606e0
[  899.289628] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  899.289643] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  899.289658] Call Trace:
[  899.289668]  sn_host_ioctl+0x19c/0x280 [bess]
[  899.289680]  do_vfs_ioctl+0xa8/0x630
[  899.289690]  ? handle_mm_fault+0xb1/0x200
[  899.289701]  ? __do_page_fault+0x277/0x4d0
[  899.290114]  SyS_ioctl+0x79/0x90
[  899.290587]  do_syscall_64+0x76/0x130
[  899.290995]  entry_SYSCALL_64_after_hwframe+0x21/0x86
[  899.291497] RIP: 0033:0x7f8ff384b5d7
[  899.291888] RSP: 002b:00007f8fce3fe118 EFLAGS: 00003246 ORIG_RAX: 000000000000001
sangjinhan commented 6 years ago

I was able to reproduce the issue. I will take a look.