Closed hshh closed 10 months ago
2 unbound.conf attached. Multiple unbound instance with unbound.conf.1 is running with setfib. As I said in issue #957.
unbound.conf.1
server:
auto-trust-anchor-file: "root.key"
root-hints: "root.hints"
num-threads: 16
interface: 127.0.0.1
interface: ::1
outgoing-range: 32768
num-queries-per-thread: 4096
so-reuseport: yes
msg-cache-slabs: 16
rrset-cache-slabs: 16
infra-cache-slabs: 16
key-cache-slabs: 16
ratelimit-slabs: 16
cache-max-negative-ttl: 0
infra-host-ttl: 300
module-config: "iterator"
hide-identity: yes
hide-version: yes
rrset-roundrobin: yes
do-not-query-localhost: no
log-servfail: yes
port: 53000
pidfile: "unbound.pid"
remote-control:
control-port: 53100
control-enable: yes
control-use-cert: no
control-interface: 127.0.0.1
control-interface: ::1
# root
auth-zone:
name: "."
primary: 199.9.14.201 # b.root-servers.net
primary: 192.33.4.12 # c.root-servers.net
primary: 199.7.91.13 # d.root-servers.net
primary: 192.5.5.241 # f.root-servers.net
primary: 192.112.36.4 # g.root-servers.net
primary: 193.0.14.129 # k.root-servers.net
primary: 192.0.47.132 # xfr.cjr.dns.icann.org
primary: 192.0.32.132 # xfr.lax.dns.icann.org
primary: 2001:500:200::b # b.root-servers.net
primary: 2001:500:2::c # c.root-servers.net
primary: 2001:500:2d::d # d.root-servers.net
primary: 2001:500:2f::f # f.root-servers.net
primary: 2001:500:12::d0d # g.root-servers.net
primary: 2001:7fd::1 # k.root-servers.net
primary: 2620:0:2830:202::132 # xfr.cjr.dns.icann.org
primary: 2620:0:2d0:202::132 # xfr.lax.dns.icann.org
fallback-enabled: yes
for-downstream: no
for-upstream: yes
unbound.conf.2
server:
port: 5300
chroot: "/usr/local/etc/unbound"
directory: "/usr/local/etc/unbound"
#verbosity: 1
num-threads: 16
interface: 0.0.0.0
interface: ::0
outgoing-range: 32768
num-queries-per-thread: 4096
so-reuseport: yes
msg-cache-slabs: 16
rrset-cache-slabs: 16
infra-cache-slabs: 16
key-cache-slabs: 16
ratelimit-slabs: 16
rrset-cache-size: 100k
msg-cache-size: 100k
key-cache-size: 100k
cache-max-negative-ttl: 0
infra-keep-probing: yes
access-control: 0.0.0.0/0 allow
auto-trust-anchor-file: "root.key"
pidfile: "unbound.pid"
module-config: "iterator"
hide-identity: yes
hide-version: yes
rrset-roundrobin: yes
do-not-query-localhost: no
log-servfail: yes
private-address: ::/0
python:
remote-control:
control-enable: yes
control-use-cert: no
control-interface: 127.0.0.1
control-interface: ::1
control-port: 5303
forward-zone:
name: "."
forward-addr: 127.0.0.1
forward-no-cache: yes
This seems like a FreeBSD kernel error. From Unbound's side I see that this may happen during a UDP bind() call. Do I understand correctly that unbound.conf.1 has multiple instances with setfib, while unbound.conf.2 has only one? Do you see these often? And am I correct to assume that this happens during startup/reload? Does the same setup work reliably with older versions?
Things you could try:
Hmm, I tested unbound-1.17.1, and it caused kernel panic too. I think it is an OS bug. The system does not crash during startup/reload of unbound. It happens irregularly. Now I am running unbound 1.19 with "so-reuseport: no" for testing. And I saved the PID information of all unbound processes.
This is not unbound bug. It is FreeBSD kernel bug. And it looks very similar to https://bugs.freebsd.org/bugzilla/show_bug.cgi?id=273890 Please see details and try the patch.
"so-reuseport: no" only reduces the occurrence of the problem, it can still lead to kernel panic. I am testing the kernel patch now.
Interested if this was the fix for you @hshh? Did you get it done?
Interested if this was the fix for you @hshh? Did you get it done?
Yes, the system has become stable after applying the kernel patch. The uptime is 16 days now.
I hope it can be merged into 14.0-RELEASE, @glebius .
It can't be merged into 14.0-RELEASE, since the release happened in the past. We might make a Errata Notice and a patch level for the releng branch.
Closing this as a non-issue for Unbound then :)
unbound 1.19, FreeBSD 14.0-RELEASE
Because I am running multiple unbound instance in same host with different config file, I am not sure which one cause panic. I will keep updating this thread.
Here is panic info,