Closed micsuka closed 1 month ago
I recommend running a version released after 2021 and seeing if your problem is resolved.
(Specifically, https://github.com/openzfs/zfs/commit/4036b8d027fb7fe1a629b08a0d23cac975ab2eb9 might be useful, but there's a lot of bugs in native encryption, some of which have been fixed in the intervening 3.5 years since 2.0.3 was released. If you don't want to upgrade, you should probably file bugs against Debian, not upstream.)
Thank you, I've updated the zfs to 2.1.11-1~bpo11+1 on one server for now and I set the encryption back. It handles the same load now, let's see how it's behaving in the next few weeks.
so, I have zfs-2.1.11-1~bpo11+1 on all of our servers now... and it seems to be stable.
System information
Describe the problem you're observing
About the setup: We run several mariadb databases on zfs on Debian 11, the servers contain the same data through replication. All servers contain a zpool with mirrored SSDs and the dataset is compressed. I've attached the parameters of the datasets/pools.
I decided to move the database on an encrypted dataset, so I issued a
zfs send ... | zfs receive -o keyformat=raw -o keylocation=prompt...
a few days ago on 4 servers.So, after the datasets have been encrypted, this panic occured on 2 servers of 4, after around 3 days.
Here is the kernel message on server 1:
and here is the kernel log on server 2:
zfs1info.txt zfs2info.txt
I exclude the hardware problem, there was no trace of any error in the logs and like I said: these systems were rock solid for years. The servers contain ECC RAMs, the CPUs support aes.
Describe how to reproduce the problem
I'm confident that this problem is related to the zfs encryption.
Include any warning/errors/backtraces from the system logs