ValveSoftware / steam-for-linux

Issue tracking for the Steam for Linux beta client
4.26k stars 174 forks source link

game installs are corrupting BTRFS on Fedora 40 #10903

Open jorp opened 6 months ago

jorp commented 6 months ago

Your system information

Please describe your issue in as much detail as possible:

Whenever installing a new game, BTRFS becomes corrupted and is not correctable. The game will start installing as normal, and sometime into downloading, it will stop and state it is corrupted and move on to the next game.

I have even done a fresh install of Fedora 40 on a brand new SSD and received the same result.

Steps for reproducing this issue:

  1. Install Fedora 40
  2. Make no changes to default filesystems or parititoning, choose to encrypt installation with LUKS
  3. Install steam from rpmfusion repository
  4. Begin installing a game
  5. Watch for errors in the steam client or in dmesg (below)
[ 2479.397510] BTRFS warning (device dm-0): csum failed root 256 ino 188739 off 147456 csum 0x355bd35c expected csum 0x36d69a3a mirror 1
[ 2479.397537] BTRFS error (device dm-0): bdev /dev/mapper/luks-ffffffffffffff errs: wr 0, rd 0, flush 0, corrupt 1, gen 0
[ 2479.438946] BTRFS warning (device dm-0): csum failed root 256 ino 188739 off 147456 csum 0xb9577995 expected csum 0x36d69a3a mirror 1
[ 2479.438968] BTRFS error (device dm-0): bdev /dev/mapper/luks-ffffffffffffff errs: wr 0, rd 0, flush 0, corrupt 2, gen 0
[ 2479.453308] BTRFS warning (device dm-0): csum failed root 256 ino 188739 off 147456 csum 0xb9577995 expected csum 0x36d69a3a mirror 1
[ 2479.453333] BTRFS error (device dm-0): bdev /dev/mapper/luks-ffffffffffffff errs: wr 0, rd 0, flush 0, corrupt 3, gen 0
[ 2488.515036] BTRFS warning (device dm-0): csum failed root 256 ino 188739 off 147456 csum 0xb9577995 expected csum 0x36d69a3a mirror 1
[ 2488.515052] BTRFS error (device dm-0): bdev /dev/mapper/luks-ffffffffffffff errs: wr 0, rd 0, flush 0, corrupt 4, gen 0
[ 2488.515308] BTRFS warning (device dm-0): csum failed root 256 ino 188739 off 147456 csum 0xb9577995 expected csum 0x36d69a3a mirror 1
[ 2488.515323] BTRFS error (device dm-0): bdev /dev/mapper/luks-ffffffffffffff errs: wr 0, rd 0, flush 0, corrupt 5, gen 0
[ 2488.515436] BTRFS warning (device dm-0): csum failed root 256 ino 188739 off 147456 csum 0xb9577995 expected csum 0x36d69a3a mirror 1
[ 2488.515446] BTRFS error (device dm-0): bdev /dev/mapper/luks-ffffffffffffff errs: wr 0, rd 0, flush 0, corrupt 6, gen 0
jorp commented 6 months ago

So I switched to beta, and the problem held off for a lot longer than it did last time. Now, where the games were nearly done, I received 'corrupt update files' for one game, and 'disk read error' for the other. dmesg is full of BTRFS errors again.

Here is a particular path included this time:

[ 5224.858741] BTRFS warning (device dm-0): checksum error at logical 152508235776 on dev /dev/mapper/luks-87b10272-c9ac-4365-8e90-a82ab75f77f3, physical 153623920640, root 256, inode 214119, offset 150450176, l
ength 4096, links 1 (path: jorp/.local/share/Steam/steamapps/downloading/2328760/PinballFX/Content/Paks/pakchunk2166-WindowsNoEditor.pak)

Here are the games in question: Screenshot from 2024-05-17 22-05-54

jorp commented 6 months ago

Also made a post on the Fedora forums here

g572staem commented 6 months ago

That's a filesystem problem, it just so happens that steam is provoking it. I would recommend you use a different file system, ext4 is old as shit and pretty stable.

hifron commented 6 months ago

check your ssd health and lifespan.

jorp commented 6 months ago

check your ssd health and lifespan.

Thanks, I've already done this and haven't seen issues. I mentioned in the OP that I've also been able to reproduce this with a brand new drive.

As an update, I'm looking into RAM issues and it's possible that may be the culprit. Still researching and troubleshooting at the moment though.

jorp commented 6 months ago

Hi all, I ram memtest86 and got 600+ errors.. I've since RMA'd my RAM and am using a fresh set while I await its return. I haven't run into any issues since. I am going to keep this open for another week or so to see if this comes back.

kisak-valve commented 5 months ago

Closing as a hardware issue.

trgeiger commented 5 months ago

I'm also having this issue in the last couple days on a brand new machine. Seems odd for this to be popping up for multiple people. Also on Fedora using the rpmfusion packaged steam on BTRFS+LUKS.

The files at the inode from the error are: .local/share/Steam/steamapps/common/SteamLinuxRuntime_sniper/sniper_platform_0.20240618.92328/files/lib/i386-linux-gnu/libicudata.so.67.1 .local/share/Steam/steamapps/common/SteamLinuxRuntime_sniper/var/tmp-XG8DQ2/usr/lib/i386-linux-gnu/libicudata.so.67.1

I'll run memtest soon just to make sure.

trgeiger commented 5 months ago

Update: memtest86+ passed. So no memory or disk hardware issues on my end. I deleted those sniper runtime files and had steam redownload them and I haven't had the issue pop up again, yet.

Destinyg133 commented 2 months ago

I have problem exactly like this Using flatpak everything is fine Or if I move game from windows to linux and redownloading last parts

jorp commented 1 month ago

Sorry to bump an old issue, but looks like this problem has returned. I tested my RAM and found a bunch of errors again.. I was surprised because I just replaced it a few months ago.

I did notice that disabling XMP in my BIOS resulted in no more memtest86 errors. However, it looks like this issue with BTRFS is still persisting.

It did go away for some time after RMA-ing my RAM, and I guess it is possible that this pair somehow became faulty too.. not really sure were to direct tickets and information. My issue here was (understandably) closed, and I am still seeing others that are loosely related here.

jorp commented 1 month ago

@kisak-valve could you reopen this?