HaveAGitGat / Tdarr

Tdarr - Distributed transcode automation using FFmpeg/HandBrake + Audio/Video library analytics + video health checking (Windows, macOS, Linux & Docker)

Tdarr ffmpeg stack trace #1115

Open marshalleq opened 2 weeks ago

marshalleq commented 2 weeks ago

Please put plugin requests/bugs at: https://github.com/HaveAGitGat/Tdarr_Plugins

I have been trying to pinpoint some system issues lately (running on TrueNAS Electric Eel) and have just seen tdarr-ffmpeg core dumps in dmesg, which is what brought me here. The system has 128 GB of RAM, and swappiness is set to 1.

`2024 Nov 6 01:23:23 Skywalker Process 591446 (tdarr-ffmpeg) of user 568 dumped core.

Stack trace of thread 779:

0 0x00007f8df1891014 n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x93014)

1 0x00007f8df1893663 n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x95663)

2 0x00007f8df187dc0d n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x7fc0d)

3 0x00007f8df187077a n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x7277a)

ELF object binary architecture: AMD x86-64`

The system is a Threadripper 2950X, which has served me well with Tdarr for a long time. I saw this happening a while back and assumed it was because I'd switched from Unraid to openSUSE for my encoding node. I no longer think that's the reason: either this is something about later versions of Tdarr, or it always happened and I never noticed. The point is that this isn't exactly new, though it may be fairly recent. If there are any other logs you'd like to see, please let me know.

I also received the alert below overnight. It seems to refer to the dataset that holds core dumps, though I haven't figured out how to access it yet.

Quota exceeded on dataset ssd1pool/.system/cores. Used 79.78% (816.96 MiB of 1 GiB).

To Reproduce: Queue up some files and encode.

Please provide the following information: Sorry, I'm running out to work in a moment and just wanted to get this started. I can add all of this later if it's needed.

-Worker error [can be found on the 'Tdarr' tab by pressing the 'i' button on a failed item in the staged file section or in the transcode error section at the bottom]: There are a lot of these.

2024-11-06T07:59:07.343Z zfMnvs3mEfo:Node[Obiwan Node 1]:Worker[false-frog]:[Step W07] [C1] Worker [-error-]
2024-11-06T07:59:07.344Z zfMnvs3mEfo:Node[Obiwan Node 1]:Worker[false-frog]:Subworker killed
2024-11-06T07:59:07.344Z zfMnvs3mEfo:Node[Obiwan Node 1]:Worker[false-frog]:[-error-]
2024-11-06T07:59:07.344Z zfMnvs3mEfo:Node[Obiwan Node 1]:Worker[false-frog]:Subworker exited null
2024-11-06T07:59:09.386Z zfMnvs3mEfo:Node[Obiwan Node 1]:Worker[false-frog]:[2/2] Delete success
2024-11-06T07:59:09.386Z zfMnvs3mEfo:Node[Obiwan Node 1]:Worker[false-frog]:Updating transcode stats
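Since there are a lot of these entries, a small script can help pull out just the failure lines per worker. The following is a minimal sketch only: the line layout (timestamp, a short prefix, `Node[...]`, `Worker[...]`, message) is inferred from the six lines above and is not a documented Tdarr format, and the names `LOG_RE`, `parse_worker_log`, and `failure_lines` are hypothetical.

```python
import re

# Layout inferred from the sample lines in this report only;
# this is an assumption, not an official Tdarr log format.
LOG_RE = re.compile(
    r"^(?P<timestamp>\S+)\s+"
    r"(?P<prefix>[^:\s]+):"
    r"Node\[(?P<node>[^\]]+)\]:"
    r"Worker\[(?P<worker>[^\]]+)\]:"
    r"(?P<message>.*)$"
)

SAMPLE_LOG = """\
2024-11-06T07:59:07.343Z zfMnvs3mEfo:Node[Obiwan Node 1]:Worker[false-frog]:[Step W07] [C1] Worker [-error-]
2024-11-06T07:59:07.344Z zfMnvs3mEfo:Node[Obiwan Node 1]:Worker[false-frog]:Subworker killed
2024-11-06T07:59:07.344Z zfMnvs3mEfo:Node[Obiwan Node 1]:Worker[false-frog]:[-error-]
2024-11-06T07:59:07.344Z zfMnvs3mEfo:Node[Obiwan Node 1]:Worker[false-frog]:Subworker exited null
2024-11-06T07:59:09.386Z zfMnvs3mEfo:Node[Obiwan Node 1]:Worker[false-frog]:[2/2] Delete success
2024-11-06T07:59:09.386Z zfMnvs3mEfo:Node[Obiwan Node 1]:Worker[false-frog]:Updating transcode stats
"""


def parse_worker_log(text):
    """Split each matching line into timestamp, prefix, node, worker, message."""
    entries = []
    for line in text.splitlines():
        match = LOG_RE.match(line.strip())
        if match:
            entries.append(match.groupdict())
    return entries


def failure_lines(entries):
    """Keep entries whose message mentions an error, a kill, or an exit."""
    markers = ("[-error-]", "killed", "exited")
    return [e for e in entries if any(k in e["message"] for k in markers)]


if __name__ == "__main__":
    for entry in failure_lines(parse_worker_log(SAMPLE_LOG)):
        print(entry["timestamp"], entry["worker"], entry["message"])
```

Run against a full exported log, this would give a rough list of failure timestamps that could be lined up with the core-dump times in dmesg.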

-OS: TrueNAS Electric Eel
-Browser: Safari (latest)

marshalleq commented 2 weeks ago

Just adding that I also get the following in the host dmesg:

`2024 Nov 10 12:08:58 Skywalker Process 1644959 (tdarr-ffmpeg) of user 568 dumped core.

Stack trace of thread 1789:

0 0x00007fcfebc62014 n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x93014)

1 0x00007fcfebc64663 n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x95663)

2 0x00007fcfebc4ec0d n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x7fc0d)

3 0x00007fcfebc4177a n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x7277a)

ELF object binary architecture: AMD x86-64

2024 Nov 10 12:41:54 Skywalker Process 1671381 (tdarr-ffmpeg) of user 568 dumped core.

Stack trace of thread 2169:

0 0x00007fa0734d1014 n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x93014)

1 0x00007fa0734d3663 n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x95663)

2 0x00007fa0734bdc0d n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x7fc0d)

3 0x00007fa0734b077a n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x7277a)

ELF object binary architecture: AMD x86-64`
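One thing that can be checked directly from the traces in this report is whether every dump crashes at the same place. The absolute addresses differ between dumps, but the offsets into `libx265.so.199` can be compared. Below is a minimal sketch that parses the frame lines as pasted here and compares those module-relative offsets. The function names (`parse_frames`, `crash_signature`, `load_bases`) are hypothetical, and the regex assumes the frame layout shown in this issue; it also tolerates a leading `#` on the frame index.

```python
import re

# Frame layout as pasted in this issue, e.g.
# "0 0x00007f8df1891014 n/a (/usr/lib/.../libx265.so.199 + 0x93014)"
FRAME_RE = re.compile(
    r"^\s*#?(?P<index>\d+)\s+0x(?P<addr>[0-9a-fA-F]+)\s+\S+\s+"
    r"\((?P<module>\S+)\s+\+\s+0x(?P<offset>[0-9a-fA-F]+)\)"
)

DUMP_NOV_6 = """\
0 0x00007f8df1891014 n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x93014)
1 0x00007f8df1893663 n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x95663)
2 0x00007f8df187dc0d n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x7fc0d)
3 0x00007f8df187077a n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x7277a)
"""

DUMP_NOV_10 = """\
0 0x00007fcfebc62014 n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x93014)
1 0x00007fcfebc64663 n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x95663)
2 0x00007fcfebc4ec0d n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x7fc0d)
3 0x00007fcfebc4177a n/a (/usr/lib/x86_64-linux-gnu/libx265.so.199 + 0x7277a)
"""


def parse_frames(text):
    """Return (index, absolute address, module path, module offset) per frame."""
    frames = []
    for line in text.splitlines():
        m = FRAME_RE.match(line)
        if m:
            frames.append(
                (int(m["index"]), int(m["addr"], 16), m["module"], int(m["offset"], 16))
            )
    return frames


def crash_signature(frames):
    """Module-relative offsets in frame order; unaffected by the load address."""
    return tuple((module, hex(offset)) for _, _, module, offset in frames)


def load_bases(frames):
    """Load address implied by each frame: absolute address minus offset."""
    return {hex(addr - offset) for _, addr, _, offset in frames}


if __name__ == "__main__":
    a = parse_frames(DUMP_NOV_6)
    b = parse_frames(DUMP_NOV_10)
    print("same crash site:", crash_signature(a) == crash_signature(b))
    print("load base, Nov 6 dump: ", load_bases(a))
    print("load base, Nov 10 dump:", load_bases(b))
```

For the dumps pasted in this issue the four offsets are identical in each trace, which suggests a single repeatable crash site inside libx265 rather than random corruption. Turning those offsets into function names would need debug symbols for that exact library build, which are not part of this report.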