Open Keno opened 2 years ago
AMD Custom APU 0405
Is this a special/experimental APU? In the past, we've had bugs and segfaults with APUs (including on my own).
Are you using AMDGPU-provided ROCm artifacts, or system libraries?
Is this a special/experimental APU?
No, this is an AMD Van Gogh APU
Are you using AMDGPU-provided ROCm artifacts, or system libraries?
AMDGPU-provided
Can you try disabling artifacts with JULIA_AMDGPU_DISABLE_ARTIFACTS=1
and re-building AMDGPU? Assuming you have a system-provided ROCm available.
Can you try disabling artifacts with
JULIA_AMDGPU_DISABLE_ARTIFACTS=1
and re-building AMDGPU? Assuming you have a system-provided ROCm available.
Segfaults also, similar backtrace:
signal (11): Segmentation fault
in expression starting at none:0
unknown function (ip: 0x7f09e0f1e0fd)
unknown function (ip: 0x7f09e0f1e3b7)
hipStreamSynchronize at /opt/rocm/lib/libamdhip64.so (unknown line)
macro expansion at /home/deck/.julia/packages/AMDGPU/PtNLZ/src/hip/error.jl:149 [inlined]
hipStreamSynchronize at /home/deck/.julia/packages/AMDGPU/PtNLZ/src/hip/libhip.jl:2
So, if you want to just hide libamdhip64.so
from AMDGPU (just make it .bak
or similar), we can load without it. You may also need to do the same for rocBLAS, rocFFT, et. al.
If you actually want full functionality, then building glibc with debug symbols would be very helpful.
Is it actually in glibc though? Presumably __pthread_once_slow
calls back into whatever callback HIP passes it. I tried building HIP with debug symbols, but ran into https://github.com/JuliaPackaging/Yggdrasil/pull/4689#issuecomment-1081262980