Open FluxusMagna opened 2 years ago
Make sure you have the latest ROCm installation. gfx1030 support is enabled in the 5.1 branch (see https://github.com/ROCm-Developer-Tools/ROCclr/blob/rocm-5.1.x/device/device.cpp#L183).
I am using 5.1.1 so that should not be the issue.
@vsytch it would be helpful if you could point us to the logic of platform.getDevices
to help us trace this down. As in how (and where in the code, specifically) are the devices queried from the hardware?
Yeah I noticed this too. My Raven APU shows up, but I don't see the gfx1030.
Can you run:
AMD_LOG_LEVEL=4 clinfo
I just realised that my install of compiler (clang/comgr/llvm) was messed up. E.g. it was trying to use comgr with an older clang/llvm somehow, so it obviously failed. There might be an issue with the rocm-arch packages. By default comgr should statically link against clang, but it is possible to dynamically link it, which is what I did.
That sounds very plausible. I got it to work with repackaged Ubuntu packages, so it is likely something related to the arch packages. I forgot to post that here though. At the moment I have no quick way to test the hypothesis as the machine is currently in use for relatively urgent work, but you can close the issue if you see fit.
The card shows up in
lshw
androcminfo
, butclinfo
shows0
devices for the AMD platform. I previously had a discussion(https://github.com/rocm-arch/rocm-arch/discussions/768) with the arch-linux package maintainers(@acxz) and concluded that it seems to involve upstream code.clinfo
output:rocminfo
output:The likely problem was traced back to
https://github.com/RadeonOpenCompute/ROCm-OpenCL-Runtime/blob/bbdc87e08b322d349f82bdd7575c8ce94d31d276/tools/clinfo/clinfo.cpp#L124
and then
https://github.com/RadeonOpenCompute/ROCm-OpenCL-Runtime/blob/bbdc87e08b322d349f82bdd7575c8ce94d31d276/tools/clinfo/clinfo.cpp#L115