Closed supermar1010 closed 1 year ago
Hi. Did you manage to find a workaround for this?
I wouldn't say I found a workaround. I found out there are prebuilt versions in the arch4edu repo; it's linked in the readme :) They didn't work for me though, because I have an RX 570, which is not supported.
ROCm doesn't support MLIR-generated GPU kernels yet, so we need to include a further build argument: --define=tensorflow_enable_mlir_generated_gpu_kernels=0
EDIT:
The above is not exactly correct. While it gets the build further along, AMD have dropped support for gfx803 (RX 470/570) and earlier. It may still be possible to build against that target, but doing so requires at least a rocBLAS workaround that isn't implemented, so it seems any attempt to build this repo as it stands will fail, whether or not the user has a gfx803 card.
Also, if Bazel is run with Java 16, we need to set a JVM flag, as the default behavior changed in that version: bazel --host_jvm_args=--illegal-access=permit
There are still other issues I'm facing, but I will open a pull request with all of this once (hopefully) I've managed to build tensorflow-rocm.
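To make the two flags above concrete, here is a minimal sketch of how the command line might be assembled. It only prints the command rather than running it. The `--config=rocm` option and the `//tensorflow/tools/pip_package:build_pip_package` target are assumptions about a typical TensorFlow source build and are not taken from this thread; the function names are hypothetical helpers.

```shell
#!/bin/sh
# Sketch only: assembles a Bazel command line as a string; nothing is built.

# Print the Bazel startup option needed for the given Java major version.
# The version is passed in explicitly so the logic does not need a JDK.
bazel_startup_opts() {
    java_major="$1"
    if [ "$java_major" -eq 16 ]; then
        # Java 16 changed the default of --illegal-access, so permit it
        # explicitly for the Bazel host JVM (flag taken from the thread).
        printf '%s' '--host_jvm_args=--illegal-access=permit'
    fi
}

# Print the full build command for the given Java major version.
# ASSUMPTION: --config=rocm and the pip_package target are illustrative;
# adjust them to whatever the PKGBUILD actually invokes.
tf_rocm_build_cmd() {
    startup="$(bazel_startup_opts "$1")"
    echo "bazel ${startup:+$startup }build --config=rocm --define=tensorflow_enable_mlir_generated_gpu_kernels=0 //tensorflow/tools/pip_package:build_pip_package"
}
```

Calling `tf_rocm_build_cmd 16` prints the command with the JVM startup option placed before `build`, which is where Bazel expects startup options; for other Java versions the option is left out.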
@astrowave I believe gfx803 support is reintroduced in ROCm 4.3.0. Search for 'gfx803' here: http://radeonopencompute.github.io/ROCm/
Could you please look into this?
I'm building 2.6.0 with ROCm 4.3 and am stuck at the issue in the OP.
EDIT: after editing the PKGBUILD and restricting target to only gfx803, the compilation finishes! It might be because my rocblas is targeted for gfx803 only.
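For readers wondering what restricting the target might look like, here is a hedged sketch. It assumes the build picks up its GPU targets from the `TF_ROCM_AMDGPU_TARGETS` environment variable used by TensorFlow's ROCm configuration; the exact line edited in the PKGBUILD is not shown in this thread and may differ.

```shell
# ASSUMPTION: the PKGBUILD passes GPU targets to TensorFlow's configure step
# through this environment variable. Listing only gfx803 restricts the build
# to that single architecture.
export TF_ROCM_AMDGPU_TARGETS=gfx803
```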
@supermar1010 You might need to rebuild rocBLAS. Try deleting library/src/blas3/Tensile/Logic/asmfull/r9nano*.yaml from the rocBLAS source, according to this: https://github.com/xuhuisheng/rocm-build/tree/master/gfx803#rocm-41-and-rocm-42-crashed-with-gfx803
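The deletion step above could be scripted roughly as follows. This is a sketch under assumptions: the function name is hypothetical, and the path to the rocBLAS checkout has to be supplied by the user; only the relative path and file pattern come from the comment above.

```shell
#!/bin/sh
# Sketch only: remove the r9nano Tensile logic files from a rocBLAS checkout
# before rebuilding it.
# $1: path to the rocBLAS source tree (hypothetical; supply your own).
remove_r9nano_logic() {
    src="$1"
    logic_dir="$src/library/src/blas3/Tensile/Logic/asmfull"
    if [ ! -d "$logic_dir" ]; then
        echo "no such directory: $logic_dir" >&2
        return 1
    fi
    # -f keeps rm quiet if no file matches the pattern.
    rm -f "$logic_dir"/r9nano*.yaml
}
```

It would be run as `remove_r9nano_logic /path/to/rocBLAS` before rebuilding rocBLAS; other logic files in the same directory are left untouched.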
Closing this issue as a stale build issue. If you have further issues please open up another issue. Sorry @supermar1010 @DumbledoreMD
I got pretty far with the build, but then a segmentation fault was raised; I'm not sure if this is arch-specific.
Any ideas? Or should I post this on the tensorflow repo?