It works fine using the OpenMP backend but when I set the device number for the GPU using: export HIP_VISIBLE_DEVICES=0; export ROCR_VISIBLE_DEVICES=0 , I get the following HIP error message:
Number of CPU cores: 16
Host thread affinity policy - thread mapping on every core
Number of HIP devices in the system: 1
Matrix+Vectors loading:5.7e-05 sec
HIP error: invalid device function
File: /long_pathname_so_that_rpms_can_package_the_debug_info/src/extlibs/rocALUTION/src/base/hip/hip_vector.cpp; line: 845
My small helloGPU device query program works fine:
I am implementing rocalution into an application.
It works fine using the OpenMP backend but when I set the device number for the GPU using:
export HIP_VISIBLE_DEVICES=0; export ROCR_VISIBLE_DEVICES=0
, I get the following HIP error message:My small helloGPU device query program works fine:
What could be the problem?
EDIT: export HIP_VISIBLE_DEVICES=0; export ROCR_VISIBLE_DEVICES=0; HSA_OVERRIDE_GFX_VERSION=10.3.0
adding HSA_OVERRIDE_GFX_VERSION=10.3.0 fixes the problem.
Maybe rocalution (I am using 5.7.1 RHEL version) is not compiled to support my gfx1031 card?