ghost closed this issue 7 years ago
Are you testing on the `development` branch? Perhaps that fixes some issues already. Furthermore, I would suggest omitting the `-fraction` argument, so that the first set of tuning tests runs in full (those are the parameters most likely to yield good results anyway).
I was testing on `master`, but a similar error shows up even on `development`. It looks like it's segfaulting inside the compiler.
I don't know whether the errors I get when tuning Xgemm in double precision are related to the problem mentioned above, but I will post them here first:
`[ RUN ] Running Xgemm
[ OK ] Completed Xgemm (25.6 ms) - 167 out of 213
device compiler error/warning: Error:E012:Insufficient Local Resources!
[ FAILED ] Kernel Xgemm failed
[ FAILED ] catched exception: device compiler error/warning occurred
[ FAILED ] Xgemm; 0.0 ms; MWG 32; NWG 128; KWG 32; MDIMC 16; NDIMC 16; MDIMA 32; NDIMB 16; KWI 2; VWM 1; VWN 4; STRM 0; STRN 1; SA 1; SB 1;PRECISION 64;
[ RUN ] Running Xgemm
[ OK ] Completed Xgemm (17.0 ms) - 169 out of 213`
It may be related to the driver. For comparison with the version mentioned above, my setup is: Device name: 'Tonga' (OpenCL 2.0 AMD-APP (1800.8))
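One way to check whether the `E012:Insufficient Local Resources` failure is simply a configuration that asks for too much local memory is to estimate the tile sizes from the failed line. The sketch below is a rough estimate, not CLBlast's own calculation: it assumes that `SA 1` and `SB 1` place a `KWG x MWG` tile of A and a `KWG x NWG` tile of B in local memory, and that the device allows 32 KiB of local memory per work-group (a value that should really be queried through `CL_DEVICE_LOCAL_MEM_SIZE`). The function names are made up for illustration.

```python
import re

# Failed configuration copied from the tuner output in this thread.
FAILED_LINE = (
    "[ FAILED ] Xgemm; 0.0 ms; MWG 32; NWG 128; KWG 32; MDIMC 16; "
    "NDIMC 16; MDIMA 32; NDIMB 16; KWI 2; VWM 1; VWN 4; STRM 0; "
    "STRN 1; SA 1; SB 1;PRECISION 64;"
)

# ASSUMPTION: 32 KiB of local memory per work-group. Query the real
# value with CL_DEVICE_LOCAL_MEM_SIZE on the device being tuned.
ASSUMED_LOCAL_MEM_LIMIT = 32 * 1024


def parse_parameters(line):
    """Collect 'NAME value;' pairs such as 'MWG 32;' into a dict of ints."""
    pairs = re.findall(r"([A-Z]+)\s+(\d+)\s*;", line)
    return {name: int(value) for name, value in pairs}


def estimate_local_memory_bytes(params):
    """Rough estimate: one tile of A and one tile of B, if enabled.

    ASSUMPTION: SA/SB are 0/1 switches for caching A/B in local memory,
    with tiles of KWG*MWG and KWG*NWG elements respectively.
    """
    bytes_per_element = params["PRECISION"] // 8
    a_tile = params["SA"] * params["KWG"] * params["MWG"]
    b_tile = params["SB"] * params["KWG"] * params["NWG"]
    return (a_tile + b_tile) * bytes_per_element


if __name__ == "__main__":
    params = parse_parameters(FAILED_LINE)
    needed = estimate_local_memory_bytes(params)
    print(f"estimated local memory: {needed} bytes")
    print(f"exceeds assumed limit:  {needed > ASSUMED_LOCAL_MEM_LIMIT}")
```

Under these assumptions the failed configuration needs about 40 KiB (40960 bytes) in double precision, which would exceed a 32 KiB limit. If that holds for the device in question, this particular failure would be an expected rejection of an oversized configuration rather than a compiler crash, and so a different problem from the segfault.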
There is one thread related to this error on the Khronos forums.
Is this still an issue with the latest AMD compilers? And do you think it is related to CLBlast at all?
I haven't seen any updates to Catalyst, so this should still be an issue.
In this thread https://github.com/clMathLibraries/clBLAS/issues/207 there is a solution to the issue you are describing. I managed to get things working after linking against the libOpenCL provided by the driver, NOT the one provided by the AMD APP.
Are you sure that issue is related? Also, you two might be discussing different issues: @akssri can you try @MigMuc's suggestion and link to a different libOpenCL and see if that works?
Still, I think AMD's compiler shouldn't segfault, so it is probably a good idea to report this to AMD.
@MigMuc libOpenCL is currently only being provided by AMDAPP on my machine. It looks like the bug in clMathLibraries/clBLAS#207 has something to do with multiple CPU threads?
@CNugteren Yes, this should definitely be AMD's task to fix; feel free to close this (or perhaps add some archival flag). I'll try to dump the kernel that is causing this issue and file a bug report with AMD.
Calling `clblast_xgemm_tuner` with
`-platform 0 -device 0 -precision 64 (double) -m 1024 [=default] -n 1024 [=default] -k 1024 [=default] -alpha 2.000000 [=default] -beta 2.000000 [=default] -fraction 512.000000`
results in a `SIGSEGV`. I think an issue filed earlier indicated that the cause of such segfaults lies in the driver. I'm currently using `Catalyst 15.12` with an R9 Nano.