Open baoleai opened 1 year ago
@zhenying-liu WDYT?
https://github.com/openxla/xla/pull/4970 probably fixed that. Can you check that your version of XLA include this fix. Otherwise, it would be a variation of the same issue.
If that isn't the case, the error tell:
2023-09-26 23:49:25.317533: W external/xla/xla/stream_executor/gpu/redzone_allocator.cc:322] UNIMPLEMENTED: ptxas ptxas too old. Falling back to the driver to compile.
Relying on driver to perform ptx compilation.
ptxas too old
Can you update ptxas? Which version of CUDA do you use? Can you update it?
The XLA version is at https://github.com/openxla/xla/commit/7a371ed44aba34f83d6d3d1159d2e6d0d327c603 which including #4970, and cuda version is 11.8.0 with cudnn8. And when I revert #4970 , I no longer get the above error.
After #4970, there is an error on H800 when using torch_xla.