Closed antoine-morvan closed 1 year ago
Oops, this got rearranged by #4009 without updating the definitions in KERNEL.A64FX (which explains why it compiles on Graviton3). I'll do a PR in a minute, but the trivial fix is to copy kernel/arm64/KERNEL.ARMV8SVE to kernel/arm64/KERNEL.A64FX (and I think @mousius ' intention was for all the cpu-specific SVE KERNEL definition files to go away and just use KERNEL.ARMV8SVE as the individual BLAS kernels for SVE are the same - at least currently)
Cannot try as of today, but I guess you could already prepare for NVidia Grace (Neoverse V2 Cores: Armv9 with 4x128b SVE2 : https://resources.nvidia.com/en-us-grace-cpu/nvidia-grace-cpu-superchip)
Certainly - in fact any SVE2 would be great, even just based on CortexX1 for testing, but it's all a question of time and talent...
Hello,
I just tried building OpenBLAS 0.3.24 on A64FX (Fujitsu FX 700), but it fails saying some files are missing.
When using the make build process, it fails with :
When using the still experimental CMake build process, I get :
Please note that:
My systems runs
Here is a sample script reproducing the build steps (compiler env setup to tune) : https://gist.github.com/antoine-morvan/bf6168f51f7d3b0058f69a664a9d7217
Best.