Open egoodarzi opened 3 days ago
Indeed, there is a problem with avx code that gets executed on machines without AVX512, I will have a look there. Can you test the repo as it was on 16. Aug. 2024, I think you should be able to get that state via git checkout a7fdcc1
then compile and run that code. Run cuvista -info
to see available devices.
Thanks for your quick reply, i tried that commit and result was exactly the same.
Yes, I can see, will have to investigate
Have a look at the latest commit, should work now
Hi Rainer, Your project looks awesome! Unfortunately I ran into some issue on startup when I was trying to run it on Ubuntu 24.04 with default compiler toolchain, AMD CPU, Nvidia GPU, Cuda 12.6. Here are some details below, hope you can suggest a workaround of some sorts. Cheers. When application starts it crashes on this line:
Disassembly shows this instruction:
vpbroadcastq %rcx,%xmm0
Seems like an issue with avx2? dmesg log:
[Mon Oct 14 22:46:56 2024] traps: cuvista[204770] trap invalid opcode ip:55b63e186213 sp:7ffe421fcc20 error:0 in cuvista[55b63e15a000+17c000]
The processor has support for avx, avx2, but not avx512:
Any suggestions would be appreciated.