manodeep / Corrfunc

⚡️⚡️⚡️Blazing fast correlation functions on the CPU.
https://corrfunc.readthedocs.io
MIT License
164 stars 50 forks source link

New kernels for AVX-512 instruction sets for Knights landing #92

Closed manodeep closed 6 years ago

manodeep commented 7 years ago

KNL is now available on Stampede. KNL supports AVX-512F, AVX_512CD, AVX-512ER and AVX-512PF. Since the only division within an inner loop occurs in mocks/DDrppi_mocks, 512ER is only applicable there. There are no prefetch calls within the code yet, I will need to use Vtune to check if prefetch helps performance in the current incarnation.

Might as well add AVX2 abilities, since the login nodes for the KNL cluster are Haswell chips.

manodeep commented 6 years ago

Implementation exists in the avx512 branch. And being tracked within projects: AVX512 project