Closed ambrad closed 6 years ago
On KNL one sees:
HOMMEXX AVX_VERSION: 512
HOMMEXX VECTOR_SIZE: 8
HOMMEXX vector tag: AVX
On GPU one sees:
HOMMEXX AVX_VERSION: 0
HOMMEXX VECTOR_SIZE: 1
HOMMEXX vector tag: SIMD
(edited to contain just the relevant lines)
GPU and KNL tests pass. I ran full the full single-node performance scripts on V100 and SKX and confirmed performance is correct.
I'm removing inlining from this PR, as it hurts ARM performance.
Occurred to me there's a string function in the vector classes, IIRC. Will look tmw. Could replace the functions with Scalar::str() or whatever it's called.
Print AVX or SIMD at startup.
Force SIMD calls to be inline.