Open akyrtzi opened 2 years ago
@sneves what do you think?
NumPy fixed the same issue here: https://github.com/numpy/numpy/pull/19362
This other macOS bug makes me think that AVX-512 being effectively disabled there is probably for the best.
This has been fixed in macOS 12.2 (21D49), according to https://github.com/golang/go/issues/49233
It seems to me that BLAKE3's CPU feature detection is not working as expected. I have an "Intel Xeon W" processor (on macOS), which reports that AVX512 is supported:
However, the checks for `AVX512F` and `AVX512VL` using the `cpu_feature` function fail (it reports that these features don't exist), so it dispatches to the AVX2 implementation instead of the AVX512 one. When I modify the `cpu_feature` function to force it to report that AVX512 is available, the AVX512 implementation gets used and gives a further boost in performance.
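
One plausible explanation, which is an assumption and not something confirmed in this thread, is that the detection code checks both the CPUID feature bits and the OS-enabled register state in XCR0, and that macOS only enables the AVX-512 state bits lazily. Below is a minimal Python sketch of that gating logic. The names `avx512_usable` and `trust_cpuid_for_zmm` are hypothetical and are not part of BLAKE3; only the bit positions come from the x86 architecture definition.

```python
# CPUID.(EAX=7, ECX=0):EBX feature bits (architectural constants).
AVX512F_BIT = 1 << 16
AVX512VL_BIT = 1 << 31

# XCR0 state-component bits (architectural constants).
XCR0_SSE_AVX = 0x06  # bit 1 (SSE/XMM) and bit 2 (AVX/YMM)
XCR0_AVX512 = 0xE0   # bit 5 (opmask), bit 6 (ZMM_Hi256), bit 7 (Hi16_ZMM)


def avx512_usable(cpuid7_ebx, xcr0, trust_cpuid_for_zmm=False):
    """Return True if AVX512F and AVX512VL look usable.

    Hypothetical helper for illustration only.
    trust_cpuid_for_zmm=True models a platform that is assumed to enable
    the AVX-512 register state on first use, so the XCR0 bits 5-7 are
    not required to be set up front.
    """
    has_f = bool(cpuid7_ebx & AVX512F_BIT)
    has_vl = bool(cpuid7_ebx & AVX512VL_BIT)
    if not (has_f and has_vl):
        return False
    required = XCR0_SSE_AVX
    if not trust_cpuid_for_zmm:
        required |= XCR0_AVX512
    return (xcr0 & required) == required
```

With a strict check, a CPU that advertises both features but whose XCR0 reads `0x07` is rejected, which would match the AVX2 fallback described above. Relaxing the XCR0 requirement accepts it, which is roughly what forcing the function to report AVX512 amounts to.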