Closed remifontan closed 4 years ago
Thank you for the heads up on this, I will look into it tonight probably.
I've put it in a fix, new version up on crates.io, give it a try, if it looks good now I'll close this. Thanks again.
apologies, it took me a long time to get back.
works great, thanks :-)
I'm debugging some inconsistency between multiplying 2 i32x8 using the overload impl Mul and manually calling
S::mullo_epi32(...)
.looking at the source code, mullo_epi32 seems to be using
_mm256_mullo_epi32
, while the impl Mul seems to be using_mm256_mul_epi32
.In my case, the mullo seems to be computing the proper result...
This is with avx2, I haven't checked the other implementation.