Closed Myriachan closed 11 months ago
Thanks for the suggestion, I really appreciate it.
Since I am not an expert in SSE2, I'm hesitant to make a change, especially since you mention it would be a very minor (if any) improvement, and I'd rather err on the safe side.
So I'll close the issue, but feel free to let me know if you can measure any significant improvement with this (or any other change).
It'd be a very minor optimization, if it does anything measurable at all, but:
Could be this instead:
pabsb
is also SSSE3, so requirements don't change. The advantage here is thatpabsb
is non-destructive, which could produce slightly better code. You'd have to try it, though...