Closed solidpixel closed 3 months ago
This change moves from NEON-style post-load widening, to SVE-style load-time widening. On Neoverse V1 this improves performance 1-5% (increasing with block size).
Also fixes a signed/unsigned mismatch warning.
This change moves from NEON-style post-load widening, to SVE-style load-time widening. On Neoverse V1 this improves performance 1-5% (increasing with block size).
Also fixes a signed/unsigned mismatch warning.