ashvardanian / SimSIMD

Up to 200x Faster Dot Products & Similarity Metrics — for Python, Rust, C, JS, and Swift, supporting f64, f32, f16 real & complex, i8, and bit vectors using SIMD for both AVX2, AVX-512, NEON, SVE, & SVE2 📐
https://ashvardanian.com/posts/simsimd-faster-scipy/
Apache License 2.0
988 stars 59 forks source link

Improve: 2x Faster L2 on Arm NEON #211

Closed ashvardanian closed 1 month ago

ashvardanian commented 1 month ago

Accelerated l2sq_i8_neon with vabdq_s8 and unsigned dot-product instructions from 17 GB/s to 33 GB/s.