-
even if there is no plan to implement arm-instructions, it is moslty possible to use the sse2neon lib to emulate sse instructions on ARM.
So you can use vectorclass on ARM
i found some things to b…
-
This might be a bug in clang but figure I'd report it here first.
I have a technique I use to clamp NaN values to zero.
It's pretty simple, you exploit the fact, nan > 0.0f == false
```c
#defi…
-
Great to see that you have ported/optimized quite a lot of plugins to macOS! Especially porting VCL2 to Neon is great. Have you tried to open pull requests for the plugins, so that your changes will b…
-
1. improve implementations of `_mm_shuffle_epi32` and `_mm_shuffle_ps`
-
# Summary
I found a project that converts Intel SSE intrinsics to Arm/Aarch64 NEON intrinsics ([sse2neon](https://github.com/DLTcollab/sse2neon)). Would faiss be faster if SSE support added to Arm …
gahoo updated
3 months ago
-
I'm also interested in porting VS plugins to macos and linux, especially Apple Silicon Macos. However, I faced great difficulty with hard-coded SIMD plugins, which failed to compile on non-x86 platfor…
-
Has anybody had success building for ARM, apple M1 in particular, and replacing the SSE2 instructions with NEON.
I tried using this library but was not fully successful. https://github.com/DLTcolla…
-
Am examining this on [Aarch 64](https://github.com/bkmgit/low-latency-crypto-areion/tree/arm-neon-attempt1). Would it be helpful to have a separate branch using sse2neon or have sse2neon supported in…
-
The `COMPATIBLE_MACHINE = "(arm|aarch64)"` usage in this recipe will not work for devices like the Ettus E310 since those values aren't in the `MACHINEOVERRIDES`. I'm sure this extends to other machi…
-
We imported all the implementations from [jratcliff63367/sse2neon](https://github.com/jratcliff63367/sse2neon) a long time ago, but [DLTcollab/sse2neon](https://github.com/DLTcollab/sse2neon/) has mad…