-
The current repository contains a copy of SSE2NEON, which is quite old and non-maintained. [DLTcollab/sse2neon](https://github.com/DLTcollab/sse2neon) is arguably the most active fork of existing SSE2…
jserv updated
4 years ago
-
Hi,
I successfully compiled and installed StreamFX (bundled with obs-studio) on Linux-arm64 (details below). Doing so I encountered and temporarily fixed three issues with the Linux build process tha…
-
I tried compiling on aarch64 but got the following error:
```
In file included from ./mupen64plus-rsp-paraLLEl/arch/simd/rsp/rsp_common.h:15,
from mupen64plus-rsp-paraLLEl/state.…
-
The NEON-mapped function _mm_sqrt_ps does not handle zero inputs properly. Instead of returning 0, it returns a NaN value. This is caused by the underyling function 'vrsqrteq_f32' which returns +inf. …
-
This is follow up discussion from https://github.com/urho3d/Urho3D/pull/950#issuecomment-148293525.
Need proper instrumentation tool to measure if this approach is beneficial at all.
-
new function:
// added by wangyongxin
// Shift packed 64-bit integers in a left by imm8 while shifting in zeros, and store the results in dst. https://software.intel.com/sites/landingpage/Intrinsics…
-
Hi,
Thank you for your work to convert SSE instructions to NEON! Do you know how to convert _mm_madd_epi16 to neon instructions? Many thanks!
Best Regards,
Frank
-
There is no license file in the repository, and I didn't see any mentioned in the source. Is there a particular license this is supposed to be released under?
-
- [x] `_mm_hsub_ps`
- [x] `_mm_avg_epu16`
- [x] `_mm_avg_epu8`
- [x] `_mm_cvtpd_epi32`
- [x] `_mm_cvtps_epi32`
- [x] `_mm_madd_epi16`
- [x] `_mm_movemask_epi8`
- [x] `_mm_maskmoveu_si128`
- […
p0nce updated
4 years ago
-
Hi All,
I want to use the Intel camera sr300 in Odroidxu4, therefore I need to convert some functions from SSE to NEON, which are at least four functions:
- _mm_setr_epi8
- _mm_shuffle_epi8
- _mm_sto…