-
_VideoBlit::blit_overlay_alphasrc_mmx()_ is the SIMD implementation of _VideoBlit::blit_overlay_alphasrc()_ using x86 MMX instructions. The MMX register `mm6` is used for an unpack but its value is nev…
-
I would like to propose a small change regarding CMake support so it can be more easily integrated into projects using `CMake` and more importantly use `targets` as it propagates the include path and …
-
# **Issue №2227 opened by *[illume](https://github.com/illume)* at 2020-10-22 09:48:55**
Need to figure out how to build this with runtime detection.
Otherwise using SSE2 could be an option.
…
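One common way to get runtime detection is to compile both a scalar and a SIMD variant of a routine and pick one through a function pointer on first use. The sketch below assumes GCC or Clang on x86 (it relies on the `__builtin_cpu_supports` builtin and the `target` function attribute); the `add_sat` routine and its helpers are hypothetical stand-ins, not code from this project.

```c
#include <stddef.h>
#include <stdint.h>

#if (defined(__x86_64__) || defined(__i386__)) && (defined(__GNUC__) || defined(__clang__))
#define HAVE_X86_DISPATCH 1
#include <emmintrin.h> /* SSE2 intrinsics */
#endif

/* Portable fallback: dst[i] = min(255, dst[i] + src[i]). */
static void add_sat_scalar(uint8_t *dst, const uint8_t *src, size_t n)
{
    for (size_t i = 0; i < n; i++) {
        unsigned sum = (unsigned)dst[i] + src[i];
        dst[i] = (uint8_t)(sum > 255u ? 255u : sum);
    }
}

#ifdef HAVE_X86_DISPATCH
/* SSE2 variant, compiled with SSE2 enabled for this function only. */
__attribute__((target("sse2")))
static void add_sat_sse2(uint8_t *dst, const uint8_t *src, size_t n)
{
    size_t i = 0;
    for (; i + 16 <= n; i += 16) {
        __m128i a = _mm_loadu_si128((const __m128i *)(dst + i));
        __m128i b = _mm_loadu_si128((const __m128i *)(src + i));
        _mm_storeu_si128((__m128i *)(dst + i), _mm_adds_epu8(a, b));
    }
    /* Handle the remaining 0..15 bytes with the scalar code. */
    add_sat_scalar(dst + i, src + i, n - i);
}
#endif

typedef void (*add_sat_fn)(uint8_t *, const uint8_t *, size_t);

/* Ask the CPU what it supports and return the best available variant. */
static add_sat_fn select_add_sat(void)
{
#ifdef HAVE_X86_DISPATCH
    __builtin_cpu_init();
    if (__builtin_cpu_supports("sse2"))
        return add_sat_sse2;
#endif
    return add_sat_scalar;
}

/* Public entry point. The lazy initialisation is not thread-safe;
 * a real build would select the variant once at start-up. */
void add_sat(uint8_t *dst, const uint8_t *src, size_t n)
{
    static add_sat_fn impl = NULL;
    if (impl == NULL)
        impl = select_add_sat();
    impl(dst, src, n);
}
```

Because only the SIMD function carries the `target` attribute, the rest of the file can be built for a baseline CPU, and the binary still runs on machines without the extension.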
-
**Is your feature request related to a problem? Please describe.**
IBM z/Architecture provides SIMD capabilities that can be utilized for zstd optimization, similar to the SSE and NEON SIMD optimizations …
-
There are some pieces that could benefit from SSE optimizations. For example, the roundoff clipper was sped up 3x in an experiment.
![](https://dl.dropboxusercontent.com/u/75238938/soloud_roundoff_sse…
-
See also #64.
* [ ] make operations work for bit vectors with offset
* [x] support more operations
  * [x] `bitIndex` (#81)
  * [x] `nthBitIndex` (#81)
  * [x] `selectBits` (#82)
  * [x] `excl…
-
Is there any chance of getting generic versions of `Sse/Sse2.And/Or/Xor/AndNot()` in the API, as was done for `Avx2.BroadcastScalarToVector128()` and `Avx2.BroadcastScalarToVector256()`? (Not sure about…
-
mcm uses SSE intrinsics so it doesn't build on ARM. Maybe you could use the same approach the Skia folks used to generate good code for SSE, NEON, and CPUs with no SIMD instructions: https://skia.org/…
-
Now that SIMD intrinsics for x86 have been stabilized, it might be worthwhile to add explicit SIMD to accelerate unmasking. For example, autobahn-python [uses](https://github.com/crossbario/autobahn-p…
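For context, WebSocket unmasking (RFC 6455, section 5.3) XORs byte `i` of the payload with byte `i mod 4` of a 4-byte key. The following is a minimal sketch in C of the wide-XOR idea, working 8 bytes at a time in a plain `uint64_t`; an explicit SIMD version would widen the same loop to 16 or 32 bytes. The function name `ws_unmask` is hypothetical and is not taken from this crate or from autobahn-python.

```c
#include <stddef.h>
#include <stdint.h>
#include <string.h>

/* Unmask (or mask; the operation is its own inverse) a payload in place. */
void ws_unmask(uint8_t *buf, size_t len, const uint8_t mask[4])
{
    /* Repeat the 4-byte key twice to fill one 8-byte word. */
    uint8_t wide[8];
    memcpy(wide, mask, 4);
    memcpy(wide + 4, mask, 4);

    uint64_t m;
    memcpy(&m, wide, sizeof m);

    size_t i = 0;
    /* Bulk loop: memcpy keeps the loads and stores alignment-safe, and
     * because the key is copied bytewise the result does not depend on
     * the host byte order. */
    for (; i + 8 <= len; i += 8) {
        uint64_t w;
        memcpy(&w, buf + i, sizeof w);
        w ^= m;
        memcpy(buf + i, &w, sizeof w);
    }

    /* Tail: i is a multiple of 8 here, so i & 3 is still the right
     * index into the key. */
    for (; i < len; i++)
        buf[i] ^= mask[i & 3];
}
```

Applied to the masked "Hello" frame payload from RFC 6455 (key `37 fa 21 3d`, payload `7f 9f 4d 51 58`), this yields the bytes of `Hello`.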
-
This module currently lacks any form of acceleration; adding some would benefit everyone.
Options:
1. core.simd -- Supported everywhere, I think.
2. intel-intrinsics DUB package -- Somewhat sup…