-
[EXL (EVA extended library)](https://eprint.iacr.org/2021/1505) supports general EXL vector sizes that are translated into CKKS ciphertext slots by the following:
* User specifies their arbitrary inp…
asraa updated
2 months ago
-
I made [a recipe](https://github.com/procxx/conan-boost-ex-simd) for a fork of the removed Boost.SIMD library (I archived it too, [here](https://github.com/procxx/boost.simd)) (not planned for conan-cent…
-
This likely caused poor assembly for `Simd::splat` in some circumstances:
https://rust-lang.zulipchat.com/#narrow/stream/257879-project-portable-simd/topic/Very.20bad.20.60u16x8.3A.3Asplat.60.20codeg…
-
```rust
#![feature(portable_simd)]
use std::simd::{u8x16, LaneCount, Simd, SimdElement, SimdInt, SimdPartialOrd, SupportedLaneCount};
fn splat<T, const N: usize>(x: T) -> Simd<T, N>
where
T: SimdElement,
…
-
Currently NanoRT does not utilize SIMD/AVX, and it has no quantized BVH support.
It would be good to start considering optimization and quantization.
Fortunately, recent CPU architectures (AlderLake, Z…
-
As mentioned in #13, this would be a useful operation to have for (float) operations in cfavml itself. For subnormal or large numbers, computing it directly will lead to underflow or overflow. I'm in the …
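
The snippet above is cut off and does not name the operation, so the following is only an illustrative sketch. It assumes the operation is something like a Euclidean norm. The helpers `norm_naive` and `norm_scaled` are hypothetical and are not part of cfavml. They show how squaring values directly can overflow or underflow in `f32`, and how dividing by the largest magnitude first avoids that.

```rust
/// Naive Euclidean norm (hypothetical helper): squares each element directly,
/// so very large inputs overflow to infinity and very small ones underflow to zero.
fn norm_naive(v: &[f32]) -> f32 {
    v.iter().map(|x| x * x).sum::<f32>().sqrt()
}

/// Scaled Euclidean norm (hypothetical helper): divides every element by the
/// largest magnitude before squaring, so each squared term lies in [0, 1].
fn norm_scaled(v: &[f32]) -> f32 {
    // Largest absolute value in the slice (0.0 for an empty slice).
    let max = v.iter().fold(0.0f32, |m, x| m.max(x.abs()));
    // All zeros (or empty), or an infinite element: the maximum is the answer.
    if max == 0.0 || !max.is_finite() {
        return max;
    }
    let sum: f32 = v
        .iter()
        .map(|x| {
            let s = x / max;
            s * s
        })
        .sum();
    // Undo the scaling after the square root.
    max * sum.sqrt()
}
```

With the input `[3.0e30, 4.0e30]`, `norm_naive` returns infinity because `9e60` does not fit in an `f32`, while `norm_scaled` returns a value close to `5.0e30`. The scaled form costs an extra pass over the data and one division per element, which is the usual trade-off for this kind of robustness.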
-
Since we use many of our own types, we might be able to use vector and matrix types from System.Numerics instead of OpenTK types, which may result in significant speedups in calculations.
We shoul…
-
We should document the various env vars (`ARROW_SIMD_LEVEL`, `ARROW_RUNTIME_SIMD_LEVEL`, `ARROW_USER_SIMD_LEVEL`, others?).
We should also document what the defaults are (and what that …
-
```
OpenJPEG already uses SIMD code (only SSE) when __SSE__ is defined.
This is the case for x64 code by default (not under Visual Studio though)
because x64 CPUs all have SSE/SSE2. This is also the …