-
With stabilization of SIMD intrinsics in Rust 1.27 we can remove `simd` feature from the `blake2` crate.
-
Compile existing x86 SSE/AVX SIMD code into WASM SIMD is very attractive, developer can reuse existing library without rewrite it.
However currently only 128-bit subset of the AVX intrinsics are supp…
-
Trigonometric functions, ex. `_mm256_sin_ps`, are now only available when using intel compiler and intel svml library found. We could integrate the intrinsics below into `vec.h` to improve the perform…
-
# Reading past an array bounds is unsound
While you are correct that at the machine code level, one can read past an array bounds without invoking UB -- because at the machine code level, there is …
-
I tried this code:
```rust
#![feature(avx512_target_feature)]
#![feature(stdarch_x86_avx512)]
use std::arch::x86_64::*;
#[no_mangle]
#[target_feature(enable = "avx512bw")]
unsafe fn inlin…
-
Hi, Tim. Thank you for bring such a cool quantization tool to the community. However, I met a issue when trying to compile bnb from source code on PPC64LE CPUs of my school.
The errors regarding x8…
-
We build our releases on an old distribution (Debian Buster) because that's an easy way to produce binaries compatibles with older distros. Especially, we do that to link against an old libc and then …
-
For example, use SIMD intrinsics explicity, or use long long to process 8 bytes together?
-
We should have new for all datatypes, this requires also to have `vld1` for all datatypes.
https://developer.arm.com/architectures/instruction-sets/simd-isas/neon/intrinsics?search=vld1 for referen…
-
I am trying to cross compile for android AARCH64.
```
./configure CC=aarch64-linux-android-clang CXX=aarch64-linux-android-clang++ \
LDFLAGS=-L/tmp/openssl-android/lib LD=aarch64-linux-android-ld…