neon-intrinsics Search Results

1000+ results
for neon-intrinsics

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

facebookresearch/faiss #1778

Would faiss be faster if SSE support added to Arm?

# Summary I found a project that converts Intel SSE intrinsics to Arm/Aarch64 NEON intrinsics ([sse2neon](https://github.com/DLTcollab/sse2neon)). Would faiss be faster if SSE support added to Arm …

gahoo updated 3 months ago
2
mattdesl/mp4-wasm-encoder #2

Compiling code with NEON intrinsics to Wasm SIMD

> Ensure that WASM version of minih264 library is indeed taking advantage of SIMD (lots of NEON code that doesn't compile there) Really cool work :) I'm wondering if you tried https://emscripten…

ngzhian updated 3 years ago
2
pnggroup/libpng #372

link error on Intel macOS when building Universal Binary (ar…

Trying to build current git master (a37d4836519517bdce6cb9d956092321eca3e73b) as a [Universal Binary](https://en.wikipedia.org/wiki/Universal_binary) (or even just arm64 only) on a Intel Mac fails to …

seanm updated 1 week ago
30
llvm/llvm-project #16648

float instructions should only be lowered to NEON if precisi…

| | | | --- | --- | | Bugzilla Link | [16274](https://llvm.org/bz16274) | | Version | trunk | | OS | Linux | | Attachments | [Test file attached](https://user-images.githubusercontent.com/92601…

tobiasgrosser updated 1 month ago
3
llvm/clangir #589

AArch64 specific builtins/intrinsics

We currently don't emit ARM64 specific intrinsics/builtins, nor none for other arches as well. See `clang/lib/CIR/CodeGen/CIRGenBuiltinAArch64.cpp` for the paths full of asserts. The suggested way to …

bcardosolopes updated 1 month ago
6
llvm/llvm-project #43155

Inefficient code generated for NEON function computing GNU s…

| | | | --- | --- | | Bugzilla Link | [43810](https://llvm.org/bz43810) | | Version | trunk | | OS | Linux | | Attachments | [Archive of GNU hash function implementions and build/run scripts](https:…

rprichard updated 2 years ago
1
ARM-software/acle #216

[BUG] Missing intrinsics for AArch32 instructions VMLA.F16 a…

Alongside `VFMA.F16`/`VFMS.F16`, AArch32 offers `VMLA.F16`/`VMLS.F16` instructions which performs multiply-add operation **with** intermediate rounding. Importantly, the vector-by-vector lane form (e.…

Maratyszcza updated 1 year ago
1
rust-lang/miri #3172

Add support for aarch64 platform intrinsics

Currently this produces: ``` --> /Users/alex_gaynor/.rustup/toolchains/nightly-aarch64-apple-darwin/lib/rustlib/src/rust/library/core/src/../../stdarch/crates/core_arch/src/arm_shared/crypto.rs…

alex updated 2 hours ago
9
google/android-riscv64 #39

external/skia: optimization

lots of neon/sse intrinsics; we'll want similar for risc-v.

enh-google updated 7 months ago
5
microsoft/DirectXMath #39

Add XMVectorRound half away from zero alternative

The current `XMVectorRound` uses round-to-nearest (even) a.k.a. _banker's rounding_. This matches the implementation of the `_mm_round_ps` (SSE4) and `vrndnq_f32` (ARMv8 NEON) intrinsics rounding beha…

walbourn updated 3 weeks ago
2

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for neon-intrinsics

1000+ results
for neon-intrinsics