arm-sve Search Results - Githubissues

1000+ results
for arm-sve

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

rust-lang/rustc_codegen_cranelift #1136

Please support non-2^N SIMD lane counts

The silicon that supports these more or less directly: GPUs handle Vec3s (`f32x3` typically) all the time already. Arm SVE supports 384-bit width vector registers and is available Soon™. RISCVV wil…

workingjubilee updated 1 year ago
4
ollama/ollama #7597

detect missing GPU runners and don't report incorrect GPU in…

### What is the issue? ``` $ ollama -v ollama version is 0.4.1 $ ollama run llama3.2-vision:latest $ ollama ps NAME ID SIZE PROCESSOR UNTIL …

kaleocheng updated 5 days ago
20
facebookresearch/faiss #1778

Would faiss be faster if SSE support added to Arm?

# Summary I found a project that converts Intel SSE intrinsics to Arm/Aarch64 NEON intrinsics ([sse2neon](https://github.com/DLTcollab/sse2neon)). Would faiss be faster if SSE support added to Arm …

gahoo updated 4 months ago
2
dotnet/runtime #94024

[API Proposal]: Arm64: FEAT_F32MM

```csharp namespace System.Runtime.Intrinsics.Arm; /// VectorT Summary public abstract partial class SveF32mm : AdvSimd /// Feature: FEAT_F32MM { public static unsafe Vector MatrixMultiplyA…

a74nh updated 3 months ago
6
llvm/llvm-project #76783

[VFABI] Add explicit element count information for scalable …

b1fba568f6d4d824554fa22f43ec0f689a265664 determines the VF from a mangling function by its scalar type size. It works for SVE, but maybe not work for RVV. Since RVV has register group concept, it make…

yetingk updated 10 months ago
2
pytorch/executorch #6844

Fail to Build on AArch64 Device Due to Undefined _Float16 ty…

### 🐛 Describe the bug Hi everyone, thanks for your effort on this issue. I'm trying to build executorch on my OrangePi 5 Pro board equipped with an 8 core ARMv8 CPU, But I encountered a compile …

1awrenceYang updated 1 week ago
2
microsoft/BitNet #50

Extremely slow

Extremely slow in CPU mode

seghier updated 4 weeks ago
3
llvm/llvm-project #56431

No efficient way to convert between fixed and vscale vectors…

LLVM has fixed and vscale vectors. Casting between them is generally not allowed. Use of ```llvm.vector.insert.*``` and ```llvm.vector.extract.*``` should be a way to accomplish a zero cost conversion…

zvookin updated 1 year ago
1
iains/gcc-14-branch #2

Apparent conflict between `-mtune=native` and `-march=native…

Reported first at https://github.com/fxcoudert/gfortran-for-macOS/issues/65 ``` meau /tmp $ gfortran-14 a.f90 -O3 -mtune=native -march=native f951: Error: unknown value 'apple-m1' fo…

fxcoudert updated 6 months ago
4
AnonymousYWL/TPDS #1

Instructions to run and execute the libshalom2 library

Hello @AnonymousYWL , Can you please provide instructions on how to use the libshalom2 library and also its gemm kernel API's? How to run a basic example code on your novel gemm kernel?

vineel96 updated 5 months ago
2

上一页 1...12 13 14 15 16 17 18...100 下一页

1000+ results for arm-sve

1000+ results
for arm-sve