arm-sve Search Results - Githubissues

1000+ results
for arm-sve


pytorch/pytorch #123835

Package manager install on Nvidia Grace Hopper does not make…

### 🐛 Describe the bug Following any package manager install then running: ` import torch torch.cuda.is_available() ` returns false If I build a wheel from source it is successful, but the …

PipGrylls updated 7 months ago
3
llvm/llvm-project #109526

[sve][abi] gcc and llvm don't have the same call conventions

* test: https://gcc.godbolt.org/z/5o8YofrrG ``` class SimdFloat { private: typedef svfloat32_t simdInternalType_ __attribute__((arm_sve_vector_bits(512))); public: SimdFl…

vfdff updated 1 week ago
7
DynamoRIO/dynamorio #6662

Add new record filter tool for public release of traces

We want to create a new tool to filter traces of Google workloads for public release. The new public Google workload traces will contain more information compared to the previous version, while still…

edeiana updated 7 months ago
4
BLAKE3-team/BLAKE3 #315

blake3 single thread is slower than sha256 on Apple silicon

On Intel CPUs I see ~10x speedup for the C implementation, but on Apple silicon it is 1.3x slower than sha256. I built the C version both with cmake and manually (based on README.md), both s…

nirs updated 9 months ago
3
ollama/ollama #6946

llama runner process has terminated: exit status 0xc0000005

### What is the issue? It's again the https://github.com/ollama/ollama/issues/6011 issue. **The issue is with embedding call with the model converted using convert_hf_to_gguf.py.** litellm.ll…

viosay updated 4 weeks ago
5
dotnet/runtime #107537

ARM64-SVE: ExtendWidening ConditionalSelect tests failing

``` ❯ DOTNET_TieredCompilation=0 $CORE_ROOT/corerun ./artifacts/tests/coreclr/linux.arm64.Checked/JIT/HardwareIntrinsics/HardwareIntrinsics_Arm_ro/HardwareIntrinsics_Arm_ro.dll Sve_ZeroExtendWidening…

a74nh updated 1 month ago
3
dragonwell-project/dragonwell8 #682

[nightly][aarch64]jdk/jfr/javaagent/TestLoadedAgent.java fails randomly…

https://tone.aliyun-inc.com/ws/xesljfzh/test_result/368659?tab=1 [Environment setup] ``` BINARY_URL=oss://compiler-ci-bucket/dragonwell8/CI/tar/20240907-002919-492-#541-linux.aarch64.release.master-6b6e5240…

sendaoYan updated 2 months ago
2
xtensor-stack/xsimd #799

AArch64 Neon float16x4

As I understand, currently xsimd does not support vectors of 16-bit floats. Their support has been recently added by AVX512-FP16, but I'm particularly interested in `float16x4` of ARM Neon. What would…

dmikushin updated 2 years ago
2
google/highway #1903

OrderedDemote2To() f64->f32 ?

I'm migrating my DSP codebase from my own attempt of a library to Highway at the moment. Things went mostly well but I found one thing a bit puzzling: I have some algorithms that work on float lanes, …

Pflugshaupt updated 11 months ago
3
riscvarchive/riscv-gcc #360

"remove -mrvv compile option" make newlib library code has s…

The newlib library is built with -O2 by default; the -O2 option implies auto-vectorization -mriscv-vector-bits=128, which makes library code generate vector code that only works for spike --varch=vlen:1…

chihinko updated 1 year ago
2

