-
### 🐛 Describe the bug
Following any package manager install then running:
`
import torch
torch.cuda.is_available()
`
returns false
If I build a wheel from source it is successful, but the …
-
* test: https://gcc.godbolt.org/z/5o8YofrrG
```
class SimdFloat
{
private:
typedef svfloat32_t simdInternalType_
__attribute__((arm_sve_vector_bits(512)));
public:
SimdFl…
-
We want to create a new tool to filter traces of Google workloads for public release.
The new public Google workload traces will contain more information compared to the previous version, while still…
-
On intel cpus I see ~10x speedup for the C implementation, but on Apple silicon it
is 1.3x times slower than sha256.
I built the C version both with cmake and manually (based on README.md), both s…
nirs updated
9 months ago
-
### What is the issue?
It's again the https://github.com/ollama/ollama/issues/6011 issue.
**The issue is with embedding call with the model converted using convert_hf_to_gguf.py.**
litellm.ll…
-
```
❯ DOTNET_TieredCompilation=0 $CORE_ROOT/corerun ./artifacts/tests/coreclr/linux.arm64.Checked/JIT/HardwareIntrinsics/HardwareIntrinsics_Arm_ro/HardwareIntrinsics_Arm_ro.dll Sve_ZeroExtendWidening…
a74nh updated
1 month ago
-
https://tone.aliyun-inc.com/ws/xesljfzh/test_result/368659?tab=1
【环境准备】
```
BINARY_URL=oss://compiler-ci-bucket/dragonwell8/CI/tar/20240907-002919-492-#541-linux.aarch64.release.master-6b6e5240…
-
As I understand, currently xsimd does not support vectors of 16-bit floats. Their support has been recently added by AVX512-FP16, but I'm particularly interested in `float16x4` of ARM Neon. What would…
-
I'm migrating my DSP codebase from my own attempt of a library to Highway at the moment. Things went mostly well but I found one thing a bit puzzling: I have some algorithms that work on float lanes, …
-
library newlib is built with -O2 by default, the -O2 option imply auto-vectorization -mriscv-vector-bits=128,
this would make library code generate vector code that only work for spike --varch=vlen:1…