-
### What happened?
When I try to use an instruct model with latest llama cpp, model does not load. Also the instruct flag is nowhere to be found.
### Name and Version
b3384
### What operating syst…
-
Expanding pixel quads from 2x2 to 4x4 could open up the possibility of AVX2 or even AVX512 optimizations. The edge and gradient equations make this tricky, as well as cubemap sampling (they all set in…
-
### 🐛 Describe the bug
Hello PyTorch team!
First of all, thank you very much for your great work on PyTorch 2. Its amazing! Please keep it up!
I am writing because I need help with a strange p…
-
### Describe the bug
I tried `torch.linalg.svd` a Max Series GPU using the Intel Devcloud and packages from the `intel` conda channel, and while I cannot reproduce the segfault, the performance on …
-
...even after building with
cmake -DRWKV_AVX=OFF -DRWKV_AVX2=OFF -DRWKV_AVX512=OFF -DRWKV_FMA=OFF -DBUILD_SHARED_LIBS=ON .
cmake --build . --config Release
This is on an N4200 cpu (has SSE2, SSSE…
-
为了兼容更多cpu 的指令集加速,acx2 一般是指2012 年后的cpu 了 ,在这之前的cpu 性能更差,感觉更需要这个,请问有什么办法能支持 SSE 指令集计算加速吗
-
Currently SIMD_LEN is set to 32 for everything. Should we just let users pass it for every method? Something like this:
```Rust
arr.iter().all_equal::()
```
-
I've compiled your `command` and `main` example projects with latest 16-th Clang on Windows.
I have CPU `Intel i7-2630QM @ 2.00GHz`, which has 4 cores (8 hardware threads). And CPU has AVX support.…
-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
…
-
### Environment
| Hardware | description |
|----------|---------------|
| GPU | Radeon RC5700XT |
| CPU | Ryzen |
| Software | version |
|----------|---------|
| OS | A…