-
We observed some specific problems when going from CPUSummary.jl v0.1.8 to v0.1.14 at [Trixi.jl](https://github.com/trixi-framework/Trixi.jl). Everything is fine with the old version of CPUSummary.jl.…
-
## SIMD Wrapper for ARM NEON, Intel AVX512 & KNC
- Pre-GSoC (until 26 April)
- [X] [Study the HPX, libflatarray codebase](https://github.com/larryxiao/libflatarray/issues/1)
- Community Bonding Period …
-
I compared an older version from Nov 23 with one from Apr 24, and the older version is much faster.
total time = 6225.76 ms
vs
total time = 3817.54 ms
Same CPU, same compiler and settings, same test: …
-
### Your current environment
```text
PyTorch version: 2.3.1+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubuntu 22.04.3 LTS (x86_64)
GCC ve…
```
-
### 🐛 Describe the bug
Minimal reproducer:
```python
import torch
x = torch.ones(1).expand(2)
print(f"{x.is_contiguous()=}")
print(f"{x.to(memory_format=torch.contiguous_format).is_contiguous(…
```
-
### What happened?
llama.cpp produces garbled output on the 310P3 when running QWen2.5-7b-f16.gg
### Name and Version
./build/bin/llama-cli -m Qwen2.5-7b-f16.gguf -p "who are you" -ngl 32 -fa
### What operating system are you seeing the …
-
### Steps to reproduce the issue
```console
$ spack install openmpi@4.1.4 %gcc@7.3.0 +legacylaunchers +gpfs +pmi schedulers=slurm >> log.openmpi 2>&1
```
### Error message
==> In…
-
**Is your feature request related to a problem? Please describe.**
On master branch we can:
```c++
auto v = std::vector<double>();  // element type assumed here for illustration
/* fill v */
file.createDataSet("/path", v);
```
But we will fai…
-
### Your current environment
Collecting environment information...
PyTorch version: 2.2.0
Is debug build: False
CUDA used to build PyTorch: 12.2
ROCM used to build PyTorch: N/A
OS: Ubuntu 22.0…
-
AVX-512 introduces new `zmm` and `k` registers. The Visual Studio Registers window can show these if you right-click in the window and choose "AVX-512". However, the values are "grayed out" when debug…