-
## Description
this is another blas library which is faster than openblas, and unlike it, it supports avx512 as well. although not as fast as mkl in x86 and x86_64 processors, but it outperforms othe…
-
Hi,
First of all - thanks for creating (and open-sourcing) this swift code! Looks great!
I was looking through the SIMD wrappers for `AVX512F` in `vector.h` and I noticed a few wrappers that re…
-
Followup from PARQUET-1840 for current benchmarks it seems that doing removing the memset somehow either has no impact or is slightly worse. We should investigate using SIMD operations to speed up sp…
-
We observed some specific problems when going from CPUSummary.jl v0.1.8 to v0.1.14 at [Trixi.jl](https://github.com/trixi-framework/Trixi.jl). Everything is fine with the old version of CPUSummary.jl.…
-
**Is your feature request related to a problem? Please describe.**
On master branch we can:
```c++
auto v = std::vector();
/* fill v */
file.createDataSet("/path", v);
```
But we will fai…
-
| | |
| --- | --- |
| Bugzilla Link | [34876](https://llvm.org/bz34876) |
| Version | trunk |
| OS | All |
| CC | @RKSimon |
## Extended Description
There are no EVEX versions of VBLENDPS/VBLENDPD…
-
Below code works when I am using Mixtral model from Ollama directly. But when I use the IPEX-LLM optimized Mixtral model, the tool does not work. This is an easy tool for testing functionality , whic…
-
### 🐛 Describe the bug
1. https://github.com/intel/torch-xpu-ops/issues/261 - closed
2. Tensor isn't pinned with DataLoader(..., pin_memory=True,..)
command:
```python
PYTORCH_ENABLE_XPU_FALLBA…
-
**LocalAI version:**
Using Docker image:
`localai/localai:latest-aio-gpu-hipblas`
**Environment, CPU architecture, OS, and Version:**
- Ubuntu 22.04
- Xeon X5570 [Specs](https://ark.intel.c…
-
| | |
| --- | --- |
| Bugzilla Link | [36191](https://llvm.org/bz36191) |
| Version | trunk |
| OS | Windows NT |
| CC | @topperc,@ZviRackover |
## Extended Description
```ll
define @mul…