-
# Summary
I believe there are some missing gemm_batch implementations, looking at the oneMKL docs it seems this should support. A `gemm_batch` with, two half matrices as input, a float matrix out, an…
-
# Summary
Hi All,
As I get started with oneMKL, I am trying to get a minimal GEMM example up and running, following the [[Dense Linear Algebra](https://oneapi-src.github.io/oneMKL/domains/dense…
-
-
Need to test https://scan.coverity.com/ tool for (at least) C/C++ static code analysis.
Steps:
1. [Register your project as a new project](https://scan.coverity.com/projects/new). Make sure to restri…
-
D:\Users\12719\anaconda3\python.exe D:\Users\12719\PycharmProjects\efficient-kan\tests\test_simple_math.py
20%|██ | 20/100 [00:01
wza13 updated
4 months ago
-
I am having trouble running Llava model in the benchmark suite. I am getting an error saying the LlavaConfig is unrecognized, but I see LlavaConfig in the choices.
Here is what I did:
- Set LLA…
-
I know that CUDA is considered 'vanilla' but why not give [SYCL](https://chsasank.com/sycl-portable-cuda-alternative.html) a shot? We can finally have things work on all GPUs OOB. I benchmarked SYCL a…
-
### 🐛 Describe the bug
The message **Intel oneMKL ERROR: Parameter 4 was incorrect on entry to SGELSD.** is printed, followed by a `RuntimeError` indicating that an internal assertion has failed in `…
-
# Summary
I found on MTL iGPU, if I call FP16 gemm of onemkl (no matter using OneAPI 2024.0 or 2024.2), the program will crash, and if I call it many times, it will cause my machine to freeze direc…
-
Intel MKL has a useful function for (scaled) in-place transposition:
https://www.intel.com/content/www/us/en/docs/onemkl/developer-reference-fortran/2024-1/mkl-imatcopy.html
I raised the issue o…