-
Hi, is it possible to use either BLIS or MKL instead of OpenBLAS? I'm using a AMD EPYC 7543 and the performance without it is much faster, so I'm wondering if either of the two would help prompt eval …
-
### Describe the issue
Hi, I have following codes:
```python
import dpctl
import torch
import intel_extension_for_pytorch
xpu_num = len(dpctl.get_devices(backend="level_zero", device_type="g…
-
It was found that some tests hang/slow down significantly when running with portBLAS. Tested on Intel CPU/GPU, NVIDIA GPU
with DPC++ compiler from oneAPI Base Toolkit 2023.2 and opensource implementa…
-
`sycl::detail::pi` is used in:
https://github.com/oneapi-src/oneMKL/blob/6c5f7ea783a7fe828c0de3c064c3bc837727524d/src/blas/backends/cublas/cublas_scope_handle.cpp#L123
Please note there `detail`…
-
**Describe the bug**
I tried to run an application with GEOPM and I expected the GPUs to show no clients at the conclusion of a run instead a client remains, pointing to "geopmd".
**GEOPM version*…
-
# Summary
The example program crashes/aborts with following error:
./matrix_mul_mkl
Device: Intel(R) Iris(R) Xe Graphics [0x9a49]
Problem size: A (600x1200) * B (1200x2400) --> C (600x2400)
La…
-
I am using an AWS ARM c6gd.2xlarge Instance with Ubuntu and compiled the code for arrayfire using the Linux build instructions with CPU backend support. When running tests, the following had failures:…
-
Implement DGELSY in the [GE_Lapack_Method](https://github.com/vickysharma0812/easifem-base/blob/master/src/modules/Lapack/src/GE_Lapack_Method.F90)
DGELSY computes the minimum-norm solution to a re…
-
1. Profiling code for time with profiling techniques to find the parts to optimize (cProfiler etc.): https://machinelearningmastery.com/profiling-python-code/
2. MKL:
- https://pypi.org/project/mkl…
-
The below warning is raised from `cumulative_logsumexp` call during the first launch on CPU device:
```python
import dpctl, dpctl.tensor as dpt
dpctl.__version__
# Out: '0.17.0dev0+331.g1243edc8…