-
After #1542, I noticed that code which would work fine with `FORCE_INSTANT_SUBMISSION` enabled on the CUDA backend would fail when launched on an OpenMP device.
In particular, the following code w…
-
Hi,
I'm trying to leverage #4577 in a project (scikit-learn) that has a mix of OpenMP and OpenBLAS built with the pthreads threading layer to make OpenBLAS use the OpenMP threadpool. Ideally we'd u…
-
OpenMP 4.0 introduced offloading to attached devices, and OpenMP 4.5 extended it (future versions of OpenMP will likely only evolve the offload capability with more advanced features on top of a…
-
**Bug summary**
`get_device().get_info()` returns 18446744073709551615 for a CPU device.
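
For context, 18446744073709551615 is exactly 2^64 - 1, the largest value an unsigned 64-bit integer can hold, and also what -1 becomes when converted to that type. The sketch below only checks that arithmetic. The report does not show which descriptor was queried, so reading the value as a "-1" or "no limit" sentinel is an assumption, and `looks_like_sentinel` is a hypothetical helper, not part of any SYCL API.

```cpp
#include <cstdint>
#include <limits>

// The value from the report, written as an unsigned 64-bit literal.
constexpr std::uint64_t reported_value = 18446744073709551615ULL;

// It equals the largest value representable in an unsigned 64-bit integer ...
static_assert(reported_value == std::numeric_limits<std::uint64_t>::max(),
              "reported value is UINT64_MAX");
// ... which is also the result of converting -1 to that type.
static_assert(reported_value == static_cast<std::uint64_t>(-1),
              "reported value is -1 converted to uint64_t");

// Hypothetical helper (an assumption for illustration): flags a query result
// that is more likely a sentinel than a real device property.
bool looks_like_sentinel(std::uint64_t value) {
    return value == std::numeric_limits<std::uint64_t>::max();
}
```

A check like this could guard a log statement so that an all-ones result is reported as "unknown" instead of being printed as a size.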
**To Reproduce**
e.g.
```cpp
spdlog::info("Device: {}", Globals::Queue.get_device().get_info());
auto globa…
-
When building MueLu with the OpenMP backend and the serial backend disabled, we ran into this error:
```...
from .../Trilinos/packages/muelu/src/Utils/MueLu_AggregationExportFactory…
-
**Describe the bug**
I'm trying to install Kokkos on Windows with OpenMP.
Here are my CMake commands:
```
cmake -S ".." -B . -DCMAKE_INSTALL_PREFIX="..\install_dir_openmp" -DKokkos_ENABLE_OPENMP=…
-
### System Information
OpenCV version: 4.5.5 & 4.7.0
Operating System / Platform: Ubuntu 22.04
Compiler & compiler version: GCC 11.3.0
### Detailed description
OpenMP parallel backend is ne…
-
Some of the items to refactor/address:
- [x] `parallel_for(RangePolicy)` using `#pragma omp parallel` for schedule(dynamic/static)
- [x] `parallel_for(MDRangePolicy)` using `#pragma omp parallel` f…
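
The two schedules named in the checklist can be illustrated with a minimal sketch in plain OpenMP. This is not Kokkos code: `for_each_index_static`, `for_each_index_dynamic`, and `squares` are hypothetical names chosen for illustration, and the combined `parallel for` pragma is an assumption about how a range loop would typically be written. If the compiler is invoked without OpenMP support, the pragmas are ignored and the loops run serially with the same result.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for a range-based parallel loop: calls f(i) for every
// i in [begin, end), with iterations divided among threads in fixed chunks.
template <class F>
void for_each_index_static(long begin, long end, F f) {
    #pragma omp parallel for schedule(static)
    for (long i = begin; i < end; ++i) {
        f(i);
    }
}

// Same loop, but iterations are handed out to threads on demand, which can
// help when the cost per iteration varies.
template <class F>
void for_each_index_dynamic(long begin, long end, F f) {
    #pragma omp parallel for schedule(dynamic)
    for (long i = begin; i < end; ++i) {
        f(i);
    }
}

// Fills out[i] = i * i; each iteration writes a distinct element, so the
// loop body needs no synchronization under either schedule.
std::vector<long> squares(long n, bool use_dynamic) {
    std::vector<long> out(static_cast<std::size_t>(n));
    auto body = [&out](long i) { out[static_cast<std::size_t>(i)] = i * i; };
    if (use_dynamic) {
        for_each_index_dynamic(0, n, body);
    } else {
        for_each_index_static(0, n, body);
    }
    return out;
}
```

Both schedules must produce the same output here; they differ only in how iterations are assigned to threads.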
-
We have to figure out the best approach.
One approach is using our (to be implemented) MLIR backend (#4319) and possibly using an OpenMP MLIR dialect that AMD compilers might be able to offload.
…
-
### Description
I used joblib to speed up sklearn’s logistic regression, but didn’t actually get a performance improvement.
When I use a documented case (https://docs.ray.io/en/latest/ray-more-libs/joblib.html…