-
It looks like our vectorisation strategy is to have some in-loop reduction/dependencies for a simple reduction like this:
```
for (int i = 0; i < N; i++) {
    sum += a[i];
}
```
Because w…
-
## Overview
All operations on arrays should be checked to see whether they can be vectorised, i.e. rewritten with NumPy functions. Vectorised functions are typically faster than non-vectorised functions,…
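
As a rough illustration of what "rewritten with NumPy functions" can mean, here is a minimal sketch that computes a sum of squares twice: once with an explicit Python loop and once with a single NumPy call. The function names `sum_of_squares_loop` and `sum_of_squares_vectorised` are hypothetical and chosen only for this example; they do not come from the project.

```python
import numpy as np


def sum_of_squares_loop(values):
    """Non-vectorised version: iterate element by element in Python."""
    total = 0.0
    for v in values:
        total += v * v
    return total


def sum_of_squares_vectorised(values):
    """Vectorised version: let NumPy do the whole reduction in one call."""
    arr = np.asarray(values, dtype=float)
    return float(np.dot(arr, arr))


# Hypothetical sample data for the comparison: 0, 1, 2, 3, 4
data = np.arange(5, dtype=float)
loop_result = sum_of_squares_loop(data)
vec_result = sum_of_squares_vectorised(data)
```

Both versions should agree on the result. The vectorised one moves the loop into compiled NumPy code, which is usually where the speed-up comes from on large arrays.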
-
This is either a follow-up to #188 or a bug report: I have been using quadpy for some time now to compute line integrals of vectorised functions adaptively using `quadpy.line_segment.integrate_adaptiv…
-
To ensure we exploit vectorisation, we need to call the `forfft` routine on multiple vectors.
-
Hi. I am new to using GPUs. I am working on adversarial machine learning, and earlier I used the TextAttack library for one of my projects with scikit-learn and Keras models. For that I created the cus…
-
Hi. I noticed an issue with the argument passing that has been introduced in https://github.com/inducer/loopy/pull/631 (I believe).
When we inline an inner kernel in a wrapper kernel, where the i…
-
GCC 12 vectorises the statements in both the outer and inner loops. Clang doesn't do any vectorisation. As a result, we are about 90% behind for kernel s235 in TSVC.
Compile this input with `-O3 -mcp…
-
[Implements #13334 ]
As per https://discourse.llvm.org/t/rfc-scalable-vectorisation-in-linalg/, add support for scalable tile sizes:
```mlir
transform.structured.tile %0 [2, [4]]
```
and sca…
-
I just found a bug because of a wrong `_vectorisation_idx`. And this is not the first time. I guess I never really knew how to treat it. So this should be checked once again.
**EDIT**
And same goe…
-
### Proposal
To encourage the use of Gymnasium and build up the RL community, I propose creating a wide range of tutorials.
This is a list of tutorials that could be made:
- [x…