-
At the moment the vectorisation lesson is very short and simply introduces the idea of not looping over arrays in Python.
It would be good to extend that lesson to talk more broadly about how to th…
-
add a progress line to say __% completed in accumulate function
vectorisation to speed up - instead of for loops
masking
## Problem A - vectorise the pre-processing
### step 1
- read the nump…
-
| | |
| --- | --- |
| Bugzilla Link | [31691](https://llvm.org/bz31691) |
| Version | 3.9 |
| OS | Linux |
| CC | @lesshaste,@hfinkel,@joker-eph,@RKSimon |
## Extended Description
When running the…
-
**Is your feature request related to a problem? Please describe.**
At the moment we have a rather spaghetti-based situation whereby the user specifies which functor they would like compiled. At the m…
-
To ensure we exploit vectorisation, need to call the forfft routine on multiple vectors.
-
GCC12 vectorises the statements in both the outer and inner loop. Clang doesn't do any vectorisation. As a result, we are about 90% behind for kernel s235 in TSVC.
Compile this input with `-O3 -mcp…
-
This is either a follow-up to #188 or a bug report: I have been using quadpy for some time now to compute line integrals of vectorised functions adaptively using `quadpy.line_segment.integrate_adaptiv…
-
We've noticed that Kokkos 2.8.00 and Intel 2019 does not produce vectorised code on KNL for CloverLeaf. If we build everything for SKX, we *do* see vectorisation occurring.
## Building Kokkos
We b…
-
While doing some testing I noticed that density, cdf and quantile sometimes return vectors, sometimes lists:
```r
library(distributional)
density(dist_normal(), c(0,1)) # re…
-
We are generating a lot of code with Clang for a loop that contains an if-then statement resulting in predicated instructions, which don't seem to be necessary looking at GCC's codegen. For this kerne…