-
Sorry if this is the wrong place to raise this; I am very new to WebAssembly, so please forgive my ignorance. 😅
WebAssembly is very exciting for those of us in scientific domains as it opens up the w…
-
Improve throughput by using multi-threading ([rayon](https://docs.rs/rayon/)).
In most cases, the SIMD functions are fast enough that we may not benefit from multi-threading. And some functions are…
-
requirements copied from parent #28
- usability
- suitable licence
- platform support (Linux/Mac/Win; x86_64)
- convenient installation/bundling with nupic
- functionality
- SSE instructions
…
-
### Proposed new feature or change:
Add CPU feature detection for SVE2. On wide CPU cores the Scalable Vector Extension has the potential to increase performance manyfold compared to NEON.
SVE2 is…
-
Here are potential tasks concerning parallel backends, please add more as you find new ideas or issues
## Vectorization and parallelism
- [ ] Make @guvectorize and @vectorize being able to paral…
-
The [Heterogeneous parallel programming with open standards using oneAPI and Data Parallel C++](https://www.w3.org/2020/06/machine-learning-workshop/talks/heterogeneous_parallel_programming_with_open_…
-
128-bit SIMD.js leverages a broad convergence across architectures. SSE through SSE4.2, NEON, Altivec, MSA, all largely lined up at 128-bit SIMD registers, mostly IEEE-754, a lot of commonality in the…
-
Hello,
Many thanks for this crate; it looks nice. My only question is whether it is SIMD-accelerated and parallelized, as it is in the AvxWindowedFMindex paper (your second citation), the AvxWindowedFMindex …
-
When compiling, I get the following error:
```
In file included from /net/hciserver03/storage/jschnell/software/gccbin/lib/gcc/x86_64-pc-linux-gnu/8.2.0/include/immintrin.h:41,
f…
```
-
We have a number of tests that are in C but still need to be written in Fortran:
For 4.5
- [x] test_target_enter_exit_data_depend.c (PR Merged)
- [ ] test_target_enter_exit_data_map_global_array.…