-
I would like to propose a small change regarding CMake support so it can be more easily integrated into projects using `CMake` and more importantly use `targets` as it propagates the include path and …
-
I followed the documentation to run the llama2-7b model (4-bit quantized) and also ran it on llama.cpp for comparison. I noticed that, except for nt=1, where there was a slight performance improvement…
-
- [x] Get to know architecture
- [x] Study programming interface
- [x] Coding
- [ ] Testing
-
Looking at your popcount code in
https://github.com/rizkg/BBHash/blob/6bb97c4218198d3e5dd60c7eadb5267a79959a6d/BooPHF.h#L170-L189
it's perhaps worthwhile noting that you could speed-up your popc…
-
We don't have a `core::arch::{x86_64,x86}` intrinsic for [`_mm_clflushopt`](https://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html#text=clflushopt&ig_expand=769). I think this would …
-
So, Intel recently released the Intel Key Locker specification, defining new functionality within new Intel CPUs for the AES cryptographic domain. These are the AESENC*KL, AESENCWIDE*KL, AESDEC*KL, AE…
-
The Toolchain Recommendations document specifies, for the purposes of ensuring consistency, numerous portions of a Clever-ISA toolchain not part of other documents (such as the Assembly Syntax Recomme…
-
-
Hi,
First of all - thanks for creating (and open-sourcing) this swift code! Looks great!
I was looking through the SIMD wrappers for `AVX512F` in `vector.h` and I noticed a few wrappers that re…
-
Hi
The HIP code of miniMDock (https://github.com/ORNL-PE/miniMDock/tree/sycl_dev ) is working perfectly in AMD systems, but on Intel-PVC systems, it builds successfully but giving wrong results. Th…