-
### Operating system
macos
### Compiler
apple-clang
### Steps to reproduce the behavior
```Shell
Run `vcpkg install libpng` with following triplet on arm64 ([https://github.blog/2023-1…
-
We recently switched to testing openBLAS on a project and are noticing some test case failures due to a matrix multiplication operation returning an incorrect result.
This issue has been observed …
-
Need all of:
- [x] div_euclid/rem_euclid
- [x] clamp
- [x] max/min
- [ ] rotate_left/rotate_right
- [x] swap_bytes/reverse_bits
- [x] saturating_add/saturating_sub
- [x] saturating_neg/saturati…
-
I think of different 2D convolution approach for signigically speed-up this plugin.
One possible speed-up of 2D-convolution with typical 8 or even 10 bit unsigned integer input data: To make not ke…
-
Paint.net always crashed upon opening, with the recent version (2024.8.15). Such error had not occured in the old version (2023.5.31 or before), so maybe it has something to do with this update?
Ex…
-
局面評価にSIMDを使いたい。
-
Not entirely sure what ive done here, kinda new to this and cant seem to install Torch, my end goal is to run this: https://github.com/karpathy/char-rnn because im interested. I used the instructions …
-
This is a spinoff of vectorisation issue #71 and a followup to the big PR #171.
(A preliminary observation: the clang vectorization still needs a cross-check, see #172)
A couple of observations …
-
we have code like this:
``` cpp
void my_memcpy(char * __restrict dst, const char * __restrict src, ssize_t n)
{
while (n > 0)
{
_mm_storeu_si128(reinterpret_cast(dst),
…
-
**Update:** (@embray) before anyone else comments on this issue please see this comment: https://github.com/sagemath/sage-windows/issues/57#issuecomment-841135674
On 1 of my 3 Windows computers, pl…