-
```
OpenJPEG already uses SIMD code (only SSE) when __SSE__ is defined.
This is the case for x64 code by default (not under Visual Studio though)
because x64 CPUs all have SSE/SSE2. This is also the …
-
```
Description:
As title.
```
Original issue reported on code.google.com by `wuye9036` on 14 Dec 2011 at 7:11
-
### What is the problem this feature will solve?
As I am looking into ways to improve astropy performance, I can see that using the new'ish compiler feature of function multi-versioning should improv…
-
If the server crashes please give log/stacktrace
> Did you change something about xp handling which broke hyperion?
> Huh
> Wtf it works now
> sus
-
clang 18.1 (homebrew) generates memset_pattern16 function calls when assigning a float to multiple Neon f32x4 elements in an array. This causes a serious performance regression in the scenario outline…
-
### Describe the issue
Not sure you are also maintaining cdnjs (and other cdns) but I have opened a PR https://github.com/cdnjs/packages/pull/1870
starting v 1.19.0 the `/dist` in [npmjs](https://…
-
C++23 has `views::chunk`, which converts a view into a range of views of some size, plus a remainder. This is essentially what `simd_split` does on `std::simd`. Unfortunately the name `split` in `simd…
-
Discussed on Zulip: https://rust-lang.zulipchat.com/#narrow/stream/257879-project-portable-simd/topic/simd.3A.3AMask.20codegen.20on.20avx512
I tried this code ([Godbolt link](https://gcc.godbolt.or…
-
```
[ 79%] Building CXX object BaikalStandalone/CMakeFiles/BaikalStandalone.dir/Application/application.cpp.o
In file included from /usr/include/GL/glx.h:30:0,
from /home/User/Docu…
-
### Description
Test code:
```swift
func test_stdlib_8(_ input: SIMD8) -> Bool {
any(input .== SIMD8(repeating: 0x42))
}
```
Building this with `-O` produces:
```assembly
.LCPI1_0:
…
karwa updated
7 months ago