-
Enabling core.simd:
- [x] We can enable core.simd usage with DMD today, without even using D_SIMD, which narrows the performance gap between LDC and DMD from 20x to 4x. DMD binaries that make heavy usage o…
-
I'm compiling Ozz Animation for AVX and I noticed in simd_math_sse-inl.h that only OZZ_SHUFFLE_PS1() is specialized for AVX. I thought there might be opportunities to implement > SSE2 intrinsics …
-
AVX-512 has some nice features, such as support for fast float16 operations. This might allow us to do rescoring very fast.
The Quicker ADC paper also mentions some uses of AVX-512: https://arxiv.org…
-
Trying to build current git master (a37d4836519517bdce6cb9d956092321eca3e73b) as a [Universal Binary](https://en.wikipedia.org/wiki/Universal_binary) (or even just arm64 only) on an Intel Mac fails to …
-
about branch: *master*
### Background
So my question is about "extending" Vc, i.e. wrapping some intrinsics into Vc functions. So, say, I have some C functions:
```cpp
extern "C" {
#if defined(…
-
All architectures currently targeted by the library have instructions to perform a single NR-iteration of a reciprocal square root (and some also have an exact computation):
* x86, x86_64: rsqrtps (SSE…
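For readers unfamiliar with the term, a single NR (Newton-Raphson) iteration for a reciprocal square root can be sketched in portable scalar C++ as below. This is only an illustration under stated assumptions, not the library's code: `rsqrt_estimate` and `rsqrt_nr_step` are hypothetical names, and the coarse estimate is simulated in software rather than taken from a hardware instruction.

```cpp
#include <cassert>
#include <cmath>

// Hypothetical helper (not from any library): a deliberately coarse
// estimate of 1/sqrt(x), standing in for a hardware estimate instruction.
inline float rsqrt_estimate(float x) {
    assert(x > 0.0f);
    // Perturb the exact value by about 1% to mimic a low-precision result.
    return (1.0f / std::sqrt(x)) * 1.01f;
}

// Hypothetical helper: one Newton-Raphson step for f(y) = 1/y^2 - x,
// which refines an estimate y of 1/sqrt(x):
//   y' = y * (1.5 - 0.5 * x * y * y)
inline float rsqrt_nr_step(float x, float y) {
    return y * (1.5f - 0.5f * x * y * y);
}
```

Each step roughly squares the relative error of the estimate, which is why one iteration after a coarse hardware estimate is often considered sufficient for single precision.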
-
I'm creating a recipe of OpenMVG for the conan package manager and I had a hard time successfully cross-building from macOS Intel to macOS M1.
Currently OpenMVG automagically enables SSE2 intrinsics (f…
-
`sse2neon` aims to support SSE, SSE2, SSE3, SSSE3, SSE4.1, SSE4.2 and the AES extension; AVX intrinsics would be excluded.
@danlark1 pointed out:
> Technically speaking, `_mm_fmadd_ps` is not an S…
-
| | |
|--------------------|----|
| Bugzilla Link | [PR31446](https://bugs.llvm.org/show_bug.cgi?id=31446) |
| Status | NEW |
| Importance | P enhancemen…
-
| | |
|--------------------|----|
| Bugzilla Link | [PR14268](https://bugs.llvm.org/show_bug.cgi?id=14268) |
| Status | NEW |
| Importance | P enhancemen…