-
I encountered a problem with GCC in function silk_NSQ_del_dec_neon and silk/arm/NSQ_del_dec_neon_intr, which invokes undefined behavior [-Waggressive-loop-optimizations]. The errors occurred with Ra…
-
Given this code:
```c++
#include <arm_neon.h>
void f (uint64x2_t *__restrict__ y, uint32x2_t x[4]) {
  for (int i = 0; i < 4; ++i) {
    for (int j = 0; j < 4; ++j) {
      y[i * 4 + j] = vmull_u32(x[i], …
```
-
My app performs many small dgemms, each invoked by a separate thread (via a task pool). As recommended, I compiled OpenBLAS 3.10 with USE_THREAD=0 and USE_LOCKING=1. This is on Cavium ThunderX2 with…
-
| | |
|--------------------|----|
| Bugzilla Link | [PR27107](https://bugs.llvm.org/show_bug.cgi?id=27107) |
| Status | NEW |
| Importance | P normal |
|…
-
Since #851 seems to have been resolved by the extendr dev version, we should be able to revert the workaround in #874.
-
Hi,
Is ARM64 support for Android planned for the future? Or is it already possible? I did not see it...
-
Continuing from #22
> >The problem is, that doesn’t work with this code. We immediately get an illegal instruction error. You can see this, e.g. if you pull down the latest piscem from bioconda.
…
-
| | |
|--------------------|----|
| Bugzilla Link | [PR43719](https://bugs.llvm.org/show_bug.cgi?id=43719) |
| Status | NEW |
| Importance | P normal |
|…
-
I'm experimenting with making a project build as a Universal Binary, and of course that means our dependencies would ideally be built that way as well. (I know there are solutions to avoid this, but h…
-
There are saturating doubling multiply-add instructions in NEON and SVE, `vqdmla` and `svqdmla`, which are equivalent to `hn::SaturatedAdd(hn::Mul(2, hn::Mul(a_real, b_imag)),hn::Mul(2, hn::Mul(a_imag, …