arm-sve Search Results - Githubissues

1000+ results
for arm-sve

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

llvm/llvm-project #97783

[TBAA] "long long" type causes incorrect optimization in GVN

There is a simple case, I get total different results with `long` type and `long long` type, as shown, **long type** ``` #include #include const int SIZE = 16; int main() { int datas[S…

huhu233 updated 3 months ago
4
llvm/llvm-project #60628

Remove RUN line duplication in mlir/test/Integration/Dialect…

When extending SparseCompiler integrations tests to run via SVE codegen (see https://reviews.llvm.org/D143514 and https://reviews.llvm.org/D121304), we effectively duplicated many of the RUN lines. …

banach-space updated 1 year ago
2
flame/blis #616

armsve: generic kernel and default cache values

As a user running on a node based on neoverse-v1 design, I'd like to us the armsve kernels with a better performance level than the neon-based ones. This issue is a follow' up of https://github.com…

egaudry updated 2 years ago
11
iree-org/iree-test-suites #2

Import matmul/conv/attention tests from iree/tests/e2e/

Could start with what already exists, including the C++ binaries files and CMake build system: * https://github.com/iree-org/iree/tree/main/tests/e2e/matmul * https://github.com/iree-org/iree/tree…

ScottTodd updated 3 months ago
3
dotnet/runtime #94425

[API Proposal]: Arm64: FEAT_SVE_SHA3

```csharp namespace System.Runtime.Intrinsics.Arm /// VectorT Summary public abstract class SveSha3 : AdvSimd /// Feature: FEAT_SVE_SHA3 { /// T: long, ulong public static unsafe Vector …

a74nh updated 3 months ago
2
spack/spack #31675

Installation issue: OpenBLAS on A64FX with Fujitsu compiler

### Steps to reproduce the issue ```console $ spack spec -l openblas@0.3.20%fj@4.8.0+ilp64 symbol_suffix=64_ Input spec -------------------------------- openblas@0.3.20%fj@4.8.0+ilp64 symbol_su…

giordano updated 1 month ago
1
dougallj/asil #4

Possible to add timing info?

This site is awesome (especially compared to Arm's official stuff which is impossible). Would it be possible to add frequency and throughput for instructions for some of the common architectures (like…

oscardssmith updated 2 weeks ago
1
llvm/llvm-project #103481

[AArch64] On Neoverse V2, transform ld4 into ld2 + uzp* sequ…

https://godbolt.org/z/K17nh31oG shows that `ld4` instruction could be transformed into `ld2 + uzp*` sequences and give equivalent program output (at least on little-endian systems) According to Neo…

mingmingl-llvm updated 3 months ago
8
dotnet/runtime #94426

[API Proposal]: Arm64: FEAT_SVE_SM4

```csharp namespace System.Runtime.Intrinsics.Arm /// VectorT Summary public abstract class SveSm4 : AdvSimd /// Feature: FEAT_SVE_SM4 { public static unsafe Vector Sm4EncryptionAndDecrypti…

a74nh updated 3 months ago
2
llvm/llvm-project #61363

[AArch64] Vectorize `memcmp/bcmp` expansion

Expansions of `memcmp(s1, s2, n)` for static `n

Kmeakin updated 1 year ago
1

上一页 1...3 4 5 6 7 8 9...100 下一页

1000+ results for arm-sve

1000+ results
for arm-sve