vnni Search Results - Githubissues

1000+ results
for vnni

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

tesseract-ocr/tesseract #3512

Use Intel VNNI for int dot product

There are two variants: * AVX512_VNNI (Tiger Lake, Rocket Lake) - 512bit/256bit/128bit * AVX_VNNI - (upcoming Alder Lake) - 256bit/128bit VNNI replaces 3 simd instructions with one instruction. …

amitdo updated 2 years ago
17
google/deepsomatic #33

ONT failed

Hi, I used deepsomatic(v.1.6.1) to call somatic mutations using ONT reads. But the process failed. My parameter is --model_type=ONT_R104 **errors:** ``` 2024-12-01 05:55:20.276827: I tensorflow/core/…

DayTimeMouse updated 14 hours ago
2
tensorflow/tensorflow #76718

Segmentation fault (core dumped) in `tf.profiler.experimenta…

### Issue type Bug ### Have you reproduced the bug with TensorFlow Nightly? Yes ### Source source ### TensorFlow version 2.18.0-dev20240925 ### Custom code Yes ### OS platform and distributi…

x0w3n updated 1 month ago
1
tesseract-ocr/tesseract #3895

Use Arm Neon equivalent instructions to Intel VNNI

... for int dot product.

amitdo updated 2 years ago
3
llvm/llvm-project #97271

[X86] VNNI intrinsics argument types don't match the actual …

For example: `__m128i _mm_dpbusd_avx_epi32 (__m128i src, __m128i a, __m128i b)` This takes 1 x "src" and 2 x "a * b" multiplication inputs but the clang/llvm intrinsics are defined as: ``` TA…

RKSimon updated 5 months ago
1
neurospin/HSF #1

Sparse-quantized model runs without VNNI acceleration

**Describe the bug** Hi Dr @clementpoiret! Now that you have graduated :tada: here is a technical issue to keep you busy :wink: On a workstation with AVX512 and VNNI CPU capabilities, I am gett…

ylep updated 1 year ago
5
intel/graph-compiler #320

bf16 matmul's corresponding `tensor.pack` not properly optim…

Currently, the following 2 single-layer MLP have worst performance compared with GC v1. dtype | batch size | hidden list | GC V1 | 8c55a0544 remove brgemm read lock …

yifeizh2 updated 2 months ago
4
libxsmm/libxsmm #753

GEMM tester (xgemm) has convoluted format specification, not…

Right now we the xgemm driver has only 2 flags for formats (a trans and b trans). However with latest additions we have various VNNI factors and formats. We therefore we need at least 6 flags: a-tr…

alheinecke updated 1 year ago
1
plaidml/tpp-mlir #829

Update libxsmm-dnn with new argument for VNNI^T

As noted here: https://github.com/libxsmm/libxsmm-dnn/issues/29#issuecomment-1871502920

rengolin updated 10 months ago
1
archspec/archspec-json #121

Add support for Emerald Rapids CPUs

Support for Intel Emerald Rapids CPUs would be useful. Here is the `lscpu` output for one such core; I will attempt to create a PR for adding this if I can determine everything needed. ``` process…

xandm updated 2 months ago
4

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for vnni

1000+ results
for vnni